Skip to content

Commit 09d34cf

Browse files
authored
Update README.md
1 parent 7ad0972 commit 09d34cf

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,11 +7,11 @@ What is Gradient Centralization? = "GC can be viewed as a projected gradient des
77

88
Ranger now uses Gradient Centralization by default, and applies it to all conv and fc layers by default. However, everything is customizable so you can test with and without on your own datasets. (Turn on off via "use_gc" flag at init).
99
</br>
10-
# Best training results - use a 75% flat lr, then step down and run lower lr for 25%, or cosine descend last 25%.
10+
### Best training results - use a 75% flat lr, then step down and run lower lr for 25%, or cosine descend last 25%.
1111
</br>
1212
</br> It's important to note that simply running one learning rate the entire time will not produce optimal results. Effectively Ranger will end up 'hovering' around the an optimal zone but can't descend into it unless it has some additional run time at a lower rate to drop down into the optimal valley.
1313

14-
# Full customization at init:
14+
### Full customization at init:
1515

1616
![](ranger-with-gc-options.jpg)
1717
</br>

0 commit comments

Comments
 (0)