Skip to content

Commit d8b4c34

Browse files
authored
Update README.md
1 parent a501499 commit d8b4c34

File tree

1 file changed

+11
-0
lines changed

1 file changed

+11
-0
lines changed

README.md

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,17 @@
1212

1313
## What's New
1414

15+
## Oct 16, 2025
16+
* Add an impl of the Muon optimizer (based on https://github.com/KellerJordan/Muon) with customizations
17+
* extra flexibility and improved handling for conv weights and fallbacks for weight shapes not suited for orthogonalization
18+
* small speedup for NS iterations by reducing allocs and using fused (b)add(b)mm ops
19+
* by default uses AdamW (or NAdamW if `nesterov=True`) updates if muon not suitable for parameter shape (or excluded via param group flag)
20+
* like torch impl, select from several LR scale adjustment fns via `adjust_lr_fn`
21+
* select from several NS coefficient presets or specify your own via `ns_coefficients`
22+
* First 2 steps of 'meta' device model initalization supported
23+
* Fix several ops that were breaking creation under 'meta' device context
24+
* Add device & dtype factory kwarg support to all models and modules (anything inherting from nn.Module) in `timm`
25+
1526
## Sept 21, 2025
1627
* Remap DINOv3 ViT weight tags from `lvd_1689m` -> `lvd1689m` to match (same for `sat_493m` -> `sat493m`)
1728
* Release 1.0.20

0 commit comments

Comments
 (0)