1 file changed, +11 -0 lines changed
## What's New

## Oct 16, 2025
* Add an impl of the Muon optimizer (based on https://github.com/KellerJordan/Muon) with customizations
  * extra flexibility and improved handling for conv weights, plus fallbacks for weight shapes not suited for orthogonalization
  * small speedup for Newton-Schulz (NS) iterations by reducing allocs and using fused (b)add(b)mm ops
  * by default uses AdamW (or NAdamW if `nesterov=True`) updates if Muon is not suitable for the parameter shape (or the parameter is excluded via a param group flag)
  * like the torch impl, select from several LR scale adjustment fns via `adjust_lr_fn`
  * select from several NS coefficient presets or specify your own via `ns_coefficients`
* First 2 steps of 'meta' device model initialization supported
  * Fix several ops that were breaking creation under 'meta' device context
  * Add device & dtype factory kwarg support to all models and modules (anything inheriting from nn.Module) in `timm`
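
For readers unfamiliar with the NS step mentioned in the Muon notes above, here is a minimal sketch of a quintic Newton-Schulz orthogonalization written with fused `torch.addmm` calls. It is not the `timm` implementation: the function name `newton_schulz_sketch` is hypothetical, and the default coefficients are assumed to be those used in the KellerJordan/Muon reference code.

```python
import torch


def newton_schulz_sketch(G, steps=5, coefficients=(3.4445, -4.7750, 2.0315), eps=1e-7):
    """Approximately orthogonalize a 2D matrix (hypothetical helper, not timm's API)."""
    a, b, c = coefficients
    X = G.float()
    # Work on the wide orientation so X @ X.T is the smaller Gram matrix.
    transposed = X.size(0) > X.size(1)
    if transposed:
        X = X.t()
    # Scale so every singular value is at most 1 before iterating.
    X = X / (X.norm() + eps)
    for _ in range(steps):
        A = X @ X.t()
        # Fused: B = b * A + c * (A @ A)
        B = torch.addmm(A, A, A, beta=b, alpha=c)
        # Fused: X = a * X + B @ X
        X = torch.addmm(X, B, X, beta=a, alpha=1.0)
    if transposed:
        X = X.t()
    return X


torch.manual_seed(0)
update = newton_schulz_sketch(torch.randn(16, 32))
singular_values = torch.linalg.svdvals(update)
```

The result is only approximately orthogonal: the singular values end up close to 1 rather than exactly 1, which is the trade-off this iteration makes for speed. Batched inputs would use `torch.baddbmm` in place of `torch.addmm`.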

## Sept 21, 2025
* Remap DINOv3 ViT weight tags from `lvd_1689m` -> `lvd1689m` to match (same for `sat_493m` -> `sat493m`)
* Release 1.0.20
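The 'meta' device and factory kwarg notes above follow a general PyTorch pattern, sketched below with plain `torch.nn` only. `TinyBlock` is a hypothetical module, not part of `timm`, and the sketch does not try to spell out which initialization steps `timm` counts as supported.

```python
import torch
import torch.nn as nn


class TinyBlock(nn.Module):
    """Hypothetical module that forwards device/dtype factory kwargs to its children."""

    def __init__(self, dim, device=None, dtype=None):
        super().__init__()
        dd = {'device': device, 'dtype': dtype}
        self.norm = nn.LayerNorm(dim, **dd)
        self.fc = nn.Linear(dim, dim, **dd)
        # Tensors created directly in __init__ need the kwargs as well.
        self.scale = nn.Parameter(torch.ones(dim, **dd))

    def forward(self, x):
        return self.fc(self.norm(x)) * self.scale


# Explicit factory kwargs: parameters carry shape and dtype but no storage.
meta_block = TinyBlock(8, device='meta', dtype=torch.bfloat16)

# The same effect through a device context (PyTorch 2.0 or newer).
with torch.device('meta'):
    block = TinyBlock(8)
was_meta = block.fc.weight.is_meta

# Allocate real, uninitialized storage on the target device.
block.to_empty(device='cpu')
```

After `to_empty` the parameter values are uninitialized, so they still need to be filled, for example by loading a state dict, before the module is used.
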