Merkliste 
 1 Ergebnisse 
 
1

DeepNet: Scaling Transformers to 1,000 Layers:

Wang, Hongyu ; Ma, Shuming ; Dong, Li...
IEEE Transactions on Pattern Analysis and Machine Intelligence.  , 2024