Microsoft Research ブログ
読み込み中…

Microsoft Research ブログ
Swin Transformer supports 3-billion-parameter vision models that can train with higher-resolution images for greater task applicability
| Han Hu と Baining Guo
Early last year, our research team from the Visual Computing Group (opens in new tab) introduced Swin Transformer (opens in new tab), a Transformer-based general-purpose computer vision architecture that for the first time beat convolutional neural networks on the important…