[원문 링크]
https://arxiv.org/pdf/2103.14030
[참고 영상]
https://www.youtube.com/watch?v=SndHALawoag
[개념정리] Distributed Training (0) | 2025.02.25 |
---|---|
[논문리뷰] EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction (0) | 2025.02.20 |
[논문리뷰] Segment Anything (0) | 2025.02.18 |
[논문리뷰] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (0) | 2025.02.17 |
[용어정리] Attributes of Connections within Layer - hidden states (25.2.17) (0) | 2025.02.17 |