|
|
|
|
|
|
|
|
|
|
|
|
|
|
![]() |
| Model | Top-1 | Top-5 |
|---|---|---|
| TimeSformer-HR | 62.5 | - |
| ViViT-L | 65.4 | 89.8 |
| MViT-B | 67.1 | 90.8 |
| Motionformer-L | 68.1 | 91.2 |
| Model | Top-1 | Top-5 |
|---|---|---|
| TimeSformer-HR | 80.7 | 94.7 |
| MViT-B | 81.2 | 95.1 |
| ViViT-L | 81.3 | 94.7 |
| Motionformer-HR | 81.1 | 95.2 |
|
![]() |
Mandela Patrick, Dylan Campbell, Yuki Asano, Ishan Misra, Florian Metze, Christoph Feichtenhofer, Andrea Vedaldi, Joao Henriques Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers ArXiv |