- For all the pretraining and finetuning, we adopt spaese/uniform sampling.
-
#Frame
$=$ #input_frame
$\times$ #crop
$\times$ #clip
-
#input_frame
means how many frames are input for model per inference -
#crop
means spatial crops (e.g., 3 for left/right/center) -
#clip
means temporal clips (e.g., 4 means repeted sampling four clips with different start indices)
Model | Setting | Model | Shell |
---|---|---|---|
|
K-Mash-1.1M 300e | 🤗 HF link | run.sh |
|
K-Mash-2M 300e | TBD | run.sh |
Model | Setting | Teacher | Model | Shell |
---|---|---|---|---|
|
K-Mash-1.1M 100e |
|
🤗 HF link | run.sh |
|
K-Mash-1.1M 100e |
|
🤗 HF link | run.sh |
|
K-Mash-1.1M 100e |
|
🤗 HF link | run.sh |
Model | Setting | #Frame | Top-1 | Model | Shell |
---|---|---|---|---|---|
|
K-Mash PT | 8x3x4 | 87.6 | 🤗 HF link | run.sh |
|
K-Mash PT | 8x3x4 | 88.1 | TBD | run.sh |
|
K-Mash PT | 8x3x4 | 79.6 | 🤗 HF link | run.sh |
|
K-Mash PT | 8x3x4 | 83.5 | 🤗 HF link | run.sh |
|
K-Mash PT | 8x3x4 | 86.2 | 🤗 HF link | run.sh |
Model | Setting | #Frame | Top-1 | Model | Shell |
---|---|---|---|---|---|
|
K-Mash PT + K710 FT | 8x3x4 | 91.3 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 16x3x4 | 91.6 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 91.9 | TBD | run.sh |
|
K-Mash PT + K710 FT | 16x3x4 | 92.1 | TBD | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 85.4 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 88.4 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 90.4 | 🤗 HF link | run.sh |
Model | Setting | #Frame | Top-1 | Model | Shell |
---|---|---|---|---|---|
|
K-Mash PT + K710 FT | 8x3x4 | 91.4 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 16x3x4 | 91.6 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 91.7 | TBD | run.sh |
|
K-Mash PT + K710 FT | 16x3x4 | 91.9 | TBD | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 86.0 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 88.9 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 90.6 | 🤗 HF link | run.sh |
Model | Setting | #Frame | Top-1 | Model | Shell |
---|---|---|---|---|---|
|
K-Mash PT + K710 FT | 8x3x4 | 85.0 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 16x3x4 | 85.4 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 85.7 | TBD | run.sh |
|
K-Mash PT + K710 FT | 16x3x4 | 85.9 | TBD | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 75.7 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 80.5 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT | 8x3x4 | 83.5 | 🤗 HF link | run.sh |
Model | Setting | #Frame | Top-1 | Model | Shell |
---|---|---|---|---|---|
|
K-Mash PT + K710 FT + K400 FT | 8x3x4 | 50.8 | 🤗 HF link | run.sh |
|
K-Mash PT + K710 FT + K400 FT | 8x3x4 | 51.0 | TBD | run.sh |
|
K-Mash PT + K710 FT + K400 FT | 8x3x4 | 51.2 | TBD | run.sh |
Model | Setting | #Frame | Top-1 | Model | Shell |
---|---|---|---|---|---|
|
K-Mash PT | 8x3x4 | 68.5 | 🤗 HF link | run.sh |
|
K-Mash PT | 8x3x4 | 69.7 | TBD | run.sh |
Model | Setting | #Frame | Top-1 | Model | Shell |
---|---|---|---|---|---|
|
K-Mash PT | 8x3x4 | 77.1 | 🤗 HF link | run.sh |
|
K-Mash PT | 8x3x4 | 77.5 | TBD | run.sh |
|
K-Mash PT | 8x3x4 | 71.6 | 🤗 HF link | run.sh |
|
K-Mash PT | 8x3x4 | 73.5 | 🤗 HF link | run.sh |
|
K-Mash PT | 8x3x4 | 76.4 | 🤗 HF link | run.sh |
Model | Setting | #Frame | Top-1 | mAP | Model | Shell |
---|---|---|---|---|---|---|
|
K-Mash PT + K710 FT + K400 FT | 8x3x4 | 95.9 | 98.2 | TBD | run.sh |
Model | Setting | #Frame | Top-1 | mAP | Model | Shell |
---|---|---|---|---|---|---|
|
K-Mash PT + K710 FT + K400 FT | 8x3x4 | 97.0 | 98.8 | TBD | run.sh |