video classification models

3 models · ranked by HuggingFace downloads

videomae-small-finetuned-kinetics-xd-violence-binary

A VideoMAE-Small model fine-tuned on XD-Violence, a multi-scene violence detection dataset covering realistic violent video clips from films and surveillance footage. The model performs binary video classification (violent/non-violent) using temporal self-supervised pre-training. VideoMAE's masked autoencoder approach requires fewer labelled examples than supervised-only baselines for video tasks.

391,007 ↓ · 0 ♡

vjepa2-vitg-fpc64-256

vjepa2-vitg-fpc64-256 is a ViT model available on HuggingFace without a declared task pipeline. Consult the model card for intended use cases and fine-tuning instructions.

377,363 ↓ · 53 ♡

kandinsky-videomae-large-camera-motion

kandinsky-videomae-large-camera-motion has no registered pipeline_tag. It likely serves as a pretraining base or a specialized evaluation model — review the model card before use.

323,151 ↓ · 5 ♡