Model card for fastvit_mci2.apple_mclip2_dfndr2b

A MobileCLIP v2 (image encoder only) for timm. Equivalent to image tower from https://huggingface.co/timm/MobileCLIP2-S2-OpenCLIP.

Model Details

Dataset: DFNDR-2B
Papers:
- MobileCLIP2: Improving Multi-Modal Reinforced Training: https://arxiv.org/abs/2508.20691

Citation

@article{faghri2025mobileclip2,
          title={MobileCLIP2: Improving Multi-Modal Reinforced Training},
          author={Faghri, Fartash and Vasu, Pavan Kumar Anasosalu and Koc, Cem and Shankar, Vaishaal and Toshev, Alexander and Tuzel, Oncel and Pouransari, Hadi},
          journal={arXiv preprint arXiv:2508.20691},
          year={2025}
        }

Downloads last month: 59

Inference Providers NEW

Image Feature Extraction

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including timm/fastvit_mci2.apple_mclip2_dfndr2b

MobileCLIP-2

Collection

OpenCLIP / timm ports of Apple's MobileCLIP-2 multi-modal and image encoders • 12 items • Updated Sep 19, 2025 • 1

Paper for timm/fastvit_mci2.apple_mclip2_dfndr2b

MobileCLIP2: Improving Multi-Modal Reinforced Training

Paper • 2508.20691 • Published Aug 28, 2025 • 7