Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Shail Shah's picture

Shail Shah

shail-2512

SethTharo's profile picture

shtefcs's profile picture

hotmailuser's profile picture

·

AI & ML interests

None yet

Organizations

shail-2512 's collections 14

MultiModal (Any-to-Any)

gpt-omni/mini-omni2

Any-to-Any • Updated Oct 24, 2024 • 65 • 283
BAAI/Emu3-Gen

Any-to-Any • 8B • Updated Oct 23, 2024 • 2.15k • 226

nvidia/Hymba-1.5B-Instruct

Text Generation • 2B • Updated Jan 2, 2025 • 731 • 246

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12, 2025 • 1.18M • • 2.01k
Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated Jan 12, 2025 • 1.8M • • 688
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF

33B • Updated Nov 15, 2024 • 4.08k • 77
deepseek-ai/DeepSeek-Coder-V2-Instruct

Text Generation • 236B • Updated Aug 21, 2024 • 5.35k • 682

Image Generation

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 697k • • 12.7k
black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 732k • • 4.79k
stabilityai/stable-diffusion-3.5-large

Text-to-Image • Updated Oct 22, 2024 • 54.1k • • 3.43k

openai/shap-e

Text-to-3D • Updated Dec 11, 2023 • 5.46k • 265
tencent/Hunyuan3D-1

Image-to-3D • Updated Oct 17, 2025 • 1.65k • 309

Speech Recognition

nvidia/canary-1b

Automatic Speech Recognition • Updated Dec 3, 2025 • 1.53k • 457
facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 71.2k • 971
nyrahealth/CrisperWhisper

Automatic Speech Recognition • 2B • Updated 12 days ago • 27.2k • 327
openai/whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 6.63M • • 2.95k

Reranking Models

jinaai/jina-reranker-v2-base-multilingual

Text Ranking • 0.3B • Updated Oct 21, 2025 • 1.19M • 349
BAAI/bge-reranker-v2-m3

Text Classification • 0.6B • Updated Jun 24, 2024 • 7.36M • • 952

ALMs (Audio Language Models)

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • Updated Jan 12, 2025 • 308k • 527

parler-tts/parler-tts-large-v1

Text-to-Speech • 2B • Updated Nov 22, 2024 • 8.29k • 273
OuteAI/OuteTTS-0.2-500M

Text-to-Speech • Updated Apr 17, 2025 • 488 • 310
suno/bark

Text-to-Speech • Updated Oct 4, 2023 • 16.8k • 1.52k

Reasoning (LRMs)

Qwen/QwQ-32B-Preview

Text Generation • 33B • Updated Jan 12, 2025 • 7.97k • • 1.74k
AIDC-AI/Marco-o1

Text Generation • 8B • Updated Nov 23, 2024 • 3.06k • • 712

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 28.2k • 583
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 252 • 1.71k
vidore/colsmolvlm-v0.1

Visual Document Retrieval • Updated Mar 14, 2025 • 53 • 55
meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 195k • 1.58k

Video Generation

zai-org/CogVideoX1.5-5B-I2V

Image-to-Video • Updated Mar 18, 2025 • 6.02k • 117
zai-org/CogVideoX-2b

Text-to-Video • Updated Nov 23, 2024 • 27.7k • 360
zai-org/CogVideoX-5b

Text-to-Video • Updated Nov 23, 2024 • 35.1k • • 672
genmo/mochi-1-preview

Text-to-Video • Updated Sep 4, 2025 • 5.32k • • 1.32k

Dataset to fine-tune Embeddings

sentence-transformers/pubmedqa

Viewer • Updated Jun 19, 2024 • 35.4k • 72 • 3
sujet-ai/Sujet-Financial-RAG-EN-Dataset

Viewer • Updated Jul 28, 2024 • 106k • 185 • 11
enelpol/rag-mini-bioasq

Viewer • Updated Jun 27, 2024 • 44.9k • 566 • 13
philschmid/finanical-rag-embedding-dataset

Viewer • Updated Jun 3, 2024 • 7k • 47 • 20

Embedding Models

nomic-ai/nomic-embed-text-v1.5

Sentence Similarity • 0.1B • Updated 12 days ago • 12.1M • 799
BAAI/bge-m3

Sentence Similarity • Updated Jul 3, 2024 • 16.3M • • 2.93k

MultiModal (Any-to-Any)

gpt-omni/mini-omni2

Any-to-Any • Updated Oct 24, 2024 • 65 • 283
BAAI/Emu3-Gen

Any-to-Any • 8B • Updated Oct 23, 2024 • 2.15k • 226

ALMs (Audio Language Models)

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • Updated Jan 12, 2025 • 308k • 527

nvidia/Hymba-1.5B-Instruct

Text Generation • 2B • Updated Jan 2, 2025 • 731 • 246

parler-tts/parler-tts-large-v1

Text-to-Speech • 2B • Updated Nov 22, 2024 • 8.29k • 273
OuteAI/OuteTTS-0.2-500M

Text-to-Speech • Updated Apr 17, 2025 • 488 • 310
suno/bark

Text-to-Speech • Updated Oct 4, 2023 • 16.8k • 1.52k

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12, 2025 • 1.18M • • 2.01k
Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated Jan 12, 2025 • 1.8M • • 688
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF

33B • Updated Nov 15, 2024 • 4.08k • 77
deepseek-ai/DeepSeek-Coder-V2-Instruct

Text Generation • 236B • Updated Aug 21, 2024 • 5.35k • 682

Reasoning (LRMs)

Qwen/QwQ-32B-Preview

Text Generation • 33B • Updated Jan 12, 2025 • 7.97k • • 1.74k
AIDC-AI/Marco-o1

Text Generation • 8B • Updated Nov 23, 2024 • 3.06k • • 712

Image Generation

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 697k • • 12.7k
black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 732k • • 4.79k
stabilityai/stable-diffusion-3.5-large

Text-to-Image • Updated Oct 22, 2024 • 54.1k • • 3.43k

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 28.2k • 583
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 252 • 1.71k
vidore/colsmolvlm-v0.1

Visual Document Retrieval • Updated Mar 14, 2025 • 53 • 55
meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 195k • 1.58k

openai/shap-e

Text-to-3D • Updated Dec 11, 2023 • 5.46k • 265
tencent/Hunyuan3D-1

Image-to-3D • Updated Oct 17, 2025 • 1.65k • 309

Video Generation

zai-org/CogVideoX1.5-5B-I2V

Image-to-Video • Updated Mar 18, 2025 • 6.02k • 117
zai-org/CogVideoX-2b

Text-to-Video • Updated Nov 23, 2024 • 27.7k • 360
zai-org/CogVideoX-5b

Text-to-Video • Updated Nov 23, 2024 • 35.1k • • 672
genmo/mochi-1-preview

Text-to-Video • Updated Sep 4, 2025 • 5.32k • • 1.32k

Speech Recognition

nvidia/canary-1b

Automatic Speech Recognition • Updated Dec 3, 2025 • 1.53k • 457
facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 71.2k • 971
nyrahealth/CrisperWhisper

Automatic Speech Recognition • 2B • Updated 12 days ago • 27.2k • 327
openai/whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 6.63M • • 2.95k

Dataset to fine-tune Embeddings

sentence-transformers/pubmedqa

Viewer • Updated Jun 19, 2024 • 35.4k • 72 • 3
sujet-ai/Sujet-Financial-RAG-EN-Dataset

Viewer • Updated Jul 28, 2024 • 106k • 185 • 11
enelpol/rag-mini-bioasq

Viewer • Updated Jun 27, 2024 • 44.9k • 566 • 13
philschmid/finanical-rag-embedding-dataset

Viewer • Updated Jun 3, 2024 • 7k • 47 • 20

Reranking Models

jinaai/jina-reranker-v2-base-multilingual

Text Ranking • 0.3B • Updated Oct 21, 2025 • 1.19M • 349
BAAI/bge-reranker-v2-m3

Text Classification • 0.6B • Updated Jun 24, 2024 • 7.36M • • 952

Embedding Models

nomic-ai/nomic-embed-text-v1.5

Sentence Similarity • 0.1B • Updated 12 days ago • 12.1M • 799
BAAI/bge-m3

Sentence Similarity • Updated Jul 3, 2024 • 16.3M • • 2.93k

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs