Shail Shah
shail-2512
AI & ML interests
None yet
Organizations
LLMs
Coder
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 364k • • 1.97k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 569k • • 590 -
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 1.1k • 74 -
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 44k • 674
Image Generation
3D
Speech Recognition
-
nvidia/canary-1b
Automatic Speech Recognition • Updated • 2.37k • 458 -
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 227k • 942 -
nyrahealth/CrisperWhisper
Automatic Speech Recognition • 2B • Updated • 3.51k • 320 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 2.62M • • 2.77k
Reranking Models
ALMs (Audio Language Models)
TTS
Reasoning (LRMs)
VLMs
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 22.8k • 571 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 352 • 1.7k -
vidore/colsmolvlm-v0.1
Visual Document Retrieval • Updated • 29 • 55 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 95.4k • • 1.55k
Video Generation
Dataset to fine-tune Embeddings
Embedding Models
MultiModal (Any-to-Any)
ALMs (Audio Language Models)
LLMs
TTS
Coder
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 364k • • 1.97k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 569k • • 590 -
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 1.1k • 74 -
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 44k • 674
Reasoning (LRMs)
Image Generation
VLMs
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 22.8k • 571 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 352 • 1.7k -
vidore/colsmolvlm-v0.1
Visual Document Retrieval • Updated • 29 • 55 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 95.4k • • 1.55k
3D
Video Generation
Speech Recognition
-
nvidia/canary-1b
Automatic Speech Recognition • Updated • 2.37k • 458 -
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 227k • 942 -
nyrahealth/CrisperWhisper
Automatic Speech Recognition • 2B • Updated • 3.51k • 320 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 2.62M • • 2.77k
Dataset to fine-tune Embeddings
Reranking Models
Embedding Models