Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Muhammad Ramzan's picture

Muhammad Ramzan

iamramzan

Murat231's profile picture

ali14325's profile picture

Altan3265's profile picture

·

https://linktr.ee/ramzanshaheen

i_amramzan
iamramzan
iamramzanai

AI & ML interests

GenAI, Vision & Co

Organizations

iamramzan 's collections 5

Shaheen Collection 🦅

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 424k • • 12.9k
Abirate/english_quotes

Viewer • Updated Oct 25, 2022 • 2.51k • 2.47k • 104
fka/awesome-chatgpt-prompts

Viewer • Updated about 3 hours ago • 954 • 19.7k • 9.54k
DAMO-NLP-SG/multimodal_textbook

Updated Mar 17, 2025 • 1.41k • 156

Vision Foundation Models 🧩

Foundation models for computer vision.

Running

110

Grounding DINO Demo

💻

110

Cutting edge open-vocabulary object detection app
Running

Featured

95

Owlv2

👀

95

State-of-the-art Zero-shot Object Detection
Runtime error

Featured

41

BLIP2 with transformers

🌖

41

BLIP2 (cutting edge image captioning) in 🤗transformers
Build error

Featured

378

IDEFICS Playground

🐨

378

Comprehensive Computer Vision Backbones 🧩

This collection offers a variety of pre-trained computer vision backbones ideal for fine-tuning.

microsoft/resnet-50

Image Classification • 25.6M • Updated Feb 13, 2024 • 229k • • 471
google/vit-base-patch16-224-in21k

Image Feature Extraction • 86.4M • Updated Feb 5, 2024 • 1.14M • 392
google/vit-base-patch32-224-in21k

Image Feature Extraction • 88M • Updated Dec 8, 2022 • 5.74k • 19
facebook/dinov2-large

Image Feature Extraction • 0.3B • Updated Sep 6, 2023 • 3.06M • 99

Top Vision-Language Papers 🖼️💬📝

A curated list of papers on vision-language models, with the most influential ones at the top.

Improved Baselines with Visual Instruction Tuning

Paper • 2310.03744 • Published Oct 5, 2023 • 39
DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 48
Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities

Paper • 2308.12966 • Published Aug 24, 2023 • 11
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model

Paper • 2404.01331 • Published Mar 29, 2024 • 27

Cutting-Edge Object Detection Models 🥥

facebook/detr-resnet-50

Object Detection • 41.6M • Updated Apr 10, 2024 • 1.04M • • 919
facebook/detr-resnet-101-dc5

Object Detection • 60.7M • Updated Sep 6, 2023 • 1.4k • 19
facebook/detr-resnet-50-dc5

Object Detection • 41.6M • Updated Sep 7, 2023 • 1.43k • 6
google/owlvit-base-patch32

Zero-Shot Object Detection • 0.2B • Updated Dec 12, 2023 • 86.3k • 143

Shaheen Collection 🦅

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 424k • • 12.9k
Abirate/english_quotes

Viewer • Updated Oct 25, 2022 • 2.51k • 2.47k • 104
fka/awesome-chatgpt-prompts

Viewer • Updated about 3 hours ago • 954 • 19.7k • 9.54k
DAMO-NLP-SG/multimodal_textbook

Updated Mar 17, 2025 • 1.41k • 156

Top Vision-Language Papers 🖼️💬📝

A curated list of papers on vision-language models, with the most influential ones at the top.

Improved Baselines with Visual Instruction Tuning

Paper • 2310.03744 • Published Oct 5, 2023 • 39
DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 48
Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities

Paper • 2308.12966 • Published Aug 24, 2023 • 11
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model

Paper • 2404.01331 • Published Mar 29, 2024 • 27

Vision Foundation Models 🧩

Foundation models for computer vision.

Running

110

Grounding DINO Demo

💻

110

Cutting edge open-vocabulary object detection app
Running

Featured

95

Owlv2

👀

95

State-of-the-art Zero-shot Object Detection
Runtime error

Featured

41

BLIP2 with transformers

🌖

41

BLIP2 (cutting edge image captioning) in 🤗transformers
Build error

Featured

378

IDEFICS Playground

🐨

378

Cutting-Edge Object Detection Models 🥥

facebook/detr-resnet-50

Object Detection • 41.6M • Updated Apr 10, 2024 • 1.04M • • 919
facebook/detr-resnet-101-dc5

Object Detection • 60.7M • Updated Sep 6, 2023 • 1.4k • 19
facebook/detr-resnet-50-dc5

Object Detection • 41.6M • Updated Sep 7, 2023 • 1.43k • 6
google/owlvit-base-patch32

Zero-Shot Object Detection • 0.2B • Updated Dec 12, 2023 • 86.3k • 143

Comprehensive Computer Vision Backbones 🧩

This collection offers a variety of pre-trained computer vision backbones ideal for fine-tuning.

microsoft/resnet-50

Image Classification • 25.6M • Updated Feb 13, 2024 • 229k • • 471
google/vit-base-patch16-224-in21k

Image Feature Extraction • 86.4M • Updated Feb 5, 2024 • 1.14M • 392
google/vit-base-patch32-224-in21k

Image Feature Extraction • 88M • Updated Dec 8, 2022 • 5.74k • 19
facebook/dinov2-large

Image Feature Extraction • 0.3B • Updated Sep 6, 2023 • 3.06M • 99

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs