Any-to-Any
Transformers
Safetensors
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
Instructions to use openbmb/MiniCPM-o-2_6 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use openbmb/MiniCPM-o-2_6 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("openbmb/MiniCPM-o-2_6", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
#13
by flyingmrwang - opened
README.md
CHANGED
|
@@ -563,7 +563,7 @@ Note: For proprietary models, we calculate token density based on the image enco
|
|
| 563 |
<td colspan="11" align="left"><strong>Open-Source</strong></td>
|
| 564 |
</tr>
|
| 565 |
<tr>
|
| 566 |
-
<td nowrap="nowrap" align="left">Qwen2-Audio-
|
| 567 |
<td>8B</td>
|
| 568 |
<td>-</td>
|
| 569 |
<td>7.5</td>
|
|
@@ -576,7 +576,7 @@ Note: For proprietary models, we calculate token density based on the image enco
|
|
| 576 |
<td><strong>55.3</strong></td>
|
| 577 |
</tr>
|
| 578 |
<tr>
|
| 579 |
-
<td nowrap="nowrap" align="left">Qwen2-Audio-
|
| 580 |
<td>8B</td>
|
| 581 |
<td>2.6*</td>
|
| 582 |
<td>6.9*</td>
|
|
|
|
| 563 |
<td colspan="11" align="left"><strong>Open-Source</strong></td>
|
| 564 |
</tr>
|
| 565 |
<tr>
|
| 566 |
+
<td nowrap="nowrap" align="left">Qwen2-Audio-7B</td>
|
| 567 |
<td>8B</td>
|
| 568 |
<td>-</td>
|
| 569 |
<td>7.5</td>
|
|
|
|
| 576 |
<td><strong>55.3</strong></td>
|
| 577 |
</tr>
|
| 578 |
<tr>
|
| 579 |
+
<td nowrap="nowrap" align="left">Qwen2-Audio-7B-Instruct</td>
|
| 580 |
<td>8B</td>
|
| 581 |
<td>2.6*</td>
|
| 582 |
<td>6.9*</td>
|