Visual Document Retrieval
PEFT
Safetensors
ColPali
vidore
multimodal_embedding
multilingual_embedding
Text-to-Visual Document (T→VD) retrieval
{
  "max_num_visual_tokens": 1280,
  "processor_class": "ColQwen2_5_Processor"
}
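
The config above only names a processor class and a cap on visual tokens. The following Python sketch reads those two fields and converts the token cap into an approximate pixel budget per image. It is a sketch under stated assumptions, not the library's implementation: the 28x28 pixels-per-token figure is assumed from the Qwen2.5-VL vision encoder (14-pixel patches merged 2x2), and `load_processor_config` and `pixel_budget` are hypothetical helper names introduced here for illustration.

```python
import json

# The config shown above, embedded as a string so the sketch is self-contained.
CONFIG_TEXT = """
{
  "max_num_visual_tokens": 1280,
  "processor_class": "ColQwen2_5_Processor"
}
"""

# Assumption (not stated in the config): each visual token covers a
# 28x28 pixel area, i.e. 14-pixel patches merged 2x2.
PIXELS_PER_VISUAL_TOKEN = 28 * 28


def load_processor_config(text: str) -> dict:
    """Parse the JSON config and check the two fields it is expected to hold."""
    config = json.loads(text)
    if not isinstance(config.get("processor_class"), str):
        raise ValueError("processor_class must be a string")
    tokens = config.get("max_num_visual_tokens")
    if not isinstance(tokens, int) or tokens <= 0:
        raise ValueError("max_num_visual_tokens must be a positive integer")
    return config


def pixel_budget(config: dict) -> int:
    """Approximate maximum pixels per image implied by the token cap."""
    return config["max_num_visual_tokens"] * PIXELS_PER_VISUAL_TOKEN


config = load_processor_config(CONFIG_TEXT)
print(config["processor_class"], pixel_budget(config))
```

Under that assumption, raising `max_num_visual_tokens` lets larger page images through at the cost of more embedding vectors per page, while lowering it forces stronger downscaling before encoding.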