Fine-Tunning
xtuner/llava-llama-3-8b-v1_1-gguf Image-to-Text • 8B • Updated Apr 30, 2024 • 2.56k • 221

Vision_LLM
vidore/colpali-v1.3 Visual Document Retrieval • Updated Mar 14, 2025 • 33.5k • 84
CohereLabs/aya-vision-8b Image-Text-to-Text • 9B • Updated Oct 30, 2025 • 40k • 315
google/gemma-3-12b-it Image-Text-to-Text • 12B • Updated Mar 21, 2025 • 1.31M • 605
meta-llama/Llama-4-Scout-17B-16E-Instruct Any-to-Any • 109B • Updated May 22, 2025 • 231k • 1.17k