Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
Qwen2.5-VL-7B-Instruct
like
1.42k
Follow
Qwen
64.1k
Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_vl
image-to-text
multimodal
conversational
text-generation-inference
arxiv:
2309.00071
arxiv:
2409.12191
arxiv:
2308.12966
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
63
Deploy
Use this model
Request: DOI
#36
by
mojan3
- opened
Mar 2, 2025
Discussion
mojan3
Mar 2, 2025
See translation
mojan3
Mar 2, 2025
ูุตู
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
ยท
Sign up
or
log in
to comment