Qwen/Qwen3.5-9B
Image-Text-to-Text • 10B • 1.01M • 649
datatrove for all things web-scale data preparation: https://github.com/huggingface/datatrove
nanotron for lightweight 4D parallelism LLM training: https://github.com/huggingface/nanotron
lighteval for in-training fast parallel LLM evaluations: https://github.com/huggingface/lighteval