Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
hanzlajavaid's picture
6 2 15

hanzlajavaid PRO

hanzla
KOOPLA's profile picture plaethos999's profile picture Innovationflow's profile picture
ยท

AI & ML interests

Direct Preference Optimization, Supervised Finetuning, Stable Diffusion

Recent Activity

posted an update 14 days ago
Reinforcement learning can sometimes lead to emergent behavior through much simpler training setups compared to large scale pre-training. I explored this idea by running a small GRPO experiment on Qwen3.5 4B, and the results were pretty exciting. Hypothesis: improving visual mathematical reasoning may also improve the modelโ€™s ability to transcribe LaTeX from images. I wrote a short breakdown of the experiment here: https://hanzlajavaid.github.io/blog/grpo-experiment-exploring-emergent-properties/
updated a model 22 days ago
hanzla/Qwen3.5-4B-mathvista-GRPO
published a model 22 days ago
hanzla/Qwen3.5-4B-mathvista-GRPO
View all activity

Organizations

ZeroGPU Explorers's profile picture Journalists on Hugging Face's profile picture MLX Community's profile picture ModularityAI's profile picture Social Post Explorers's profile picture OpenAI gpt-oss Grants's profile picture

hanzla 's Spaces 3

Sleeping
Agents
9

Falcon3MambaReasoner

๐Ÿ“Š

Generate responses to text prompts in a chat interface

Mar 24, 2025
Runtime error
Agents
3

PlaygroundGemma3

๐Ÿ†

Chat with Gemma 3 about images

Mar 12, 2025
Runtime error
Agents
3

PlaygroundAyaVision

๐Ÿ“š

Generate text descriptions from images and prompts

Mar 11, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs