Tencent-Hunyuan-Multimodal-RL

company
https://TODO

AI & ML interests

None defined yet.

Recent Activity

Jiaqi-hkust  authored a paper 39 minutes ago
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?
Jiaqi-hkust  submitted a paper about 4 hours ago
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?
cheese1  authored a paper about 23 hours ago
Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models

Papers

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning


Members: Xiangxin Zhou, Lazy Beaver, Boye Niu, Ruoyu, Jiarui Yao, Jiaqi Tang, Tianyu Pang, PU JIAN, sumail, Lvfang Tao
Tencent-Hunyuan-Multimodal-RL's papers (3)
Submitted by Tianyu Pang
38

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Tencent-Hunyuan-Multimodal-RL
3
Submitted by Xiangxin Zhou
41

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

Tencent-Hunyuan-Multimodal-RL
3
Submitted by Xiangxin Zhou
29

Rethinking the Divergence Regularization in LLM RL

Tencent-Hunyuan-Multimodal-RL
521 4