Wang

VincentWang

VincentWong1

AI & ML interests

None yet

Recent Activity

liked a model 27 days ago

meituan-longcat/LongCat-Flash-Thinking-2601

liked a dataset about 2 months ago

kensho/DocFinQA

liked a model about 2 months ago

YOYO-AI/Qwen3-30B-A3B-YOYO-Thinking-Chimera

Organizations

None yet

liked a model 27 days ago

meituan-longcat/LongCat-Flash-Thinking-2601

Text Generation • 562B • Updated about 1 month ago • 5.21k • 102

liked a dataset about 2 months ago

kensho/DocFinQA

Viewer • Updated Nov 19, 2024 • 7.44k • 920 • 14

liked a model about 2 months ago

YOYO-AI/Qwen3-30B-A3B-YOYO-Thinking-Chimera

Text Generation • 31B • Updated Jan 5 • 30 • 5

liked a model 2 months ago

OpenAssistant/reward-model-deberta-v3-large-v2

Text Classification • Updated Feb 1, 2023 • 15.2k • 244

liked 2 datasets 2 months ago

Mxode/Chinese-Instruct

Viewer • Updated May 9, 2025 • 4.85M • 539 • 143

BAAI/IndustryCorpus2

Viewer • Updated Dec 17, 2024 • 826M • 2.28k • 64

liked 2 datasets 3 months ago

nvidia/Nemotron-RL-instruction_following-structured_outputs

Viewer • Updated Jan 12 • 9.95k • 474 • 34

instruction-pretrain/general-instruction-augmented-corpora

Preview • Updated about 5 hours ago • 31.2k • 20

liked a model 5 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • Updated Aug 26, 2025 • 19.7k • 484

liked a dataset 6 months ago

inclusionAI/ASearcher-train-data

Preview • Updated Aug 13, 2025 • 216 • 26

liked a model 7 months ago

infly/inf-retriever-v1

liked a dataset 7 months ago

FreedomIntelligence/Evol-Instruct-Chinese-GPT4

Viewer • Updated Dec 6, 2023 • 70k • 31 • 47

liked a model 9 months ago

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

71B • Updated Apr 13, 2025 • 1.22k • 90

liked 3 datasets 9 months ago

liked a model 9 months ago

TIGER-Lab/general-verifier

Question Answering • 2B • Updated Apr 15, 2025 • 4.81k • 21

upvoted an article 10 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

106

liked 2 datasets 10 months ago

ucinlp/drop

Viewer • Updated Jan 17, 2024 • 86.9k • 3.2k • 66

deepmind/aqua_rat

Viewer • Updated Jan 9, 2024 • 196k • 3.58k • 72
