dan

prayerdan

14 16 12

AI & ML interests

Rag, DeepResearch, Medical LLM

Recent Activity

submitted a paper about 2 months ago

SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

upvoted a paper about 2 months ago

SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

new activity 4 months ago

Qwen/Qwen3.5-397B-A17B:qwen 3.5 系列什么时候在megatron 支持cp

View all activity

Organizations

submitted a paper to Daily Papers about 2 months ago

SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

Paper • 2606.07074 • Published Jun 5 • 12

upvoted a paper about 2 months ago

SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

Paper • 2606.07074 • Published Jun 5 • 12

New activity in Qwen/Qwen3.5-397B-A17B 4 months ago

qwen 3.5 系列什么时候在megatron 支持cp

#67 opened 4 months ago by

prayerdan

New activity in AQ-MedAI/PRGB 6 months ago

Upload en_all_fix_refine_0919.jsonl

#1 opened 6 months ago by

jyh777

updated a model 6 months ago

AQ-MedAI/Diver-Retriever-4B

Text Ranking • 4B • Updated Jan 17 • 334 • 24

upvoted a paper 7 months ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published Dec 10, 2025 • 89

liked a model 8 months ago

deepseek-ai/DeepSeek-V3.2-Speciale

Text Generation • 685B • Updated Dec 1, 2025 • 2.64k • • 716

updated 2 models 8 months ago

AQ-MedAI/Diver-GroupRank-7B

Text Ranking • 8B • Updated Nov 23, 2025 • 70 • 7

AQ-MedAI/Diver-GroupRank-32B

Text Ranking • 33B • Updated Nov 23, 2025 • 13 • 3

liked 2 models 8 months ago

AQ-MedAI/Diver-GroupRank-32B

Text Ranking • 33B • Updated Nov 23, 2025 • 13 • 3

AQ-MedAI/Diver-GroupRank-7B

Text Ranking • 8B • Updated Nov 23, 2025 • 70 • 7

New activity in AQ-MedAI/RAG-QA-Leaderboard 8 months ago

Update README.md

#11 opened 8 months ago by

jyh777

Update README.md

#10 opened 8 months ago by

jyh777

published a Space 8 months ago

RagQALeaderboard

🥇

RagQALeaderboard

authored a paper 8 months ago

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning

Paper • 2511.11653 • Published Nov 10, 2025 • 59

commented a paper 8 months ago

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning

Paper • 2511.11653 • Published Nov 10, 2025 • 59 •

updated a collection 8 months ago

RagSystem

Collection

9 items • Updated Nov 18, 2025 • 2

published 2 models 8 months ago

AQ-MedAI/Diver-GroupRank-7B

Text Ranking • 8B • Updated Nov 23, 2025 • 70 • 7

AQ-MedAI/Diver-GroupRank-32B

Text Ranking • 33B • Updated Nov 23, 2025 • 13 • 3

commented a paper 8 months ago

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning

Paper • 2511.11653 • Published Nov 10, 2025 • 59 •

dan

AI & ML interests

Recent Activity

Organizations

prayerdan's activity

qwen 3.5 系列 什么时候在megatron 支持cp

Upload en_all_fix_refine_0919.jsonl

Update README.md

Update README.md

RagQALeaderboard

qwen 3.5 系列什么时候在megatron 支持cp