18 17

Иван Поляков

meme-addict

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

JordiFabregat1/s103-assets

upvoted a paper 4 days ago

Healthcare AI GYM for Medical Agents

upvoted a paper 7 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

View all activity

Organizations

None yet

liked a model 1 day ago

JordiFabregat1/s103-assets

Updated 1 day ago • 2

upvoted a paper 4 days ago

Healthcare AI GYM for Medical Agents

Paper • 2605.02943 • Published 15 days ago • 4

upvoted a paper 7 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 9 days ago • 106

liked a model 9 days ago

bigscience/bloom

Text Generation • 176B • Updated Jul 28, 2023 • 6.14k • 5k

upvoted a paper 10 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 13 days ago • 153

liked a dataset 14 days ago

cracklinoatbran/reward_hacking_monitor_2046

Viewer • Updated 14 days ago • 2.05k • 34 • 2

liked a model 22 days ago

Zaytron40k/qwen2511-sketch2art-checkpoints

Updated 22 days ago • 1

liked a model about 1 month ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 2.26k • 906

liked a dataset about 1 month ago

felixwangg/prime_vul_minus_splitted_line_diff_mask_skip_indent_ctx5_chat_v2

Viewer • Updated Apr 12 • 4.05k • 62

liked a model about 1 month ago

rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new

Text Generation • Updated Apr 12 • 1

upvoted 2 papers about 1 month ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 502

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 628

liked a model about 1 month ago

igor-saprygin/so101-fixed-layout-smolvla-3cam

Robotics • Updated Apr 10 • 1

upvoted 3 papers about 1 month ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 364

T5Gemma-TTS Technical Report

Paper • 2604.01760 • Published Apr 2 • 11

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published Mar 28 • 28

liked a dataset about 1 month ago

taresh18/indic-speech

Viewer • Updated Apr 3 • 87.8k • 57

liked a model about 1 month ago

zachyuan/BiRefNet

Image Segmentation • 0.2B • Updated Apr 1 • 10

upvoted a paper about 1 month ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 350

upvoted a paper about 2 months ago

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156

Иван Поляков

AI & ML interests

Recent Activity

Organizations

meme-addict's activity