arxiv:2504.06947
Mikhail Tikhomirov
RefalMachine
AI & ML interests
NLP
Recent Activity
liked a Space 1 day ago
kaengreg/rusBEIR liked a dataset 9 days ago
redmadrobot-rnd/pii_benchmark upvoted a paper 19 days ago
Trust-Region Behavior Blending for On-Policy Distillation