24 13

Oliver Kowalski

browser-kid

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

upvoted a paper 9 days ago

Looped World Models

liked a model 25 days ago

coolthor/gemma-4-12B-it-FP8-dynamic

View all activity

Organizations

None yet

upvoted a paper 8 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 12 days ago • 48

upvoted a paper 9 days ago

Looped World Models

Paper • 2606.18208 • Published 13 days ago • 467

liked a model 25 days ago

coolthor/gemma-4-12B-it-FP8-dynamic

Any-to-Any • 12B • Updated 24 days ago • 9.16k • 1

upvoted a paper 25 days ago

Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling

Paper • 2606.02578 • Published 28 days ago • 6

liked 2 models 27 days ago

sbfisher/tir-sft-calc_only_from_base

Text Generation • 0.5B • Updated 27 days ago • 34 • 1

Cukinator/cpu1-ablation-checkpoints

Text Generation • Updated 16 days ago • 3

upvoted a paper 29 days ago

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models

Paper • 2605.30161 • Published May 28 • 60

upvoted 2 papers about 1 month ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

Paper • 2605.16928 • Published May 16 • 97

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published May 20 • 207

liked a model about 1 month ago

stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 1.33M • • 7.86k

upvoted 2 papers about 1 month ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published May 13 • 274

Learning from Failures: Correction-Oriented Policy Optimization with Verifiable Rewards

Paper • 2605.14539 • Published May 14 • 8

upvoted a paper about 2 months ago

Can Muon Fine-tune Adam-Pretrained Models?

Paper • 2605.10468 • Published May 11 • 6

liked 2 datasets about 2 months ago

Maynor996/upload2

Viewer • Updated about 13 hours ago • 2 • 472k • 24

anonymous-24421/DriCo

Viewer • Updated May 12 • 16.9k • 17 • 1

liked a model about 2 months ago

apol/alia-40b-distill-vapol

Text Generation • Updated May 4 • 16 • 2

upvoted 2 papers 2 months ago

Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems

Paper • 2604.04936 • Published Jan 8 • 26

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

Paper • 2604.19636 • Published Apr 21 • 88

liked a dataset 3 months ago

Salesforce/GiftEvalPretrain

Preview • Updated Jan 21, 2025 • 202k • 39

upvoted a paper 3 months ago

R3PM-Net: Real-time, Robust, Real-world Point Matching Network

Paper • 2604.05060 • Published Apr 6 • 8

Oliver Kowalski

AI & ML interests

Recent Activity

Organizations

browser-kid's activity