BigCode

non-profit

https://www.bigcode-project.org/

bigcode-project

Recent Activity

lckr authored a paper 16 days ago

StarCoder 2 and The Stack v2: The Next Generation

iNeil77 authored a paper about 2 months ago

Themis: Training Robust Multilingual Code Reward Models for Flexible Multi-Criteria Scoring

iNeil77 submitted a paper about 2 months ago

Themis: Training Robust Multilingual Code Reward Models for Flexible Multi-Criteria Scoring

Papers

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Articles

BigCodeArena: Judging code generations end to end with code executions

authored a paper 18 days ago

Who Annotates in NLP? A Large-scale Assessment of Human Annotation Reporting between 2018 and 2025

Paper • 2606.02255 • Published 19 days ago

submitted a paper to Daily Papers 18 days ago

Who Annotates in NLP? A Large-scale Assessment of Human Annotation Reporting between 2018 and 2025

Paper • 2606.02255 • Published 19 days ago

submitted a paper to Daily Papers 21 days ago

Reducing Political Manipulation with Consistency Training

Paper • 2605.22771 • Published 23 days ago • 1

jensjorisdecorte

authored a paper 22 days ago

Efficient Text Encoders for Labor Market Analysis

Paper • 2505.24640 • Published May 30, 2025

jensjorisdecorte

authored 3 papers 29 days ago

SkillMatch: Evaluating Self-supervised Learning of Skill Relatedness

Paper • 2410.05006 • Published Oct 7, 2024

Unified Work Embeddings: Contrastive Learning of a Bidirectional Multi-task Ranker

Paper • 2511.07969 • Published Nov 11, 2025

Multilingual JobBERT for Cross-Lingual Job Title Matching

Paper • 2507.21609 • Published Jul 29, 2025

authored a paper about 1 month ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 63

authored a paper about 2 months ago

Themis: Training Robust Multilingual Code Reward Models for Flexible Multi-Criteria Scoring

Paper • 2605.00754 • Published May 1 • 3

submitted a paper to Daily Papers about 2 months ago

Themis: Training Robust Multilingual Code Reward Models for Flexible Multi-Criteria Scoring

Paper • 2605.00754 • Published May 1 • 3

authored a paper 2 months ago

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

Paper • 2604.05083 • Published Apr 6

authored a paper 2 months ago

Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning

Paper • 2604.02007 • Published Apr 2 • 14

posted an update 3 months ago

I like these models nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16 and nvidia/NVIDIA-Nemotron-3-Nano-4B-FP8 and TradingAgents: Multi-Agents LLM Financial Trading Framework (2412.20138) and https://arxiv.org/abs/2412.20138

mlabonne/FineTome-100k

authored 2 papers 3 months ago

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA

Paper • 2603.08501 • Published Mar 9

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published Mar 19 • 3

submitted 2 papers to Daily Papers 3 months ago

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published Mar 19 • 3

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA

Paper • 2603.08501 • Published Mar 9

RTT1

authored a paper 3 months ago

EvoClaw: Evaluating AI Agents on Continuous Software Evolution

Paper • 2603.13428 • Published Mar 13 • 21

RTT1

submitted a paper to Daily Papers 4 months ago

LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

Paper • 2602.07075 • Published Feb 6 • 19

authored a paper 4 months ago

GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities

Paper • 2507.12367 • Published Jul 16, 2025 • 7