Bram Vanroy's picture

Bram Vanroy PRO

BramVanroy

·

https://bramvanroy.github.io/

AI & ML interests

Artificial intelligence, natural language processing, computational linguistics

Recent Activity

upvoted a collection 19 days ago

Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability

updated a dataset 19 days ago

BramVanroy/finewiki-nl-30-to-24k-tokens

liked a Space 19 days ago

dlouapre/eiffel-tower-llama

View all activity

Organizations

upvoted a collection 19 days ago

Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability

A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated 20 days ago • 19

upvoted a collection 21 days ago

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated 13 days ago • 54

upvoted an article about 1 month ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

558

upvoted a paper about 1 month ago

The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

Paper • 2510.13996 • Published Oct 15, 2025 • 8

upvoted a paper 2 months ago

Common Corpus: The Largest Collection of Ethical Data for LLM Pre-Training

Paper • 2506.01732 • Published Jun 2, 2025 • 6

upvoted a collection 2 months ago

Leesplank Wim

18 items • Updated Nov 12, 2025 • 1

upvoted a paper 3 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 122

upvoted a collection 3 months ago

Qwen3

84 items • Updated 6 days ago • 1.54k

upvoted an article 4 months ago

Article

mmBERT: ModernBERT goes Multilingual

+4

Sep 9, 2025

•

133

upvoted 2 collections 4 months ago

robots-txt

6 items • Updated Aug 2, 2025 • 8

open-sci-ref-0.01

Research baseline models trained on various open reference datasets • 12 items • Updated Jul 23, 2025 • 4

upvoted an article 4 months ago

Article

Releasing Common Corpus: the largest public domain dataset for training LLMs

Mar 20, 2024

•

30

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

743

upvoted a paper 6 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 75

upvoted an article 8 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

•

74

upvoted a paper about 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

upvoted a collection about 1 year ago

Common Models

The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 41

upvoted an article about 1 year ago

Article

EuroLLM-9B

Dec 2, 2024

•

138

upvoted a collection about 1 year ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 6 days ago • 672

upvoted a collection about 2 years ago

GEITje 7B: A Large Open Dutch Language Model

All models and datasets relating to GEITje • 8 items • Updated Jan 25, 2025 • 5