FineInstructions

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

craffel updated a model 9 days ago

fineinstructions/pretraining_experiments

craffel authored a paper 10 days ago

TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior

AjayP13 updated a model about 1 month ago

fineinstructions/pretraining_experiments

View all activity

craffel

updated a model 9 days ago

fineinstructions/pretraining_experiments

Updated 9 days ago

craffel

authored a paper 10 days ago

TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior

Paper • 2512.20757 • Published 12 days ago • 16

AjayP13

updated a model about 1 month ago

fineinstructions/pretraining_experiments

Updated 9 days ago

AjayP13

in fineinstructions/finetemplates 2 months ago

can you add a license

#2 opened 2 months ago by

huu-ontocord

craffel

authored a paper 6 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 75

craffel

authored a paper 7 months ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5, 2025 • 59

CCB

authored a paper 9 months ago

Concept Lancet: Image Editing with Compositional Representation Transplant

Paper • 2504.02828 • Published Apr 3, 2025 • 16

AjayP13

authored a paper 10 months ago

mStyleDistance: Multilingual Style Embeddings and their Evaluation

Paper • 2502.15168 • Published Feb 21, 2025 • 3

CCB

authored a paper 10 months ago

mStyleDistance: Multilingual Style Embeddings and their Evaluation

Paper • 2502.15168 • Published Feb 21, 2025 • 3

CCB

authored a paper 11 months ago

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published Feb 20, 2025 • 14

craffel

authored a paper 11 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 253

AjayP13

authored a paper about 1 year ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 121

CCB

authored a paper over 1 year ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 121

AjayP13

authored 3 papers over 1 year ago

ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer

Paper • 2308.15459 • Published Aug 29, 2023 • 1

Large Language Models Can Self-Improve At Web Agent Tasks

Paper • 2405.20309 • Published May 30, 2024 • 2

TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings

Paper • 2406.15586 • Published Jun 21, 2024 • 2

craffel

authored a paper over 1 year ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 99

CCB

authored a paper almost 2 years ago

DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows

Paper • 2402.10379 • Published Feb 16, 2024 • 31

AjayP13

authored a paper almost 2 years ago

DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows

Paper • 2402.10379 • Published Feb 16, 2024 • 31

craffel

authored a paper about 2 years ago

Resolving Interference When Merging Models

Paper • 2306.01708 • Published Jun 2, 2023 • 15

AI & ML interests

Recent Activity

Team members 3

fineinstructions's activity

can you add a license