Abhisek Behera PRO

Abhisek987

AI & ML interests

None yet

Recent Activity

repliedto their post 2 days ago

Every Python developer has hit this: you upgrade numpy or pandas, and code that worked yesterday breaks today. I built an open dataset for exactly this problem. DepDoctor is 6,204 examples of Python code broken by a dependency upgrade, each paired with the fix and a short note on the API change that caused it. It is a mixture of real cases mined from public GitHub commits and synthetic cases generated from a database of known breaking changes. A few things I tried to get right: - 935 "leave it alone" examples, to teach a model restraint, not just what to change. - Honest evaluation: a fine-tuned Qwen2.5-Coder-7B gets 62% of fixes fully correct. I report that, not just the 97% text-similarity score that hides the truth. - The main failure mode, over-editing, is measured and explained rather than buried. Dataset, fine-tuned model, and a live demo are all open in one place: https://huggingface.co/collections/Abhisek987/depdoctor Feedback welcome, especially from anyone working on code repair or API migration.

posted an update 3 days ago

updated a dataset 3 days ago

Abhisek987/depdoctor-dataset

View all activity

Organizations

None yet

Posts 1

Post

131

Every Python developer has hit this: you upgrade numpy or pandas, and code that worked yesterday breaks today.

I built an open dataset for exactly this problem. DepDoctor is 6,204 examples of Python code broken by a dependency upgrade, each paired with the fix and a short note on the API change that caused it. It is a mixture of real cases mined from public GitHub commits and synthetic cases generated from a database of known breaking changes.

A few things I tried to get right:
- 935 "leave it alone" examples, to teach a model restraint, not just what to change.
- Honest evaluation: a fine-tuned Qwen2.5-Coder-7B gets 62% of fixes fully correct. I report that, not just the 97% text-similarity score that hides the truth.
- The main failure mode, over-editing, is measured and explained rather than buried.

Dataset, fine-tuned model, and a live demo are all open in one place:
https://huggingface.co/collections/Abhisek987/depdoctor

Feedback welcome, especially from anyone working on code repair or API migration.

Collections 1

spaces 7

DepDoctor

🩺

AI code fixer for Python dependency upgrades

Text-to-Image Diffusion Model

🎨

Generate images from text prompts with a diffusion model

GPT Text Generator

🤖

Generate text continuations from a custom GPT model

Named Entity Recognition

🏷

Identify and highlight entities in any English sentence

English to German Translator

🌍

Translate English sentences to German

Phi35 Vision Pdf To Markdown

🌖

Fine-tuned Phi-3.5-Vision to extract structured markdown fro

View 7 Spaces

models 7

Abhisek Behera PRO

AI & ML interests

Recent Activity

Organizations

Posts 1

Collections 1

Abhisek987/depdoctor-dataset

Abhisek987/depdoctor-v5-lora

DepDoctor

Abhisek987/depdoctor-dataset

Abhisek987/depdoctor-v5-lora

DepDoctor

spaces 7

DepDoctor

Text-to-Image Diffusion Model

GPT Text Generator

Named Entity Recognition

English to German Translator

Phi35 Vision Pdf To Markdown

models 7

Abhisek987/depdoctor-v5-lora

Abhisek987/depdoctor-v4-lora

Abhisek987/phi35-vision-pdf-markdown

Abhisek987/llama-test-3.2-sql-merged

Abhisek987/llama-test-3.2-sql-lora

Abhisek987/llama-3.2-sql-merged

Abhisek987/llama-3.2-sql-lora

datasets 1

Abhisek987/depdoctor-dataset

Abhisek Behera PRO

AI & ML interests

Recent Activity

Organizations

Posts 1

Collections 1

DepDoctor

DepDoctor

spaces 7 Sort: Recently updated

DepDoctor

Text-to-Image Diffusion Model

GPT Text Generator

Named Entity Recognition

English to German Translator

Phi35 Vision Pdf To Markdown

models 7 Sort: Recently updated

datasets 1

spaces 7

models 7