OctoPack: Instruction Tuning Code Large Language Models
Paper • 2308.07124 • Published • 33
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("bigcode/santacoder-ldf", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("bigcode/santacoder-ldf", trust_remote_code=True)This is SantaCoder finetuned using the Line Diff Format introduced in OctoPack.
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="bigcode/santacoder-ldf", trust_remote_code=True)