๐ŸŸฃ JujutsuKaiserver

The Cursed Intelligence. The Canon Oracle.

Model Quantization RAG License Dataset


"Throughout Heaven and Earth, I alone am the honored one."
โ€” Satoru Gojo | and also this model, kind of.


JujutsuKaiserver is a Retrieval-Augmented Generation (RAG) model built for one purpose:
to answer anything and everything about the Jujutsu Kaisen universe โ€” with canon-backed accuracy, zero hallucination tolerance, and the confidence of Unlimited Void.


โšก What It Does

Ask it anything. Techniques. Domains. Arcs. Hidden lore. Character relationships. Cursed Energy mechanics. It retrieves the most relevant passages from a 200+ page wiki corpus, feeds them into a fine-tuned Qwen2.5-1.5B-Instruct backbone, and gives you a clean, grounded answer โ€” not a guess.

Ask This Get This
"What is Sukuna's Shrine?" Full technique breakdown with canon context
"How does Mahito's Idle Transfiguration work?" Soul-level mechanics explained accurately
"What happened in the Shibuya Incident?" Arc summary backed by wiki chunks
"Who is the strongest Grade 1 sorcerer?" Ranked answer with sourced reasoning

๐Ÿง  Architecture

User Query
    โ”‚
    โ–ผ
sentence-transformers (all-MiniLM-L6-v2)
    โ”‚  [embed query]
    โ–ผ
FAISS Index (jjk_index.faiss)
    โ”‚  [top-5 relevant wiki chunks]
    โ–ผ
Qwen2.5-1.5B-Instruct (4-bit)
    โ”‚  [context + question โ†’ chat template]
    โ–ผ
Canon-grounded Answer

Model Composition

Component Details
๐Ÿค– Base LLM Qwen/Qwen2.5-1.5B-Instruct (4-bit quantized)
๐Ÿ”ข Embeddings sentence-transformers/all-MiniLM-L6-v2
๐Ÿ“ฆ Vector Store FAISS โ€” jjk_index.faiss
๐Ÿ“– Knowledge Base 120+ cleaned JJK Fandom Wiki articles (chunks.txt)
๐Ÿ”ง Pipeline Custom JujutsuKaiserver class with Qwen chat template

๐Ÿš€ Quick Start

from huggingface_hub import snapshot_download

model_dir = snapshot_download("TinyModels/JujutsuKaiserver")

import sys
sys.path.insert(0, model_dir)
from pipeline import JujutsuKaiserver

bot = JujutsuKaiserver(model_dir=model_dir)

# Ask anything
print(bot.ask("What is Gojo's Domain Expansion called?"))
# โ†’ "Infinite Void (็„ก้‡็ฉบๅ‡ฆ). It..."

โš ๏ธ Requirements: bitsandbytes, GPU with โ‰ฅ6 GB VRAM. CPU inference works but is slow.

Install Dependencies

pip install transformers bitsandbytes faiss-cpu sentence-transformers huggingface_hub

๐Ÿ–ฅ๏ธ Gradio Demo (Optional)

Spin up a local chat UI in seconds:

import gradio as gr
from pipeline import JujutsuKaiserver

bot = JujutsuKaiserver(model_dir="<path_to_downloaded_model>")

def chat(message, history):
    return bot.ask(message)

gr.ChatInterface(
    fn=chat,
    title="๐ŸŸฃ JujutsuKaiserver",
    description="Ask anything about the JJK universe."
).launch()

โœจ Features

  • ๐Ÿ” Factual Q&A โ€” Every answer is grounded in retrieved wiki content, not imagination
  • ๐Ÿšซ Hallucination Guard โ€” Model is prompted to say "I don't know" when context is insufficient
  • ๐Ÿ“š Deep Coverage โ€” 200+ wiki pages: characters, techniques, domains, arcs, lore
  • โšก T4-Friendly โ€” 4-bit quantization means it runs on free Colab tiers
  • ๐Ÿค– Gradio Ready โ€” One-script local demo included out of the box

โš ๏ธ Known Limitations

  • Recent chapters beyond the scraping date may not be indexed yet
  • Ambiguous context can still occasionally produce imperfect answers โ€” being addressed via a feedback loop
  • Roleplay mode is possible with a custom system prompt, but this version is optimized for factual retrieval

๐Ÿ”ฎ Roadmap

  • Live Feedback Flagging โ€” ๐Ÿ‘/๐Ÿ‘Ž votes from the Gradio Space feed a correction dataset automatically
  • Self-Correcting Pipeline โ€” Weekly DPO fine-tuning on flagged examples + FAISS index refresh
  • Expanded KB โ€” Episode transcripts, manga panels text, community lore
  • Streaming Support โ€” Token-by-token output for snappier UX

๐Ÿ“‚ Repo Structure

JujutsuKaiserver/
โ”œโ”€โ”€ pipeline.py          # Core RAG pipeline class
โ”œโ”€โ”€ jjk_index.faiss      # FAISS vector index
โ”œโ”€โ”€ chunks.txt           # Raw wiki knowledge base
โ”œโ”€โ”€ generation_config.json
โ””โ”€โ”€ README.md

Built with ๐Ÿฉธ and cursed energy for the JJK community.

Got a question the bot fumbled? Open a Discussion and help us fix it.

TinyModels โ€ข QuantaSparkLabs โ€ข Apache 2.0

Downloads last month
102
Safetensors
Model size
2B params
Tensor type
F32
ยท
BF16
ยท
U8
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support