-
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation
Paper • 2604.09497 • Published • 26 -
artefactory/BERTJudge
Text Classification • 0.2B • Updated • 60 • 3 -
artefactory/BERTJudge-Free-CR
Text Classification • 0.2B • Updated • 4 • 1 -
artefactory/BERTJudge-Formatted-QCR
Text Classification • 0.2B • Updated • 61 • 1
AI & ML interests
NLP, Information Retrieval, Computer Vision, Uncertainty Estimation, Trustworthy AI, Bias Estimation, Unbalanced ML, Choice Modeling, Time Series
Recent Activity
Papers
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation
Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate
Artefact is a data-driven company specializing in Artificial Intelligence and Machine Learning solutions.
Our mission:
👉 We help organizations unlock the full potential of their data, empowering them to make smarter decisions and drive digital transformation.
👉 We place a strong emphasis on research-oriented innovation, actively contributing to the AI and data science community.
Visit our website | Follow us on LinkedIn
-
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation
Paper • 2604.09497 • Published • 26 -
artefactory/BERTJudge
Text Classification • 0.2B • Updated • 60 • 3 -
artefactory/BERTJudge-Free-CR
Text Classification • 0.2B • Updated • 4 • 1 -
artefactory/BERTJudge-Formatted-QCR
Text Classification • 0.2B • Updated • 61 • 1