AI glossary
LLM-as-judge
Using a strong model to evaluate the outputs of another model against your criteria. Used for eval suites at scale when human grading isn't tractable.
AI glossary
Using a strong model to evaluate the outputs of another model against your criteria. Used for eval suites at scale when human grading isn't tractable.