AI glossary

LLM-as-judge

Using a strong model to evaluate the outputs of another model against your criteria. Used for eval suites at scale when human grading isn't tractable.

Want to talk about how this applies to your stack?

Book a 20-min call →Browse all terms

More terms

Agent
Agentic workflow
BAA (Business Associate Agreement)
Cache (prompt caching)
Citations / grounding
Context window

4.3
99% Job Success
Top rated Plus
4.5
4.9
5.0
4.0