Luis Quintanilla
Home
About
Profile
Contact
Uses
Colophon
Feeds
Main
Responses
Blog
Subscribe
Starter Packs
Blogroll
Podroll
Forums
YouTube
Collections
Radio
Books
Tags
Knowledgebase
Snippets
Wiki
Presentations
Live
Stream
Recordings
Events
evaluation
A list of posts tagged evaluation
Blogs
Notes
Responses
Predictive Human Preference: From Model Ranking to Model Routing
Evaluating LLMs is a minefield
Best Practices for LLM Evaluation of RAG Applications
MLflow 2.8 with LLM-as-a-judge metrics and Best Practices for LLM Evaluation of RAG Applications