Luis Quintanilla
evaluation
A list of posts tagged evaluation
Responses
Predictive Human Preference: From Model Ranking to Model Routing
Evaluating LLMs is a minefield
Best Practices for LLM Evaluation of RAG Applications
MLflow 2.8 with LLM-as-a-judge metrics and Best Practices for LLM Evaluation of RAG Applications