Content tagged with "evaluation"
Found 4 items tagged with "evaluation".
Predictive Human Preference: From Model Ranking to Model Routing
responses
Other tags: ai, llm, ml
Evaluating LLMs is a minefield
bookmarks
Other tags: ai, llms
Best Practices for LLM Evaluation of RAG Applications
bookmarks
Other tags: ai, rag, llm
MLflow 2.8 with LLM-as-a-judge metrics and Best Practices for LLM Evaluation of RAG Applications
bookmarks
Other tags: mlflow, ai, llm