Contributing Writer · Evaluation Systems

Abby Feng

Contributing Writer, Explore Agentic

Abby covers the evaluation side of shipping LLM systems — offline benchmarks versus production monitoring, metric design, and the unglamorous program management that turns one-off evals into a standing quality function. She writes for the engineer who has to defend a quality number in a review meeting.

LLM evaluation · Quality metrics · Eval programs