monday Service partnered with LangSmith to create an eval-driven development framework for customer-facing service agents. They adopted a code-first evaluation strategy from day one. The LangChain blog shares their implementation details.
Key Points
- monday Service integrates LangSmith for eval-driven agent development
- Code-first evaluations were implemented from project inception
- The work targets the reliability of customer-facing service agents
- The framework emphasizes programmatic testing over manual checks
Impact Analysis
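This case study illustrates how building evaluations in from the start can help keep customer-facing AI agents reliable in production, and it may encourage other teams to adopt a similar approach. It also serves as an enterprise example of LangSmith being used to develop LLM applications at scale.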
Technical Details
LangSmith provides tracing, evaluation, and monitoring for LLM applications. monday Service built a framework around code-based evals for its service agents, so that quality checks ran automatically from day one instead of relying on manual review.
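
To make the idea of a code-based eval concrete, here is a minimal sketch in plain Python. It is not monday Service's framework and it does not call the LangSmith SDK. Every name in it (`EvalCase`, `toy_agent`, `mentions_keyword`, `run_evals`, `CASES`) is hypothetical, and the keyword check stands in for whatever quality criteria a real team would define.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class EvalCase:
    """One test example: a customer question and a phrase the answer must contain."""
    question: str
    must_mention: str


def toy_agent(question: str) -> str:
    """Hypothetical stand-in for a real service agent (no LLM call)."""
    if "refund" in question.lower():
        return "You can request a refund from the billing page."
    return "Let me connect you with a human agent."


def mentions_keyword(answer: str, case: EvalCase) -> bool:
    """Code-based evaluator: passes if the required phrase appears in the answer."""
    return case.must_mention.lower() in answer.lower()


def run_evals(agent: Callable[[str], str], cases: List[EvalCase]) -> float:
    """Run the agent on every case and return the fraction that pass."""
    passed = sum(mentions_keyword(agent(c.question), c) for c in cases)
    return passed / len(cases)


# Assumed example dataset; a real suite would be drawn from actual support tickets.
CASES = [
    EvalCase("How do I get a refund?", "refund"),
    EvalCase("Can I talk to a person?", "human agent"),
    EvalCase("How do I reset my password?", "password"),
]

if __name__ == "__main__":
    score = run_evals(toy_agent, CASES)
    print(f"pass rate: {score:.2f}")
```

Because the checks are ordinary functions, a suite like this can run in CI on every change, and a pass-rate threshold can gate releases. In a LangSmith-based setup, the dataset and evaluator functions would typically be registered with the platform so results are tracked across runs.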

