Navan
Call quality assurance
Teams couldn't manually review hundreds of daily AI hotel calls. Audio models now evaluate raw recordings, routing exceptions to humans.
- >0.9 macro F1 score for automated quality checks
- Evaluation accuracy increase from 0.56 to 0.89 F1 score