A pioneering initiative in collaboration with ARMMAN that leverages AI and robotics to address societal challenges, particularly in healthcare.
- Shipped a multilingual LLM chatbot with Hindi audio responses for frontline workers managing high-risk pregnancies, improving access for underserved users and reducing clinician workload.
- Built a 2-tier semantic caching layer using an approved FAQ bank to serve common queries — aiming to reduce real-time LLM inference cost by ~22% while improving response latency.
- Benchmarked embedding models and tuned similarity thresholds using production replay data; implemented LLM-as-judge rubrics across 10k+ query-response pairs to improve quality and reduce failure modes.
- Designed an analytics dashboard to track 40,000+ chatbot interactions, examining user engagement patterns and HRP topic distribution to enable effective resource allocation.
- Improved production readiness via structured logging, load testing, and unit tests for critical API endpoints.