Work

Where I've worked, what I studied, and what I've published.

Experience

LLMsPythonAIHealthcareSarvamAnthropic

A pioneering initiative in collaboration with ARMMAN that leverages AI and robotics to address societal challenges, particularly in healthcare. Shipped a multilingual LLM chatbot with Hindi audio responses for frontline workers managing high-risk pregnancies, improving access for underserved users and reducing clinician workload. Built a 2-tier semantic caching layer using an approved FAQ bank to serve common queries — aiming to reduce real-time LLM inference cost by ~22% while improving response latency. Benchmarked embedding models and tuned similarity thresholds using production replay data; implemented LLM-as-judge rubrics across 10k+ query-response pairs to improve quality and reduce failure modes. Designed an analytics dashboard to track 40,000+ chatbot interactions, examining user engagement patterns and HRP topic distribution to enable effective resource allocation. Improved production readiness via structured logging, load testing, and unit tests for critical API endpoints.

Next.jsFastAPIPythonRedisCeleryOCR

An AI-powered platform that automates the creation of presentations and summaries from various document types, enhancing productivity for professionals. Architected an asynchronous OCR microservice using Celery and Redis to scale document ingestion and text extraction, improving throughput by ~40%. Developed two full-stack applications using Next.js and FastAPI for document conversion (PowerPoint/PDF/Word) and optimised processing pipelines, reducing end-to-end processing time by ~30%. Enhanced chat-with-document UX by implementing a custom math expression renderer in Markdown.

AWSPythonMicroservicesREST APICompliance

A leading provider of compliance and archiving solutions for highly regulated industries, helping organisations manage digital communications across multiple channels. Developed Smarsh's transcriber-api using Amazon Transcribe, adding configurable settings to increase flexibility and support compliance requirements for voice communications. Led the development of RESTful API microservices, facilitating cross-functional team integration and enhancing scalability, which streamlined internal workflows and improved collaboration.

PythonMLNLPClassificationMultiprocessing

A Japanese tech conglomerate revolutionising online retail, fintech, and mobile services. Fine-tuned fashion article classification models to 88-93% accuracy to improve cross-platform product matching in a commerce workflow. Optimised preprocessing and training with multiprocessing, reducing training time by ~60% to iterate and ship model improvements faster. Improved matching precision to support better discount-deal discovery for Rakuten's cashback extension.