user_image

Avni Kothari

AI / Machine Learning Engineer

I'm a AI/ML Engineer with 5+ years of experience architecting and deploying end-to-end ML and LLM powered production systems in healthcare.

My specialization is in leveraging LLMs and multimodal patient data. I collaborate with clinicians to lead healthcare AI projects from conception and research through to scalable, maintained deployment, delivering measurable improvements in clinical workflows.

Below you can find my research which is published at top conferences (ICLR, NeurIPS, ML for Health) with corresponding open-source software packages.


I recently graduated from my Master's in Computer Science at UC San Diego where I was named a DeepMind Fellow. Outside of work you can usually find me biking or enjoying the outdoors.

My CV can be found here.

Don't hesitate to reach out if you want to learn more about me or my work!

News

  • August 2025

    Deeply humbled to hear many of LLM Evals expert Shreya Shankar's favorite topics are covered in our most recent paper!

  • March 2025

    Check out new my blogpost about Constitutional Classifiers!

Skills

    Languages

  • Python
  • TypeScript
  • Java
  • Elixir
  • Clojure
  • AI/ML

  • GenAI (GPT 4o, Claude Sonnet 4, Cohere Command A)
  • Langchain
  • HuggingFace
  • PyTorch
  • LLM Evaluations
  • Benchmarking LLMs
  • Scikit-Learn
  • MLOps

  • ML Pipelines and ETL
  • CI/CD
  • AWS (EC2, S3, Terraform)
  • Elasticsearch
  • Docker

Industry Experience

  • AI/ML Engineer; University of California, San Francisco - San Francisco, CA

    September 2023 – Present

    Architected and deployed clinical AI systems to save social workers 300+ hours/ week, identify patients at risk for readmission, and create simple, interpretable models from multimodal patient data

    Drove end-to-end architecture of a GPT 4o powered clinical summarization pilot to condense patient charts (20K+ tokens) into actionable summaries, co-developed with 5+ clinicians to improve patient care

    Established and led human and automated annotation pipelines with active learning to ensure 90%+ accuracy on 1000+ GPT 4o clinical summaries

    Designed and deployed a scalable ETL pipeline to process health record data, enabling ML training and evaluation for 3000+ patient records and 30000+ patient visits

    Engineered, deployed, and evaluated a custom readmission risk prediction model, adopted by 10+ clinics, with 12% higher accuracy than the general model

    Created and presented a hybrid LLM-empirical framework to reconcile clinical LLM hallucinations, achieving 40% interpretability improvement (validated by clinical experts) while matching top predictive performance and enabling production deployment

    Developed and authored a publication for a multimodal ML pipeline to integrate tabular and unstructured clinical data to improve black-box model interpretability, delivering 7% AUC improvement in readmission risk prediction

  • ML Engineer; University of California, SD; CA

    June 2022 – Mar 2024

    Developed a model-agnostic bias auditing framework surfacing bias in up to 30% of lending datasets across 5+ models and 3+ datasets with 25+ real-world constraints

    Collaborated with a lending expert to define real-world constraints on lending datasets to identify individuals subject to preclusion

    Authored a publication that received an ICLR Spotlight (Top 5% among submissions)

  • Software Engineer; Edovo - Chicago, IL

    Jan 2020 – May 2021

    Led the re-architecture and deployment of a scalable educational platform, handling the full lifecycle from leading product requirement sessions to engineering a data backend

    Architected, tested, and deployed a content platform using Elasticsearch to handle 700K+ requests per day to thousands of users

    Led 10+ requirement gathering sessions with Product owners to re-build a platform

    Created a data pipeline and job to merge 4B rows of user event data in PostgreSQL

  • Lead Software Engineer; 8th Light - Chicago, IL

    Aug 2017 – March 2019

    Built high-performance technical systems from inception to production, including load testing platforms, API integrations, and patient-facing healthcare applications

    Implemented and deployed a scalable load testing platform simulating 1000+ RPS

    Deployed API integrations to sync 1000+ interactions/ minute in different timezones

    Developed a diabetes management iOS app to connect 60+ patients with diabetic nurses

    Mentored peers and residents through pair programming sessions and code reviews

Papers

Research Software

  • bc-llm

    June 2024

    Responsible AI software that enhances model transparency and safety through a multimodal LLM–Bayesian framework, allowing for the creation of simple and interpretable models.

  • reachml

    June 2022

    Created a model-agnostic fairness and safety audit to identify scenarios of preclusion

Education

Poster Presentations

  • NeurIPS Workshop Statistical Foundations of LLMs and Foundation Models

    December 2024

  • ICML Workshop on Data-centric Machine Learning Research

    July 2023

  • ICML Workshop on Spurious Correlations, Invariance and Stability

    July 2023

  • ICML Workshop on Artificial Intelligence & Human Computer Interaction

    July 2023

Teaching Experience

  • Fall 2022

    TA for DSC 291 - Interpretability and Explainability in Machine Learning

  • May 2011 – August 2013

    Differential Calculus Tutor

Service

  • Vision 1948 (May 2023 – Present)
  • PenPal for the Incarcerated (September 2020 – Present)
  • Warren Community Garden (August 2021 – Present)
  • UCSF AI4All (July 2024 – July 2024)
  • The Recyclery (August 2018 – May 2021)