Avni Kothari

Research Engineer - AI Safety

I am an AI Research Engineer with 5 years of software and ML engineering experience. My AI research, presented at ICLR, ICML, and NeurIPS, focuses on:

  1. Improving transparency and alignment in high-stakes AI models
  2. Ensuring fairness by detecting scenarios of preclusion

I recently completed my Master's in Computer Science at UC San Diego, where I was named a DeepMind Fellow. Outside of work, you can usually find me biking or enjoying the outdoors.

My CV can be found here.

Don't hesitate to reach out if you want to learn more about me or my work!

News

Industry Experience

  • Research Engineer; University of California, San Francisco - San Francisco, CA

    September 2023 – Present

    Developed a multimodal method to improve interpretability of black-box models by generating and refining concept bottleneck models within a Bayesian framework for uncertainty quantification, outperforming baseline methods by 35%

    Created a method that uses a Bayesian tree-based model to merge Llama 2’s domain knowledge with empirical data, increasing interpretability by 40%, as confirmed by domain experts, while matching top predictive performance and quantifying risk

    Designed and deployed a scalable ETL pipeline to process health record data, enabling ML training and evaluation for 3K+ patient records and 30K+ patient visits

    Built, deployed, and evaluated a custom clinical risk prediction model adopted by 10+ clinics, achieving 12% higher accuracy than the general clinical model

    Mentored 5+ peers and PhD students through teaching sessions and code reviews

  • Software Engineer; Edovo - Chicago, IL

    January 2020 – May 2021

    Architected, tested, and deployed an educational content platform using Elasticsearch to handle 700K+ requests per day

    Led 10+ requirements-gathering sessions with product owners to rebuild a platform

    Created a data pipeline and job to merge 4B rows of user event data in PostgreSQL

  • Lead Software Engineer; 8th Light - Chicago, IL

    August 2017 – March 2019

    Implemented and deployed a scalable load testing platform simulating 1000+ RPS

    Engineered API integrations to sync 1000+ interactions per minute across different time zones

Papers

Research Software

  • bc-llm

    June 2024

    Implemented and developed a multimodal method using Metropolis Gibbs sampling to identify interpretable features for complex models

    Implemented 5+ comparator methods and benchmarked them against our method, which achieved performance comparable to or exceeding black-box models

  • reachml

    June 2022

    Constructed a Mixed Integer Program to handle 50+ feature constraints for counterfactual explanations and test for robustness

    Created a model-agnostic fairness and safety audit to identify scenarios of preclusion

    Developed an HPC-based experimental pipeline to audit 200K+ individuals and benchmark results against baseline methods

Skills

    ML Engineering

  • ML Pipelines and ETL
  • Multimodal datasets
  • ML Deployment
  • ML Development & Evaluation

    AI Safety Research Skills

  • Interpretability
  • Robustness
  • Fairness and Bias
  • Risk Quantification
  • LLMs and Foundation Models
  • LLM Evaluations

    Tools and Frameworks

  • Python (Hugging Face, PyTorch, scikit-learn, NumPy, pandas)
  • AWS (EC2, S3, Terraform, Deployment Strategies)
  • Elasticsearch
  • Databases (SQL, PostgreSQL, DuckDB)
  • Docker

Education

Poster Presentations

  • NeurIPS Workshop on Statistical Foundations of LLMs and Foundation Models

    December 2024

  • ICML Workshop on Data-centric Machine Learning Research

    July 2023

  • ICML Workshop on Spurious Correlations, Invariance and Stability

    July 2023

  • ICML Workshop on Artificial Intelligence & Human Computer Interaction

    July 2023

Teaching Experience

  • Fall 2022

    TA for DSC 291 - Interpretability and Explainability in Machine Learning

  • May 2011 – August 2013

    Differential Calculus Tutor

Service

  • Vision 1948 (May 2023 – Present)
  • PenPal for the Incarcerated (September 2020 – Present)
  • Warren Community Garden (August 2021 – Present)
  • UCSF AI4All (July 2024)
  • The Recyclery (August 2018 – May 2021)