I am a Machine Learning Scientist at Reddit, where I work on advancing ads conversion models to the state of the art. Previously, I was a Research Scientist at Meta, where I bootstrapped generative models for business AI agents and optimized ad relevance ranking. I completed my Computer Science PhD at the University of Virginia, advised by Prof. Tom Fletcher and previously by Prof. Vicente Ordóñez Román. During my PhD, I interned at Adobe Research with Kushal Kafle, bootstrapping large language and vision models to design VLMs via instruction-tuning, and at Salesforce Research with Nikhil Naik and Prof. Stefano Ermon, on vision-language alignment & retrieval and conditional & controllable image generation using diffusion models. Earlier, I was a Research Scientist with Prof. Donald E. Brown, developing AI methods for disease understanding and diagnosis. I also hold a Masters in Data Science from the University of Virginia and a Bachelors in Technology in Mechanical Engineering from the Indian Institute of Technology, Roorkee.

My research spans (1) information-efficient multi-media retrieval and ranking, (2) vision-language alignment and representation learning, (3) conditional and controllable image and text generation, (4) foundation language and vision models, and (5) their applications to disease understanding and computational medical imaging. Outside of my day job, I run aaam.dev, a small studio building quiet, private, ad-free apps. Find my CV here.

📝 Selected Publications

Diffusion Models for Histopathological Image Generation
Aman Shrivastava and P. Thomas Fletcher
In Generative Machine Learning Models in Medical Image Computing, Springer Nature. Book Chapter 2024
chapter

Learning Group Actions on Latent Representations
Yinzhu Jin, Aman Shrivastava, and P. Thomas Fletcher
In Advances in Neural Information Processing Systems. NeurIPS 2024
paper

NASDM, Nuclei-Aware Semantic Histopathology Image Generation Using Diffusion Models
Aman Shrivastava and P. Thomas Fletcher
In International Conference on Medical Image Computing and Computer-Assisted Intervention. MICCAI 2023 (Oral)
paper | code

CLIP-Lite, Information Efficient Visual Representation Learning from Textual Annotations
Aman Shrivastava, Ramprasaath R. Selvaraju, Nikhil Naik, and Vicente Ordonez.
In International Conference on Artificial Intelligence and Statistics, PMLR. AISTATS 2023 (Oral)
paper | code

Estimating and Maximizing Mutual Information for Knowledge Distillation
Aman Shrivastava, Yanjun Qi, and Vicente Ordonez.
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. CVPR TCV Workshop 2023
paper | code

Improving interpretability via explicit word interaction graph layer
Arshdeep Sekhon, Hanjie Chen, Aman Shrivastava, Zhe Wang, Yangfeng Ji, and Yanjun Qi.
In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI 2023
paper | code

Identifying metabolic shifts in Crohn’s disease using’omics-driven contextualized computational metabolic network models
Philip Fernandes, Yash Sharma, Fatima Zulqarnain, Brooklyn McGrew, Aman Shrivastava, Lubaina Ehsan, Dawson Payne et al.
In Nature Scientific Reports 2023.
paper

Self-attentive adversarial stain normalization
Aman Shrivastava, William Adorno, Yash Sharma, Lubaina Ehsan, S. Asad Ali, Sean R. Moore, Beatrice Amadi, Paul Kelly, Sana Syed, and Donald E. Brown.
In International Conference on Pattern Recognition, Springer. ICPR 2021 (Oral)
paper | code

Cluster-to-Conquer, A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification
Yash Sharma, Aman Shrivastava, Lubaina Ehsan, Christopher A. Moskaluk, Sana Syed, and Donald E. Brown.
In Medical Imaging with Deep Learning, PMLR. MIDL 2021
paper | code

Deep learning for visual recognition of environmental enteropathy and celiac disease
Aman Shrivastava, Karan Kant, Saurav Sengupta, Sung-Jun Kang, Marium Khan, S. Asad Ali, Sean R. Moore et al.
In IEEE EMBS International Conference on Biomedical & Health Informatics (BHI). BHI 2019
paper

📂 Side Projects

aaam.dev
Founded a small independent app studio building quiet, private apps with no ads, no in-app purchases, and no tracking. Two apps are live and free on the App Store: hum, a calm, invite-only home for couples, and tir, a live semantic word race played against the room.
studio | hum — for two | tir — word race

Krity
Co-founded an open audiobook platform that allows listeners to find audiobooks in diverse voices, and narrators to give voices to their favorite books. Have produced and published over 40 audiobooks.
website

Connect 4 AI
Developed a lightweight connect-4 game with a self-written pure-javascript bot using Minimax algorithm and Monte Carlo simulations. Featured on Hacker News and released as a Google Play Store app rated 4.7.
website | code

Humorous Image Captioning System
Implemented a self-attentive encoder-decoder framework to generate humorous captions for images indistinguishable from human generated memes.
code

Soccer Squad Optimization
Designed a strategic football squad selection algorithm given budget, nationality (and/or club) and playing formation constraints based on self extracted FIFA dataset. Longstanding featured dataset on Kaggle.
code | dataset

📖 Education

  • 2020 - 2024: Computer Science PhD | University of Virginia
  • 2018 - 2019: Masters in Data Science | University of Virginia
  • 2013 - 2017: Bachelors in Technology, Mechanical Engineering | Indian Institute of Technology, Roorkee

💻 Experience

  • Machine Learning Scientist at Reddit | February 2026 - present
    Advancing ads conversion models to the state of the art.

  • Research Scientist at Meta | January 2025 - January 2026
    Worked on bootstrapping generative models for business AI agents and optimized ad relevance ranking to improve CTR by 25%.

  • Research Scientist Intern at Adobe Research | June - November, 2023
    Worked on designing an AI assistant for visual reasoning via bootstrapping pretrained foundation models.

  • Research Scientist Intern at Salesforce Research | June - November, 2022
    Worked on conditional generative diffusion models for image synthesis and vision-language alignment.

  • Research Scientist at University of Virginia | June 2019 - June 2020
    Developed learning frameworks for the understanding and assisted diagnosis of gastrointestinal diseases.

  • Analyst at Citibank | June 2018 - June 2019
    Built a streamlined visualization platform with data-driven insights for the Chief Country Officer. However, would not recommend.

🛠 Skills & Interests

  • Interests: Generative modeling, multimodal learning, computer vision, ranking & recommendation, and healthcare AI
  • Languages: Python, C++, JavaScript, R, Ruby, Julia, LaTeX
  • Frameworks & Tools: PyTorch, TensorFlow, Keras, Git, AWS, GCP, MongoDB, Redis

👨‍🏫 Teaching / Talks

  • 2023
    • Co-instructor for Geometry of Data | University of Virginia | lecture videos
    • Oral Presentation at MICCAI 2023
    • Invited Speaker for Research Speaker Series | PathAI
    • Teaching Assistant for Digital Signal Processing | University of Virginia
  • 2022
    • Teaching Assistant for Digital Signal Processing | University of Virginia
    • Teaching Assistant for Geometry of Data | University of Virginia
    • Teaching Assistant for Machine Learning | University of Virginia
    • Python Instructor for SOAR Scholars Program | University of Virginia
  • 2019
    • Python Instructor for Health Sciences Library | University of Virginia
    • Assistant Capstone Advisor for School of Data Science | University of Virginia
    • Invited Speaker for Applied Machine Learning Conference | Tom Tom Festival

♟️ Beyond Research

  • Chess: Represented the University of Virginia at the Virginia State Collegiate Chess Championship, 2023.
  • Writing: Editor-in-Chief of Geek Gazette, the campus technical magazine at IIT Roorkee.
  • Communities: Member of the Information Management Group (campus coding society) and the quizzing society at IIT Roorkee.