computer vision, machine learning
human-computer interaction, cognitive science
fairness, accountability, and transparency
*our lab members pictured
2025

Attention IoU: Examining Biases in CelebA using Attention Maps

Aaron Serianni, Tyler Zhu, Olga Russakovsky and Vikram V. Ramaswamy

Computer Vision and Pattern Recognition (CVPR), 2025.

[bibtex]

interpretability
model bias
AI fairness
transparency and explainability

Interactivity x Explainability: Toward Understanding How Interactivity Can Improve Computer Vision Explanations

Indu Panigrahi, Sunnie S. Y. Kim*, Amna Liaqat*, Rohan Jinturkar, Olga Russakovsky, Ruth Fong and Parastoo Abtahi

(* = equal contribution)

ACM Conference on Human Factors in Computing Systems (CHI), Extended Abstract Track, 2025.

[bibtex]

human-AI interaction
transparency and explainability

D^3: Scaling Up Deepfake Detection by Learning from Discrepancy

Yongqi Yang, Zhihao Qian, Ye Zhu, Olga Russakovsky and Yu Wu

Computer Vision and Pattern Recognition (CVPR), 2025.

[paper] [bibtex]

deepfake detection
scaling

Portraying Large Language Models as Machines, Tools, or Companions Affects What Mental Capacities Humans Attribute to Them

Allison Chen, Sunnie S. Y. Kim, Amaya Dharmasiri, Olga Russakovsky and Judith E. Fan

ACM Conference on Human Factors in Computing Systems (CHI), Extended Abstract Track, 2025.

[bibtex]

human-AI interaction
psychology
mental capacity attributions

Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies

Sunnie S. Y. Kim, Jennifer Wortman Vaughan, Q. Vera Liao, Tania Lombrozo and Olga Russakovsky

ACM Conference on Human Factors in Computing Systems (CHI), 2025.

[paper] [bibtex]

human-AI interaction
trust and reliance

Unifying Specialized Visual Encoders for Video Language Models

Jihoon Chung*, Tyler Zhu*, Max Gonzalez Saez-Diez, Juan Carlos Niebles, Honglu Zhou and Olga Russakovsky

arXiv preprint arXiv:2501.01426, 2025.

[paper] [code] [website] [bibtex]

video
video understanding
multimodal LLMs
2024

ICONS: Influence Consensus for Vision-Language Data Selection

Xindi Wu, Mengzhou Xia, Rulin Shao, Zhiwei Deng, Pang Wei Koh and Olga Russakovsky

arXiv preprint arXiv:2501.00654, 2024.

[paper] [code] [website] [bibtex]

vision and language
data selection
data for efficient learning

ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty

Xindi Wu*, Dingli Yu*, Yangsibo Huang*, Olga Russakovsky and Sanjeev Arora

Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks Track, 2024.

[paper] [code] [website] [bibtex]

compositionality
image generation
evaluation

Benchmark Suites Instead of Leaderboards for Evaluating AI Fairness

Angelina Wang, Aaron Hertzmann and Olga Russakovsky

Patterns, 2024.

[paper] [bibtex]

evaluation
AI fairness

What is Dataset Distillation Learning?

William Yang, Ye Zhu, Zhiwei Deng and Olga Russakovsky

International Conference on Machine Learning (ICML), 2024.

[paper] [code] [bibtex]

dataset distillation

Analyzing the Roles of Language and Vision in Learning from Limited Data

Allison Chen, Ilia Sucholutsky, Olga Russakovsky and Tom Griffiths

Proceedings of the Annual Meeting of the Cognitive Science Society (CogSci), 2024.

[paper] [bibtex]

vision and language
cognitive science
cognitive architecture

ImageNet-OOD: Deciphering Modern Out-of-Distribution Detection Algorithms

William Yang, Byron Zhang and Olga Russakovsky

International Conference on Learning Representations (ICLR), 2024.

[paper] [code] [bibtex]

OOD
analysis
dataset

Vision-Language Dataset Distillation

Xindi Wu, Byron Zhang, Zhiwei Deng and Olga Russakovsky

Transactions on Machine Learning Research (TMLR), 2024.

[paper] [code] [bibtex]

vision and language
dataset distillation
data for efficient learning
2023

Efficient, Self-Supervised Human Pose Estimation with Inductive Prior Tuning

Nobline Yoo and Olga Russakovsky

International Conference on Computer Vision (ICCVW) ROAD++ Workshop, 2023.

[paper] [code] [bibtex]

human pose estimation
self-supervised

Boundary Guided Learning-Free Semantic Control with Diffusion Models

Ye Zhu, Yu Wu, Zhiwei Deng, Olga Russakovsky and Yan Yan

Neural Information Processing Systems (NeurIPS), 2023.

[paper] [code] [website] [bibtex]

diffusion models
image generation
controllable generation

GeoDE: a Geographically Diverse Evaluation Dataset for Object Recognition

Vikram V. Ramaswamy, Sing Yu Lin, Dora Zhao, Aaron B. Adcock, Laurens van der Maaten, Deepti Ghadiyaram and Olga Russakovsky

Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks Track, 2023.

[paper] [code] [website] [bibtex]

new benchmark
object recognition
geodiversity

Overwriting Pretrained Bias with Finetuning Data

Angelina Wang and Olga Russakovsky

International Conference on Computer Vision (ICCV), 2023.

[paper] [code] [bibtex]

algorithmic intervention
AI fairness

Gender Artifacts in Visual Datasets

Nicole Meister*, Dora Zhao*, Angelina Wang, Vikram V. Ramaswamy, Ruth Fong and Olga Russakovsky

(* = equal contribution)

International Conference on Computer Vision (ICCV), 2023.

[paper] [code] [bibtex]

data analysis
AI fairness

Art and the Science of Generative AI

Ziv Epstein, Aaron Hertzmann, the Investigators of Human Creativity (Memo Akten, Hany Farid, Jessica Fjeld, Morgan R. Frank, Matthew Groh, Laura Herman, Neil Leach, Robert Mahari, Alex Pentland, Olga Russakovsky, Hope Schroeder and Amy Smith)

Science Perspectives, 2023.

[paper] [extended white paper] [bibtex]

AI and society

Discrete Diffusion Reward Guidance Methods for Offline Reinforcement Learning

Matthew Coleman, Olga Russakovsky, Christine Allen-Blanchette and Ye Zhu

International Conference on Machine Learning (ICMLW) Sampling and Optimization in Discrete Space Workshop, 2023.

[paper] [bibtex]

offline RL
diffusion policy

ICON^2: Reliably Benchmarking Predictive Inequity in Object Detection

Sruthi Sudhakar, Viraj Prabhu, Olga Russakovsky and Judy Hoffman

arXiv preprint arXiv:2306.04482, 2023.

[paper] [bibtex]

object detection
evaluation
fairness benchmarking

Humans, AI, and Context: Understanding End-Users' Trust in a Real-World Computer Vision Application

Sunnie S. Y. Kim, Elizabeth Anne Watkins, Olga Russakovsky, Ruth Fong and Andres Monroy-Hernandez

ACM Conference on Fairness, Accountability, and Transparency (FAccT), 2023.

[paper] [bibtex]

human-AI interaction
trust and reliance

Overlooked Factors in Concept-based Explanations: Dataset Choice, Concept Learnability, and Human Capability

Vikram V. Ramaswamy, Sunnie S. Y. Kim, Ruth Fong and Olga Russakovsky

Computer Vision and Pattern Recognition (CVPR), 2023.

[paper] [code] [bibtex]

faithfulness
explainability

"Help Me Help the AI": Understanding How Explainability Can Support Human-AI Interaction

Sunnie S. Y. Kim, Elizabeth Anne Watkins, Olga Russakovsky, Ruth Fong and Andres Monroy-Hernandez

ACM Conference on Human Factors in Computing Systems (CHI), 2023.

[paper] [supplement] [30-sec video] [10-min video] [bibtex]

human-AI interaction
transparency and explainability
2022

Siri: A simple selective retraining mechanism for transformer-based visual grounding

Mengxue Qu, Yu Wu, Wu Liu, Qiqi Gong, Xiaodan Liang, Olga Russakovsky, Yao Zhao and Yunchao Wei

European Conference on Computer Vision (ECCV), 2022.

[paper] [bibtex]

Visual grounding
Transformer
Generalization

ELUDE: Generating Interpretable Explanations via a Decomposition into Labelled and Unlabelled Features

Vikram V. Ramaswamy, Sunnie S. Y. Kim, Nicole Meister, Ruth Fong and Olga Russakovsky

arXiv preprint arXiv:2206.07690, 2022.

[paper] [bibtex]

interpretability
explainable AI
global explanations

Learning Actionness from Action/Background Discrimination

Ozge Yalcinkaya Simsek, Olga Russakovsky and Pinar Duygulu

Signal, Image and Video Processing (SIViP), 2022.

[paper] [bibtex]

Actionness
Action localization
Action segmentation
Video representation

Enabling Detailed Action Recognition Evaluation Through Video Dataset Augmentation

Jihoon Chung, Yu Wu and Olga Russakovsky

Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks Track, 2022.

[paper] [code] [bibtex]

human action recognition
background bias
segmentation

Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks

Zhiwei Deng and Olga Russakovsky

Neural Information Processing Systems (NeurIPS), 2022.

[paper] [code] [bibtex]

dataset distillation
memory addressing
continual learning

HIVE: Evaluating the Human Interpretability of Visual Explanations

Sunnie S. Y. Kim, Nicole Meister, Vikram V. Ramaswamy, Ruth Fong and Olga Russakovsky

European Conference on Computer Vision (ECCV), 2022.

[paper] [code] [website] [extended abstract] [2-min video] [8-min video] [bibtex]

human-AI interaction
transparency and explainability

Multi-Query Video Retrieval

Zeyu Wang, Yu Wu, Karthik Narasimhan and Olga Russakovsky

European Conference on Computer Vision (ECCV), 2022.

[paper] [code] [bibtex]

Video retrieval
Multi-query
Evaluation

Towards Intersectionality in Machine Learning: Including More Identities, Handling Underrepresentation, and Performing Evaluation

Angelina Wang, Vikram V. Ramaswamy and Olga Russakovsky

ACM Conference on Fairness, Accountability, and Transparency (FAccT), 2022.

[paper] [code] [bibtex]

intersectionality
machine learning fairness

CARETS: A Consistency And Robustness Evaluative Test Suite for VQA

Carlos E. Jimenez, Olga Russakovsky and Karthik Narasimhan

Association for Computational Linguistics (ACL), 2022.

[paper] [code] [bibtex]

multimodal model robustness
VQA

A Study of Face Obfuscation in ImageNet

Kaiyu Yang, Jacqueline Yau, Li Fei-Fei, Jia Deng and Olga Russakovsky

International Conference on Machine Learning (ICML), 2022.

[paper] [code] [project] [bibtex]

privacy-aware visual recognition
face attribute classification
detection
privacy protection
2021

Understanding and Evaluating Racial Biases in Image Captioning

Dora Zhao, Angelina Wang and Olga Russakovsky

International Conference on Computer Vision (ICCV), 2021.

[paper] [code] [website] [bibtex]

biases in image captioning
evaluating bias

Point and Ask: Incorporating Pointing into Visual Question Answering

Arjun Mani, Nobline Yoo, Will Hinthorn and Olga Russakovsky

Computer Vision and Pattern Recognition (CVPRW) Visual Question Answering Workshop, 2021.

[paper] [code] [website] [bibtex]

VQA
data
pointing
human supervision

Directional Bias Amplification

Angelina Wang and Olga Russakovsky

International Conference on Machine Learning (ICML), 2021.

[paper] [code] [bibtex]

directional bias amplification
causality

[Re] Don't Judge an Object by Its Context: Learning to Overcome Contextual Bias

Sunnie S. Y. Kim, Sharon Zhang, Nicole Meister and Olga Russakovsky

ML Reproducibility Challenge, 2020.

ReScience C, 2021.

[paper] [code] [bibtex]

data

Fair Attribute Classification through Latent Space De-biasing

Vikram V. Ramaswamy, Sunnie S. Y. Kim and Olga Russakovsky

Computer Vision and Pattern Recognition (CVPR), 2021.

[paper] [code] [website] [bibtex]

data augmentation
GANs
attribute bias
2020

Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation

Zhiwei Deng, Karthik Narasimhan and Olga Russakovsky

Neural Information Processing Systems (NeurIPS), 2020.

[paper] [bibtex]

vision and language navigation
long-range planning
graphical planning

CornerNet-Lite: Efficient Keypoint Based Object Detection

Hei Law, Yun Teng, Olga Russakovsky and Jia Deng

British Machine Vision Conference (BMVC), 2020.

[paper] [code] [bibtex]

key-point based detection
efficient object detection

REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets

Angelina Wang, Arvind Narayanan and Olga Russakovsky

European Conference on Computer Vision (ECCV), 2020.

[paper] [code] [video] [90-sec video] [10-min video] [bibtex]

computer vision datasets
bias mitigation

Towards Unique and Informative Captioning of Images

Zeyu Wang, Berthy Feng, Karthik Narasimhan and Olga Russakovsky

European Conference on Computer Vision (ECCV), 2020.

[paper] [code] [1-min video] [10-min video] [bibtex]

informative image captioning

Take The Scenic Route: Improving Generalization In Vision-and-language Navigation

Felix Yu, Zhiwei Deng, Karthik Narasimhan and Olga Russakovsky

Computer Vision and Pattern Recognition (CVPRW) Visual Learning with Limited Labels Workshop, 2020.

[paper] [code] [video] [bibtex]

action priors
Vision-and-Language Navigation
generalization

Towards Fairness In Visual Recognition: Effective Strategies For Bias Mitigation

Zeyu Wang, Klint Qinami, Ioannis C. Karakozis, Kyle Genova, Prem Nair, Kenji Hata and Olga Russakovsky

Computer Vision and Pattern Recognition (CVPR), 2020.

[paper] [code] [1-min video] [bibtex]

visual recognition benchmark
bias mitigation

Towards Fairer Datasets: Filtering And Balancing The Distribution Of The People Subtree In The Imagenet Hierarchy

Kaiyu Yang, Klint Qinami, Li Fei-Fei, Jia Deng and Olga Russakovsky

Conference on Fairness, Accountability and Transparency (FAT*), 2020.

[paper] [project] [Wired article] [bibtex]

dataset balancing
visual recognition
dataset bias
algorithmic fairness
2019

Spatialsense: An adversarially crowdsourced benchmark for spatial relation recognition

Kaiyu Yang, Olga Russakovsky and Jia Deng

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.

[paper] [bibtex]

spatial relationship recognition
benchmarking
adversarial crowdsourcing

Compositional Temporal Visual Grounding of Natural Language Event Descriptions

Jonathan Stroud, Ryan McCaffrey, Rada Mihalcea, Jia Deng and Olga Russakovsky

arxiv preprint arXiv:1912.02256, 2019.

[paper] [project] [bibtex]

video understanding
temporal grounding

Human Uncertainty Makes Classification More Robust

Joshua C. Peterson*, Ruairidh M. Battleday*, Thomas L. Griffiths and Olga Russakovsky

(* = equal contribution)

International Conference on Computer Vision (ICCV), 2019.

[paper] [bibtex]

uncertainty
cognition

An Adversarially Crowdsourced Benchmark For Spatial Relation Recognition

Kaiyu Yang, Olga Russakovsky and Jia Deng

International Conference on Computer Vision (ICCV), 2019.

[paper] [bibtex]

data
crowdsourced benchmark
2018

The more you look, the more you see: towards general object understanding through recursive refinement

Jingyan Wang, Olga Russakovsky and Deva Ramanan

Winter Conference on Applications of Computer Vision (WACV), 2018.

[paper] [code] [supplement] [bibtex]

object understanding
2017

What Actions Are Needed For Understanding Human Actions In Videos?

Gunnar Sigurdsson, Olga Russakovsky and Abhinav Gupta

International Conference on Computer Vision (ICCV), 2017.

[paper] [bibtex]

human action recognition

Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos

Serena Yeung, Olga Russakovsky, Ning Jin, Mykhaylo Andriluka, Greg Mori and Li Fei-Fei

International Journal of Computer Vision (IJCV), 2017.

[paper] [project] [bibtex]

data annotation

What's in a Question: Using Visual Questions as a Form of Supervision

Siddha Ganju, Olga Russakovsky and Abhinav Gupta

Computer Vision and Pattern Recognition (CVPR), 2017.

[paper] [project] [bibtex]

VQA

Predictive-Corrective Networks for Action Detection

Achal Dave, Olga Russakovsky and Deva Ramanan

Computer Vision and Pattern Recognition (CVPR), 2017.

[paper] [project] [bibtex]

action detection

Learning to Learn from Noisy Web Videos

Serena Yeung, Vignesh Ramanathan, Olga Russakovsky, Liyue Shen, Greg Mori and Li Fei-Fei

Computer Vision and Pattern Recognition (CVPR), 2017.

[paper] [poster] [bibtex]

video
human action recognition
2016

Crowdsourcing in Computer Vision

Adriana Kovashka, Olga Russakovsky, Li Fei-Fei and Kristen Grauman

Foundation and Trends in Computer Vision and Graphics, 2016.

[paper] [bibtex]

data annotation

Much Ado About Time: Exhaustive Annotation of Temporal Data

Gunnar A. Sigurdsson, Olga Russakovsky, Ali Farhadi, Ivan Laptev and Abhinav Gupta

Conference on Human Computation and Crowdsourcing (HCOMP), 2016.

[paper] [project] [poster] [slides key] [slides pdf] [bibtex]

data annotation

What's the Point: Semantic Segmentation with Point Supervision

Amy Bearman, Olga Russakovsky, Vittorio Ferrari and Li Fei-Fei

European Conference on Computer Vision (ECCV), 2016.

[paper] [project] [bibtex]

semantic segmentation

End-to-end Learning of Action Detection from Frame Glimpses in Videos

Serena Yeung, Olga Russakovsky, Greg Mori and Li Fei-Fei

Computer Vision and Pattern Recognition (CVPR), 2016.

[paper] [project] [bibtex]

video
human action recognition

Towards More Gender Diversity in CS through an Artificial Intelligence Summer Program for High School Girls

Marie E. Vachovsky, Grace Wu, Sorathan Chaturapruek, Olga Russakovsky, Rick Sommer and Li Fei-Fei

Special Interest Group on Computer Science Education (SIGCSE), 2016.

[paper] [SAILORS camp homepage] [Wired article] [bibtex]

outreach
2015

Scaling up Object Detection

Olga Russakovsky

PhD Thesis, Stanford University, 2015.

[paper] [poster] [bibtex]

object detection
PhD thesis

Best of both worlds: human-machine collaboration for object annotation

Olga Russakovsky, Li-Jia Li and Li Fei-Fei

Computer Vision and Pattern Recognition (CVPR), 2015.

[paper] [project] [bibtex]

human-in-the-loop
data annotation

Joint calibration of Ensemble of Exemplar SVMs

Davide Modolo, Alexander Vezhnevets, Olga Russakovsky and Vittorio Ferrari

Computer Vision and Pattern Recognition (CVPR), 2015.

[paper] [code] [bibtex]

SVMs

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky*, Jia Deng*, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander Berg and Li Fei-Fei

(* = equal contribution)

International Journal of Computer Vision (IJCV), 2015.

[paper] [paper content on arxiv] [browse detection data] [attribute annotations] [ILSVRC homepage] [bibtex]

data collection
2014

Scalable Multi-Label Annotation

Jia Deng, Olga Russakovsky, Jonathan Krause, Michael Bernstein, Alexander Berg and Li Fei-Fei

ACM Conference on Human Factors in Computing Systems (CHI), 2014.

[paper] [slides] [bibtex]

data annotation
2013

Detecting avocados to zucchinis: what have we done, and where are we going?

Olga Russakovsky, Jia Deng, Zhiheng Huang, Alexander Berg and Li Fei-Fei

International Conference on Computer Vision (ICCV), 2013.

[paper] [supplement] [additional analysis] [attribute annotations] [poster] [poster of talk at BAVM] [slides pptx] [slides pdf] [bibtex]

data analysis
2012

Object-centric spatial pooling for image classification

Olga Russakovsky, Yuanqing Lin, Kai Yu and Li Fei-Fei

European Conference on Computer Vision (ECCV), 2012.

[paper] [poster] [slides pptx] [slides pdf] [30-sec spotlight] [FAQ] [bibtex]

image classification
2010

Attribute learning in large-scale data

Olga Russakovsky and Li Fei-Fei

European Conference on Computer Vision (ECCVW) Parts and Attributes Workshop, 2010.

[paper] [slides odp] [slides pdf] [data] [bibtex]

image classification

A Steiner tree approach to efficient object detection

Olga Russakovsky and Andrew Y. Ng

Computer Vision and Pattern Recognition (CVPR), 2010.

[paper] [poster] [data] [bibtex]

object detection

Autonomous operation of novel elevators for robot navigation

Ellen Klingbeil, Blake Carpenter, Olga Russakovsky and Andrew Y. Ng

International Conference on Robotics and Automation (ICRA), 2010.

[paper] [bibtex]

robotics

STanford AI Robot (STAIR) Vision Library

Stephen Gould, Olga Russakovsky, Ian Goodfellow, Paul Baumstarck, Andrew Y. Ng and Daphne Koller

http://ai.stanford.edu/~sgould/svl, 2010.

[code] [project] [bibtex]

robotics
2007

Training Conditional Random Fields for maximum labelwise accuracy

Samuel S. Gross, Olga Russakovsky, Chuong B. Do and Serafim Batzoglou

Advances in Neural Information Processing Systems (NeurIPS), 2007.

[paper] [bibtex]

labelwise accuracy