Publications

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
Anthony Liang, Guy Tennenholtz, Chih-wei Hsu, Yinlam Chow, Erdem Bıyık, Craig Boutilier
Conference on Neural Information Processing Systems (NeurIPS), December 2024
Trajectory Improvement and Reward Learning from Comparative Language Feedback
Zhaojing Yang, Miru Jun, Jeremy Tien, Stuart J. Russell, Anca Dragan, Erdem Bıyık
Proceedings of the 8th Conference on Robot Learning (CoRL), November 2024

Also presented at HRI Human-Interactive Robot Learning Workshop, March 2024 (PDF).
EXTRACT: Efficient Policy Learning by Extracting Transferable Robot Skills from Offline Data
Jesse Zhang, Minho Heo, Zuxin Liu, Erdem Bıyık, Joseph J Lim, Yao Liu, Rasool Fakoor
Proceedings of the 8th Conference on Robot Learning (CoRL), November 2024
Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree
Harbani Jaggi*, Kashyap Murali*, Eve Fleisig, Erdem Bıyık
Conference on Empirical Methods in Natural Language Processing (EMNLP), November 2024

* denotes equal contribution.
ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
Anthony Liang, Jesse Thomason, Erdem Bıyık
International Conference on Intelligent Robots and Systems (IROS), October 2024

Also presented at ICRA Pretraining for Robotics Workshop, May 2023 (PDF).
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain, Norio Kosaka, Xinhu Li, Kyung-Min Kim, Erdem Bıyık, Joseph J. Lim
arXiv preprint, October 2024
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
Yufei Wang*, Zhanyi Sun*, Jesse Zhang, Zhou Xian, Erdem Bıyık, David Held†, Zackory Erickson†
International Conference on Machine Learning (ICML), July 2024

* denotes equal contribution. † denotes equal advising.
Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation
Michelle Pan*, Mariah Schrum*, Vivek Myers, Erdem Bıyık, Anca Dragan
International Conference on Machine Learning (ICML), July 2024

* denotes equal contribution.
Foundation Models for Embodied AI
Sumedh Anand Sontakke
CS Department, University of Southern California, May 2024

Ph.D. Dissertation
Batch Active Learning of Reward Functions from Human Preferences
Erdem Bıyık, Nima Anari, Dorsa Sadigh
ACM Transactions on Human-Robot Interaction (THRI), 2024
A Generalized Acquisition Function for Preference-based Reward Learning
Evan Ellis, Gaurav R. Ghosal, Stuart J. Russell, Anca Dragan, Erdem Bıyık
International Conference on Robotics and Automation (ICRA), May 2024
Active Preference-Based Gaussian Process Regression for Reward Learning and Optimization
Erdem Bıyık, Nicolas Huynh, Mykel J. Kochenderfer, Dorsa Sadigh
International Journal of Robotics Research (IJRR), 2024
Preference Elicitation with Soft Attributes in Interactive Recommendation
Erdem Bıyık, Fan Yao, Yinlam Chow, Alex Haig, Chih-wei Hsu, Mohammad Ghavamzadeh, Craig Boutilier
arXiv preprint, November 2023
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
Sumedh A. Sontakke, Jesse Zhang, Sébastien M. R. Arnold, Karl Pertsch, Erdem Bıyık, Dorsa Sadigh, Chelsea Finn, Laurent Itti
Conference on Neural Information Processing Systems (NeurIPS), December 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper*, Xander Davies*, et al.
Transactions on Machine Learning Research (TMLR), 2023

* denotes equal contribution.
Active Reward Learning from Online Preferences
Vivek Myers, Erdem Bıyık, Dorsa Sadigh
International Conference on Robotics and Automation (ICRA), May 2023
Assistive Teaching of Motor Control Tasks to Humans
Megha Srivastava, Erdem Bıyık, Suvir Mirchandani, Noah Goodman, Dorsa Sadigh
Conference on Neural Information Processing Systems (NeurIPS), November 2022
How do People Incorporate Advice from Artificial Agents when Making Physical Judgments?
Erik Brockbank*, Haoliang Wang*, Justin Yang, Suvir Mirchandani, Erdem Bıyık, Dorsa Sadigh, Judith Fan
Cognitive Science Society Conference (CogSci), July 2022

* denotes equal contribution.
Oral presentation.
Learning Preferences For Interactive Autonomy
Erdem Bıyık
EE Department, Stanford University, May 2022

Ph.D. Dissertation
Leveraging Smooth Attention Prior for Multi-Agent Trajectory Prediction
Zhangjie Cao, Erdem Bıyık, Guy Rosman, Dorsa Sadigh
International Conference on Robotics and Automation (ICRA), May 2022
APReL: A Library for Active Preference-based Reward Learning Algorithms
Erdem Bıyık, Aditi Talati, Dorsa Sadigh
17th ACM/IEEE International Conference on Human-Robot Interaction (HRI), March 2022

Also presented at Artificial Intelligence for Human-Robot Interaction (AI-HRI) at AAAI Fall Symposium Series, November 2021 (PDF).
Learning from Humans for Adaptive Interaction
Erdem Bıyık
The 17th Annual Human-Robot Interaction Pioneers Workshop (HRI Pioneers), March 2022
Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams
Erdem Bıyık, Anusha Lalitha, Rajarshi Saha, Andrea Goldsmith, Dorsa Sadigh
Proceedings of the 36th AAAI Conference on Artificial Intelligence, February 2022

Also presented at Artificial Intelligence for Human-Robot Interaction (AI-HRI) at AAAI Fall Symposium Series, November 2021 (PDF).
Oral presentation.
Learning Multimodal Rewards from Rankings
Vivek Myers, Erdem Bıyık, Nima Anari, Dorsa Sadigh
Proceedings of the 5th Conference on Robot Learning (CoRL), November 2021

Oral presentation.
Learning Reward Functions from Scale Feedback
Nils Wilde*, Erdem Bıyık*, Dorsa Sadigh, Stephen L. Smith
Proceedings of the 5th Conference on Robot Learning (CoRL), November 2021

* denotes equal contribution.
Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences
Erdem Bıyık, Dylan P. Losey, Malayandi Palan, Nicholas C. Landolfi, Gleb Shevchuk, Dorsa Sadigh
The International Journal of Robotics Research (IJRR), 2021
Learning How to Dynamically Route Autonomous Vehicles on Shared Roads
Daniel A. Lazar*, Erdem Bıyık*, Dorsa Sadigh, Ramtin Pedarsani
Transportation Research Part C: Emerging Technologies (TR_C), September 2021

* denotes equal contribution.
Emergent Prosociality in Multi-Agent Games Through Gifting
Woodrow Z. Wang*, Mark Beliaev*, Erdem Bıyık*, Daniel A. Lazar, Ramtin Pedarsani, Dorsa Sadigh
30th International Joint Conference on Artificial Intelligence (IJCAI), August 2021

* denotes equal contribution.
Incentivizing Efficient Equilibria in Traffic Networks with Mixed Autonomy
Erdem Bıyık*, Daniel A. Lazar*, Ramtin Pedarsani, Dorsa Sadigh
IEEE Transactions on Control of Network Systems (TCNS), 2021

* denotes equal contribution.
ROIAL: Region of Interest Active Learning for Characterizing Exoskeleton Gait Preference Landscapes
Kejun Li, Maegan Tucker, Erdem Bıyık, Ellen Novoseller, Joel W. Burdick, Yanan Sui, Dorsa Sadigh, Yisong Yue, Aaron D. Ames
International Conference on Robotics and Automation (ICRA), May 2021
Incentivizing Routing Choices for Safe and Efficient Transportation in the Face of the COVID-19 Pandemic
Mark Beliaev, Erdem Bıyık, Daniel A. Lazar, Woodrow Z. Wang, Dorsa Sadigh, Ramtin Pedarsani
12th ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS), May 2021
Multi-Agent Safe Planning with Gaussian Processes
Zheqing Zhu, Erdem Bıyık, Dorsa Sadigh
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), October 2020
Active Preference-Based Gaussian Process Regression for Reward Learning
Erdem Bıyık*, Nicolas Huynh*, Mykel J. Kochenderfer, Dorsa Sadigh
Proceedings of Robotics: Science and Systems (RSS), July 2020

* denotes equal contribution.
Reinforcement Learning based Control of Imitative Policies for Near-Accident Driving
Zhangjie Cao*, Erdem Bıyık*, Woodrow Z. Wang, Allan Raventos, Adrien Gaidon, Guy Rosman, Dorsa Sadigh
Proceedings of Robotics: Science and Systems (RSS), July 2020

* denotes equal contribution.
Emergent Correlated Equilibrium through Synchronized Exploration
Mark Beliaev*, Woodrow Z. Wang*, Daniel A. Lazar, Erdem Bıyık, Dorsa Sadigh, Ramtin Pedarsani
RSS 2020 Workshop on Emergent Behaviors in Human-Robot Systems, July 2020
When Humans Aren't Optimal: Robots that Collaborate with Risk-Aware Humans
Minae Kwon, Erdem Bıyık, Aditi Talati, Karan Bhasin, Dylan P. Losey, Dorsa Sadigh
ACM/IEEE International Conference on Human-Robot Interaction (HRI), March 2020

Also presented at Cooperative AI NeurIPS Workshop 2021, December 2021 (PDF).
Honorable mention award.
The Green Choice: Learning and Influencing Human Decisions on Shared Roads
Erdem Bıyık, Daniel A. Lazar, Dorsa Sadigh, Ramtin Pedarsani
Proceedings of the 58th IEEE Conference on Decision and Control (CDC), December 2019
Active Learning of Reward Dynamics from Hierarchical Queries
Chandrayee Basu, Erdem Bıyık, Zhixun He, Mukesh Singhal, Dorsa Sadigh
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), November 2019
Asking Easy Questions: A User-Friendly Approach to Active Reward Learning
Erdem Bıyık, Malayandi Palan, Nicholas C. Landolfi, Dylan P. Losey, Dorsa Sadigh
Proceedings of the 3rd Conference on Robot Learning (CoRL), October 2019
Efficient and Safe Exploration in Deterministic Markov Decision Processes with Unknown Transition Models
Erdem Bıyık*, Jonathan Margoliash*, Shahrouz R. Alimo, Dorsa Sadigh
Proceedings of the American Control Conference (ACC), July 2019

* denotes equal contribution.
Batch Active Learning Using Determinantal Point Processes
Erdem Bıyık, Kenneth Wang, Nima Anari, Dorsa Sadigh
arXiv preprint, June 2019
Altruistic Autonomy: Beating Congestion on Shared Roads
Erdem Bıyık*, Daniel A. Lazar*, Ramtin Pedarsani, Dorsa Sadigh
Proceedings of the 13th International Workshop on Algorithmic Foundations of Robotics (WAFR), December 2018

* denotes equal contribution.
Batch Active Preference-Based Learning of Reward Functions
Erdem Bıyık, Dorsa Sadigh
Proceedings of the 2nd Conference on Robot Learning (CoRL), October 2018

Oral presentation.
Real-Time Detection, Tracking and Classification of Multiple Moving Objects in UAV Videos
Hüseyin C. Baykara*, Erdem Bıyık*, Gamze Gül*, Deniz Onural*, Ahmet S. Öztürk*, İlkay Yıldız*
International Conference on Tools with Artificial Intelligence (ICTAI), November 2017

* denotes equal contribution.