Planning for cars that coordinate with people: leveraging effects on human actions for planning and active information gathering over human internal state

Dorsa Sadigh; Nick Landolfi; Shankar Sastry; Sanjit Seshia; Anca Dragan

doi:10.1007/s10514-018-9746-1

Loading next page...

References (14)

C Urmson, J Anhalt, D Bagnell, C Baker, R Bittner, M Clark, J Dolan, D Duggins, T Galatali, C Geyer (2008)
Autonomous driving in urban environments: Boss and the urban challenge
Journal of Field Robotics, 25
P Trautman (2013)
Robot navigation in dense crowds: Statistical models and experimental studies of human robot cooperation
A-A Agha-Mohammadi, S Chakravorty, NM Amato (2014)
FIRM: Sampling-based feedback motion-planning under motion uncertainty and imperfect measurements
The International Journal of Robotics Research, 33
M Dissanayake, P Newman, S Clark, HF Durrant-Whyte, M Csorba (2001)
A solution to the simultaneous localization and map building (SLAM) problem
IEEE Transactions on Robotics and Automation, 17
P Falcone, F Borrelli, HE Tseng, J Asgari, D Hrovat (2007)
Integrated braking and steering model predictive control approach in autonomous vehicles
Advances in Automotive Control, 5
S Prentice, N Roy (2009)
The belief roadmap: Efficient planning in belief space by factoring the covariance
The International Journal of Robotics Research, 28
T Hedden, J Zhang (2002)
What do you think i think you think?: Strategic reasoning in matrix games
Cognition, 85
P Falcone, F Borrelli, J Asgari, HE Tseng, D Hrovat (2007)
Predictive active steering control for autonomous vehicle systems
IEEE Transactions on Control Systems Technology, 15
RJ Aumann, M Maschler, RE Stearns (1995)
Repeated games with incomplete information
EA Hansen, DS Bernstein, S Zilberstein (2004)
Dynamic programming for partially observable stochastic games
AAAI, 4
P Falcone, HE Tseng, F Borrelli, J Asgari, D Hrovat (2008)
MPC-based yaw and lateral stabilisation via active front steering and braking
Vehicle System Dynamics, 46
DS Bernstein, R Givan, N Immerman, S Zilberstein (2002)
The complexity of decentralized control of Markov decision processes
Mathematics of Operations Research, 27
EF Camacho, CB Alba (2013)
Model predictive control
J Leonard, J How, S Teller, M Berger, S Campbell, G Fiore, L Fletcher, E Frazzoli, A Huang, S Karaman (2008)
A perception-driven autonomous urban vehicle
Journal of Field Robotics, 25

Publisher: Springer Journals
Copyright: Copyright © 2018 by Springer Science+Business Media, LLC, part of Springer Nature
Subject: Engineering; Robotics and Automation; Artificial Intelligence (incl. Robotics); Computer Imaging, Vision, Pattern Recognition and Graphics; Control, Robotics, Mechatronics
ISSN: 0929-5593
eISSN: 1573-7527
DOI: 10.1007/s10514-018-9746-1
Publisher site: See Article on Publisher Site

Abstract

Traditionally, autonomous cars treat human-driven vehicles like moving obstacles. They predict their future trajectories and plan to stay out of their way. While physically safe, this results in defensive and opaque behaviors. In reality, an autonomous car’s actions will actually affect what other cars will do in response, creating an opportunity for coordination. Our thesis is that we can leverage these responses to plan more efficient and communicative behaviors. We introduce a formulation of interaction with human-driven vehicles as an underactuated dynamical system, in which the robot’s actions have consequences on the state of the autonomous car, but also on the human actions and thus the state of the human-driven car. We model these consequences by approximating the human’s actions as (noisily) optimal with respect to some utility function. The robot uses the human actions as observations of her underlying utility function parameters. We first explore learning these parameters offline, and show that a robot planning in the resulting underactuated system is more efficient than when treating the person as a moving obstacle. We also show that the robot can target specific desired effects, like getting the person to switch lanes or to proceed first through an intersection. We then explore estimating these parameters online, and enable the robot to perform active information gathering: generating actions that purposefully probe the human in order to clarify their underlying utility parameters, like driving style or attention level. We show that this significantly outperforms passive estimation and improves efficiency. Planning in our model results in coordination behaviors: the robot inches forward at an intersection to see if can go through, or it reverses to make the other car proceed first. These behaviors result from the optimization, without relying on hand-coded signaling strategies. Our user studies support the utility of our model when interacting with real users.

Journal

Autonomous Robots – Springer Journals

Published: May 4, 2018

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Planning for cars that coordinate with people: leveraging effects on human actions for planning and active information gathering over human internal state

Planning for cars that coordinate with people: leveraging effects on human actions for planning and active information gathering over human internal state

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Planning for cars that coordinate with people: leveraging effects on human actions for planning and active information gathering over human internal state

Planning for cars that coordinate with people: leveraging effects on human actions for planning and active information gathering over human internal state

References (14)

Abstract

Journal

Recommended Articles

There are no references for this article.

Our policy towards the use of cookies