Yavas et al., 2020 - Google Patents

A new approach for tactical decision making in lane changing: Sample efficient deep Q learning with a safety feedback reward

Yavas et al., 2020

View PDF

Document ID: 7344800501508030651
Author: Yavas U; Kumbasar T; Ure N
Publication year: 2020
Publication venue: 2020 IEEE Intelligent Vehicles Symposium (IV)

External Links

Cited by

Snippet

Automated lane change is one of the most challenging task to be solved of highly automated vehicles due to its safety-critical, uncertain and multi-agent nature. This paper presents the novel deployment of the state of art Q learning method, namely Rainbow DQN, that uses a …

Continue reading at arxiv.org (PDF) (other versions)

238000004088 simulation 0 abstract description 9

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/02—Computer systems based on specific mathematical models using fuzzy logic
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/0285—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks and fuzzy logic
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition

Similar Documents

Publication	Publication Date	Title
Yavas et al.	2020	A new approach for tactical decision making in lane changing: Sample efficient deep Q learning with a safety feedback reward
CN112888612B (en)	2024-11-01	Automated driving vehicle planning
US11093829B2 (en)	2021-08-17	Interaction-aware decision making
Alizadeh et al.	2019	Automated lane change decision making using deep reinforcement learning in dynamic and uncertain highway environment
Hoel et al.	2018	Automated speed and lane change decision making using deep reinforcement learning
Saxena et al.	2020	Driving in dense traffic with model-free reinforcement learning
Li et al.	2017	Game theoretic modeling of driver and vehicle interactions for verification and validation of autonomous vehicle control systems
Naveed et al.	2021	Trajectory planning for autonomous vehicles using hierarchical reinforcement learning
US11465650B2 (en)	2022-10-11	Model-free reinforcement learning
US11884302B2 (en)	2024-01-30	Social behavior for autonomous vehicles
Makantasis et al.	2020	Deep reinforcement‐learning‐based driving policy for autonomous road vehicles
Aradi et al.	2018	Policy gradient based reinforcement learning approach for autonomous highway driving
Wang et al.	2018	Autonomous ramp merge maneuver based on reinforcement learning with continuous action space
Dong et al.	2017	Interactive ramp merging planning in autonomous driving: Multi-merging leading PGM (MML-PGM)
Garzón et al.	2019	Game theoretic decision making for autonomous vehicles’ merge manoeuvre in high traffic scenarios
Onieva et al.	2015	A multi-objective evolutionary algorithm for the tuning of fuzzy rule bases for uncoordinated intersections in autonomous driving
Wang et al.	2022	High-level decision making for automated highway driving via behavior cloning
Ding et al.	2018	Game-theoretic cooperative lane changing using data-driven models
Li et al.	2017	An explicit decision tree approach for automated driving
Li et al.	2020	Interaction-aware behavior planning for autonomous vehicles validated with real traffic data
Alighanbari et al.	2022	Deep reinforcement learning with nmpc assistance nash switching for urban autonomous driving
Liu et al.	2021	Cooperation-aware decision making for autonomous vehicles in merge scenarios
US20230162539A1 (en)	2023-05-25	Driving decision-making method and apparatus and chip
Kamran et al.	2021	High-level decisions from a safe maneuver catalog with reinforcement learning for safe and cooperative automated merging
CN117585017A (en)	2024-02-23	Automatic driving vehicle lane change decision method, device, equipment and storage medium