Yavas et al., 2020 - Google Patents

A new approach for tactical decision making in lane changing: Sample efficient deep Q learning with a safety feedback reward

Yavas et al., 2020

View PDF
Document ID
7344800501508030651
Author
Yavas U
Kumbasar T
Ure N
Publication year
Publication venue
2020 IEEE Intelligent Vehicles Symposium (IV)

External Links

Snippet

Automated lane change is one of the most challenging task to be solved of highly automated vehicles due to its safety-critical, uncertain and multi-agent nature. This paper presents the novel deployment of the state of art Q learning method, namely Rainbow DQN, that uses a …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/08Learning methods
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computer systems based on specific mathematical models
    • G06N7/02Computer systems based on specific mathematical models using fuzzy logic
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/0285Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks and fuzzy logic
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/12Computer systems based on biological models using genetic models
    • G06N3/126Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition

Similar Documents

Publication Publication Date Title
Yavas et al. A new approach for tactical decision making in lane changing: Sample efficient deep Q learning with a safety feedback reward
CN112888612B (en) Automated driving vehicle planning
US11093829B2 (en) Interaction-aware decision making
Alizadeh et al. Automated lane change decision making using deep reinforcement learning in dynamic and uncertain highway environment
Hoel et al. Automated speed and lane change decision making using deep reinforcement learning
Saxena et al. Driving in dense traffic with model-free reinforcement learning
Li et al. Game theoretic modeling of driver and vehicle interactions for verification and validation of autonomous vehicle control systems
Naveed et al. Trajectory planning for autonomous vehicles using hierarchical reinforcement learning
US11465650B2 (en) Model-free reinforcement learning
US11884302B2 (en) Social behavior for autonomous vehicles
Makantasis et al. Deep reinforcement‐learning‐based driving policy for autonomous road vehicles
Aradi et al. Policy gradient based reinforcement learning approach for autonomous highway driving
Wang et al. Autonomous ramp merge maneuver based on reinforcement learning with continuous action space
Dong et al. Interactive ramp merging planning in autonomous driving: Multi-merging leading PGM (MML-PGM)
Garzón et al. Game theoretic decision making for autonomous vehicles’ merge manoeuvre in high traffic scenarios
Onieva et al. A multi-objective evolutionary algorithm for the tuning of fuzzy rule bases for uncoordinated intersections in autonomous driving
Wang et al. High-level decision making for automated highway driving via behavior cloning
Ding et al. Game-theoretic cooperative lane changing using data-driven models
Li et al. An explicit decision tree approach for automated driving
Li et al. Interaction-aware behavior planning for autonomous vehicles validated with real traffic data
Alighanbari et al. Deep reinforcement learning with nmpc assistance nash switching for urban autonomous driving
Liu et al. Cooperation-aware decision making for autonomous vehicles in merge scenarios
US20230162539A1 (en) Driving decision-making method and apparatus and chip
Kamran et al. High-level decisions from a safe maneuver catalog with reinforcement learning for safe and cooperative automated merging
CN117585017A (en) Automatic driving vehicle lane change decision method, device, equipment and storage medium