Yavas et al., 2020 - Google Patents
A new approach for tactical decision making in lane changing: Sample efficient deep Q learning with a safety feedback rewardYavas et al., 2020
View PDF- Document ID
- 7344800501508030651
- Author
- Yavas U
- Kumbasar T
- Ure N
- Publication year
- Publication venue
- 2020 IEEE Intelligent Vehicles Symposium (IV)
External Links
Snippet
Automated lane change is one of the most challenging task to be solved of highly automated vehicles due to its safety-critical, uncertain and multi-agent nature. This paper presents the novel deployment of the state of art Q learning method, namely Rainbow DQN, that uses a …
- 238000004088 simulation 0 abstract description 9
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/02—Computer systems based on specific mathematical models using fuzzy logic
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/0285—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks and fuzzy logic
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yavas et al. | A new approach for tactical decision making in lane changing: Sample efficient deep Q learning with a safety feedback reward | |
CN112888612B (en) | Automated driving vehicle planning | |
US11093829B2 (en) | Interaction-aware decision making | |
Alizadeh et al. | Automated lane change decision making using deep reinforcement learning in dynamic and uncertain highway environment | |
Hoel et al. | Automated speed and lane change decision making using deep reinforcement learning | |
Saxena et al. | Driving in dense traffic with model-free reinforcement learning | |
Li et al. | Game theoretic modeling of driver and vehicle interactions for verification and validation of autonomous vehicle control systems | |
Naveed et al. | Trajectory planning for autonomous vehicles using hierarchical reinforcement learning | |
US11465650B2 (en) | Model-free reinforcement learning | |
US11884302B2 (en) | Social behavior for autonomous vehicles | |
Makantasis et al. | Deep reinforcement‐learning‐based driving policy for autonomous road vehicles | |
Aradi et al. | Policy gradient based reinforcement learning approach for autonomous highway driving | |
Wang et al. | Autonomous ramp merge maneuver based on reinforcement learning with continuous action space | |
Dong et al. | Interactive ramp merging planning in autonomous driving: Multi-merging leading PGM (MML-PGM) | |
Garzón et al. | Game theoretic decision making for autonomous vehicles’ merge manoeuvre in high traffic scenarios | |
Onieva et al. | A multi-objective evolutionary algorithm for the tuning of fuzzy rule bases for uncoordinated intersections in autonomous driving | |
Wang et al. | High-level decision making for automated highway driving via behavior cloning | |
Ding et al. | Game-theoretic cooperative lane changing using data-driven models | |
Li et al. | An explicit decision tree approach for automated driving | |
Li et al. | Interaction-aware behavior planning for autonomous vehicles validated with real traffic data | |
Alighanbari et al. | Deep reinforcement learning with nmpc assistance nash switching for urban autonomous driving | |
Liu et al. | Cooperation-aware decision making for autonomous vehicles in merge scenarios | |
US20230162539A1 (en) | Driving decision-making method and apparatus and chip | |
Kamran et al. | High-level decisions from a safe maneuver catalog with reinforcement learning for safe and cooperative automated merging | |
CN117585017A (en) | Automatic driving vehicle lane change decision method, device, equipment and storage medium |