Building learning by observation agents using jLOAF
Floyd et al., 2011
- Document ID
- 2290192942248182035
- Author
- Floyd M
- Esfandiari B
- Publication year
- 2011
- Publication venue
- Workshop on Case-Based Reasoning for Computer Games: 19th International Conference on Case-Based Reasoning
Snippet
The environments an agent is situated in or the behaviours it is required to perform may change over time. Ideally, an agent should be able to move to a new domain without requiring significant changes from the agent's designer. We describe our framework jLOAF …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—NC systems
- G05B2219/39—Robotics, robotics to robotics hand
- G05B2219/39376—Hierarchical, learning, recognition and skill level and adaptation servo level
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B19/00—Teaching not covered by other main groups of this subclass
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ding et al. | | Challenges of reinforcement learning |
US11062617B2 (en) | | Training system for autonomous driving control policy |
Reddy et al. | | Where do you think you're going?: Inferring beliefs about dynamics from behavior |
Doncieux et al. | | Evolutionary robotics: what, why, and where to |
CN111144580B (en) | | Hierarchical reinforcement learning training method and device based on imitation learning |
Guerin | | Learning like a baby: a survey of artificial intelligence approaches |
CN110516389B (en) | | Behavior control strategy learning method, device, equipment and storage medium |
Floyd et al. | | A case-based reasoning framework for developing agents using learning by observation |
US20220366246A1 (en) | | Controlling agents using causally correct environment models |
Gym et al. | | Deep reinforcement learning with python |
EP2363251A1 (en) | | Robot with Behavioral Sequences on the basis of learned Petri Net Representations |
Hafez et al. | | Efficient intrinsically motivated robotic grasping with learning-adaptive imagination in latent space |
Ollington et al. | | Incorporating expert advice into reinforcement learning using constructive neural networks |
Floyd et al. | | Building learning by observation agents using jLOAF |
Rabault et al. | | 18 Deep Reinforcement Learning Applied to Active Flow Control |
Conde et al. | | Behavioral animation of autonomous virtual agents helped by reinforcement learning |
Lee et al. | | Combining GRN modeling and demonstration-based programming for robot control |
Stulp et al. | | Combining declarative, procedural, and predictive knowledge to generate, execute, and optimize robot plans |
Holtman | | AGI agent safety by iteratively improving the utility function |
JP6360197B2 (en) | | System and method for recognition-based processing of knowledge |
Romero et al. | | Developmental Learning of Value Functions in a Motivational System for Cognitive Robotics |
Montana et al. | | Towards a unified framework for learning from observation |
Sheh | | Learning robot behaviours by observing and envisaging |
Yang | | Robotic object manipulation via hierarchical and affordance learning |
Floyd et al. | | Creation of DEVS models using imitation learning |