Floyd et al., 2011 - Google Patents

Building learning by observation agents using jloaf

Floyd et al., 2011

View PDF
Document ID
2290192942248182035
Author
Floyd M
Esfandiari B
Publication year
Publication venue
Workshop on Case-Based Reasoning for Computer Games: 19th international conference on Case-Based Reasoning,(Figure 1)

External Links

Snippet

The environments an agent is situated in or the behaviours it is required to perform may change over time. Ideally, an agent should be able to move to a new domain without requiring significant changes from the agent's designer. We describe our framework jLOAF …
Continue reading at sce.carleton.ca (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50Computer-aided design
    • G06F17/5009Computer-aided design using simulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/004Artificial life, i.e. computers simulating life
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/39Robotics, robotics to robotics hand
    • G05B2219/39376Hierarchical, learning, recognition and skill level and adaptation servo level
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B17/00Systems involving the use of models or simulators of said systems
    • G05B17/02Systems involving the use of models or simulators of said systems electric
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00Teaching not covered by other main groups of this subclass

Similar Documents

Publication Publication Date Title
Ding et al. Challenges of reinforcement learning
US11062617B2 (en) Training system for autonomous driving control policy
Reddy et al. Where do you think you're going?: Inferring beliefs about dynamics from behavior
Doncieux et al. Evolutionary robotics: what, why, and where to
CN111144580B (en) Hierarchical reinforcement learning training method and device based on imitation learning
Guerin Learning like a baby: a survey of artificial intelligence approaches
CN110516389B (en) Behavior control strategy learning method, device, equipment and storage medium
Floyd et al. A case-based reasoning framework for developing agents using learning by observation
US20220366246A1 (en) Controlling agents using causally correct environment models
Gym et al. Deep reinforcement learning with python
EP2363251A1 (en) Robot with Behavioral Sequences on the basis of learned Petri Net Representations
Hafez et al. Efficient intrinsically motivated robotic grasping with learning-adaptive imagination in latent space
Ollington et al. Incorporating expert advice into reinforcement learning using constructive neural networks
Floyd et al. Building learning by observation agents using jloaf
Rabault et al. 18 Deep Reinforcement Learning Applied to Active Flow Control
Conde et al. Behavioral animation of autonomous virtual agents helped by reinforcement learning
Lee et al. Combining GRN modeling and demonstration-based programming for robot control
Stulp et al. Combining declarative, procedural, and predictive knowledge to generate, execute, and optimize robot plans
Holtman AGI agent safety by iteratively improving the utility function
JP6360197B2 (en) System and method for recognition-based processing of knowledge
Romero et al. Developmental Learning of Value Functions in a Motivational System for Cognitive Robotics
Montana et al. Towards a unified framework for learning from observation
Sheh Learning robot behaviours by observing and envisaging
Yang Robotic object manipulation via hierarchical and affordance learning
Floyd et al. Creation of devs models using imitation learning