Huskarl

Huskarl is a framework for deep reinforcement learning focused on research and fast prototyping. It's built on TensorFlow 2.0 and uses the tf.keras API when possible for conciseness and readability.

Huskarl makes it easy to parallelize computation of environment dynamics across multiple CPUs. This is useful for speeding up on-policy learning algorithms that benefit from multiple concurrent sources of experience such as A2C or PPO. It is specially useful for computationally intensive environments such as physics-based ones.

Huskarl works seamlessly with OpenAI Gym environments.

There are plans to support multi-agent environments and Unity3D environments.

Algorithms

Several algorithms are implemented already and many more are planned.

Installation

Since TensorFlow 2.0 is not officially out yet you need to install it and other dependencies manually for now:

pip install tf-nightly-2.0-preview
pip install cloudpickle
pip install scipy
pip install huskarl --no-deps

Citing

If you use Huskarl in your research, you can cite it as follows:

@misc{salvadori2019huskarl,
    author = {Daniel Salvadori},
    title = {huskarl},
    year = {2019},
    publisher = {GitHub},
    journal = {GitHub repository},
    howpublished = {\url{https://github.com/danaugrs/huskarl}},
}

About

hùskarl in Old Norse means a warrior who works in his/her lord's service.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
examples		examples
huskarl		huskarl
LICENSE		LICENSE
README.md		README.md
logo.png		logo.png
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Huskarl

Algorithms

Installation

Citing

About

About

Releases

Packages

Languages

License

jacksparal/huskarl

Folders and files

Latest commit

History

Repository files navigation

Huskarl

Algorithms

Installation

Citing

About

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages