Popular repositories Loading
-
nanoChatGPT
nanoChatGPT PublicForked from karpathy/nanoGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
-
baselines
baselines PublicForked from openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Python
-
-
spinningup
spinningup PublicForked from openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Python
-
multiagent-particle-envs
multiagent-particle-envs PublicForked from jarbus/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Python
-
sanic
sanic PublicForked from sanic-org/sanic
Async Python 3.7+ web server/framework | Build fast. Run fast.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.