Popular repositories
-
PPOCoder (Public, forked from reddy-lab-code-research/PPOCoder)
Code for "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
Python
-
CodeGen (Public, forked from salesforce/CodeGen)
CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
Python
-
toolformer-pytorch (Public, forked from lucidrains/toolformer-pytorch)
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
Python
-
TPSR (Public, forked from deep-symbolic-mathematics/TPSR)
[NeurIPS 2023] This is the official code for the paper "TPSR: Transformer-based Planning for Symbolic Regression"
Python
-
ColossalAI (Public, forked from hpcaitech/ColossalAI)
Making large AI models cheaper, faster and more accessible
Python
-
awesome-RLHF (Public, forked from opendilab/awesome-RLHF)
A curated list of reinforcement learning with human feedback resources (continually updated)