ffmott

Popular repositories

PPOCoder Public

Forked from reddy-lab-code-research/PPOCoder

Code for "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"

Python
CodeGen Public

Forked from salesforce/CodeGen

CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python
toolformer-pytorch Public

Forked from lucidrains/toolformer-pytorch

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Python
TPSR Public

Forked from deep-symbolic-mathematics/TPSR

[NeurIPS 2023] This is the official code for the paper "TPSR: Transformer-based Planning for Symbolic Regression"

Python
ColossalAI Public

Forked from hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

Python
awesome-RLHF Public

Forked from opendilab/awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)