Highlights
- Pro
Stars
A curated list of resources for using LLMs to develop more competitive grant applications.
Building Optimization Performance Tests
A flexible toolkit for simulation based inference in Julia
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Meta Learning / Learning to Learn / One Shot Learning / Few Shot Learning
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
Generative Agents for video games. Based on Generative Agents: Interactive Simulacra of Human Behavior
List of language agents based on paper "Cognitive Architectures for Language Agents"
Generative Agents: Interactive Simulacra of Human Behavior
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Honor of Kings AI Open Environment of Tencent
moderncv / moderncv
Forked from xdanaux/moderncvA modern curriculum vitae class for LaTeX
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
PyTorch Tutorial for Deep Learning Researchers
Device Driver Sample for Energy Sharing System
Signal forecasting with a Sequence-to-Sequence (seq2seq) Recurrent Neural Network (RNN) model in TensorFlow - Guillaume Chevalier
A simulator and learning agent to solve the ridesharing problem
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
This is the official implementation of Multi-Agent PPO (MAPPO).