zuoxingdong

Xingdong Zuo zuoxingdong

AI in well-being is my dream. Neural networks need to understand the world causally.

214 followers · 3 following

NAVER Corp
Seongnam, South Korea
https://zuoxingdong.github.io/

Starred repositories

facebookresearch / generative-recommenders

Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152, I…

Python · 538 stars · 88 forks · Updated Jul 3, 2024

huggingface / lerobot

🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch

Python · 4,447 stars · 372 forks · Updated Jul 19, 2024

vwxyzjn / ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python · 586 stars · 90 forks · Updated Mar 23, 2024

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python · 4,241 stars · 361 forks · Updated Jul 17, 2024

jlin816 / dynalang

Code for "Learning to Model the World with Language."

Python · 335 stars · 21 forks · Updated Sep 21, 2023

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,075 stars · 194 forks · Updated Jun 24, 2024

vwxyzjn / lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Python · 138 stars · 7 forks · Updated Jan 14, 2024

aikorea / awesome-rl

Reinforcement learning resources curated

8,717 stars · 1,828 forks · Updated May 25, 2023

joonspk-research / generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

15,891 stars · 2,012 forks · Updated Jun 3, 2024

cbamls / AI_Tutorial

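Curated learning materials for AI fields such as machine learning, NLP, image recognition, and deep learning; a collection of architecture and algorithm resources for search, recommendation, and advertising systems. A compilation of notes from leading algorithm experts.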

2,921 stars · 463 forks · Updated Apr 15, 2024

Doragd / Algorithm-Practice-in-Industry

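A collection of industry practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, Datafuntalk, technical WeChat official accounts)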

Python · 1,873 stars · 252 forks · Updated Jul 20, 2024

facebookresearch / mtm

MTM Masked Trajectory Models for Prediction, Representation, and Control.

Python · 144 stars · 3 forks · Updated Apr 28, 2023

NM512 / dreamerv3-torch

Implementation of Dreamer v3 in pytorch.

Python · 358 stars · 79 forks · Updated Mar 10, 2024

st-tech / zr-obp

Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation

Python · 627 stars · 86 forks · Updated Jun 3, 2024

huggingface / accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python · 7,405 stars · 882 forks · Updated Jul 20, 2024

ikostrikov / rlpd

Python · 185 stars · 21 forks · Updated Feb 13, 2023

danijar / dreamerv3

Mastering Diverse Domains through World Models

Python · 1,165 stars · 201 forks · Updated Jul 17, 2024

fernandoamat / slateOPE

Accompanies and reproduces results from the paper "Control Variates for Slate Off-Policy Evaluation"

Python 5 Updated Oct 26, 2021

Victor-YG / PILCO_victor

Forked from nrontsis/PILCO

Bayesian Reinforcement Learning in Tensorflow

Python 2 Updated Dec 12, 2022

deep-reinforcement-learning