Skip to content
View zuoxingdong's full-sized avatar
Block or Report

Block or report zuoxingdong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

148 results for source starred repositories
Clear filter

Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152, I…

Python 538 88 Updated Jul 3, 2024

🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch

Python 4,450 373 Updated Jul 19, 2024

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 586 90 Updated Mar 23, 2024

Robust recipes to align language models with human and AI preferences

Python 4,242 361 Updated Jul 17, 2024

Code for "Learning to Model the World with Language."

Python 335 21 Updated Sep 21, 2023

A curated list of reinforcement learning with human feedback resources (continually updated)

3,075 194 Updated Jun 24, 2024

RLHF implementation details of OAI's 2019 codebase

Python 138 7 Updated Jan 14, 2024

Reinforcement learning resources curated

8,717 1,828 Updated May 25, 2023

Generative Agents: Interactive Simulacra of Human Behavior

15,893 2,011 Updated Jun 3, 2024

精选机器学习,NLP,图像识别, 深度学习等人工智能领域学习资料,搜索,推荐,广告系统架构及算法技术资料整理。算法大牛笔记汇总

2,921 463 Updated Apr 15, 2024

搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)

Python 1,874 252 Updated Jul 20, 2024

MTM Masked Trajectory Models for Prediction, Representation, and Control.

Python 144 3 Updated Apr 28, 2023

Implementation of Dreamer v3 in pytorch.

Python 358 79 Updated Mar 10, 2024

Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation

Python 627 87 Updated Jun 3, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,405 882 Updated Jul 20, 2024
Python 185 21 Updated Feb 13, 2023

Mastering Diverse Domains through World Models

Python 1,165 201 Updated Jul 17, 2024

Accompanies and reproduces results from the paper "Control Variates for Slate Off-Policy Evaluation"

Python 5 Updated Oct 26, 2021

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 34,940 5,387 Updated Jul 19, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,399 469 Updated Jan 8, 2024

Train transformer language models with reinforcement learning.

Python 8,822 1,086 Updated Jul 19, 2024

【浅梦学习笔记】文章汇总:包含 排序&CXR预估,召回匹配,用户画像&特征工程,推荐搜索综合 计算广告,大数据,图算法,NLP&CV,求职面试 等内容

1,533 219 Updated Dec 24, 2022

Foundation Architecture for (M)LLMs

Python 2,975 201 Updated Apr 11, 2024

Python implementations of contextual bandits algorithms

Python 723 142 Updated Jun 18, 2024

The Fuzzy Labs guide to the universe of open source MLOps

438 47 Updated Jul 17, 2024

Recommendations at "Reasonable Scale": joining dataOps with recSys through dbt, Merlin and Metaflow

Python 224 14 Updated Apr 7, 2023

Behavioral "black-box" testing for recommender systems

Python 453 26 Updated Aug 9, 2023

This is the official implementation for the paper: "CIRS: Bursting Filter Bubbles by Counterfactual Interactive Recommender System"

Python 62 7 Updated Jan 2, 2024

An up-to-date, comprehensive and flexible recommendation library

170 21 Updated Nov 5, 2023

Hands-on tutorials at EEML2022 summer school

Jupyter Notebook 60 19 Updated Jul 13, 2022
Next