Skip to content

Navigation Menu

Explore
By size
By industry
By use case
Topics
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

Linear95 Follow

Overview Repositories 21 Projects 0 Packages 0 Stars 203

More

Overview
Repositories
Projects
Packages
Stars

Linear95

Follow

Pengyu Cheng Linear95

Follow

Researcher at Tencent AI Lab

125 followers · 65 following

Tencent AI Lab
https://linear95.github.io/
@cheng_pengyu
in/pengyu-cheng

Achievements

Achievements

Highlights

Pro

Block or Report

Block or report Linear95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Add an optional note:

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Overview Repositories 21 Projects 0 Packages 0 Stars 203

More

Overview
Repositories
Projects
Packages
Stars

Type All

Select type

All Sources Forks Archived Can be sponsored Mirrors Templates

Language All

Select language

All Python JavaScript Jupyter Notebook Emacs Lisp

Sort Last updated

Select order

Last updated Name Stars

SPAG Public

Self-playing Adversarial Language Game Enhances LLM Reasoning

Python 77 8 Apache License 2.0 Updated Jul 2, 2024
awesome-auto-alignment Public
Forked from cascip/awesome-auto-alignment

Updated Jun 4, 2024
APO Public

Code for ACL2024 paper - Adversarial Preference Optimization (APO).

Python 45 2 Apache License 2.0 Updated Jun 3, 2024
linear95.github.io Public
Forked from academicpages/academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript 2 3 MIT License Updated May 27, 2024
bert-intent-slot-detector Public

BERT-based intent and slots detector for chatbots.

Python 102 13 Updated May 10, 2024
CLUB Public

Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

Jupyter Notebook 297 38 Updated May 10, 2024
Awesome-LLM-Robotics Public
Forked from GT-RIPL/Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

2 BSD 3-Clause "New" or "Revised" License Updated Apr 24, 2024
LLM-with-RL-papers Public
Forked from floodsung/LLM-with-RL-papers

A collection of LLM with RL papers

1 Updated Apr 24, 2024
Awesome-LLM-RL Public
Forked from 123penny123/Awesome-LLM-RL

A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

Updated Apr 24, 2024
Awesome-LLM-Reasoning Public
Forked from atfortes/Awesome-LLM-Reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

MIT License Updated Apr 24, 2024
awesome-RLHF Public
Forked from opendilab/awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

Apache License 2.0 Updated Mar 18, 2024
DSP Public

Domain-specific preference (DSP) data and customized RM fine-tuning.

Python 26 3 Apache License 2.0 Updated Mar 7, 2024
Linear95 Public

My personal repository

Updated Dec 25, 2023
TC-estimation Public

Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators

Jupyter Notebook 13 1 Updated Dec 15, 2023
alpaca-lora Public
Forked from tloen/alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook Apache License 2.0 Updated May 9, 2023
RLM Public

Code for the paper - Replacing Language Model for Style Transfer

Python 3 Updated Apr 13, 2023
Megatron-LM Public
Forked from NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Python Other Updated Feb 23, 2023
emacs-init Public

My emacs init file for python coding in deep learning

Emacs Lisp Updated Mar 7, 2022
BinarySentEmb Public

Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.

Python 44 7 Updated Sep 1, 2020
DetGP Public

Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.

Python 10 Updated Mar 26, 2020
ECC_classification Public

The implement of ECC classification

Python 1 Updated Dec 12, 2017

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.