-
Tencent AI Lab
- https://linear95.github.io/
- @cheng_pengyu
- in/pengyu-cheng
Highlights
- Pro
Block or Report
Block or report Linear95
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
SPAG Public
Self-playing Adversarial Language Game Enhances LLM Reasoning
-
-
APO Public
Code for ACL2024 paper - Adversarial Preference Optimization (APO).
-
linear95.github.io Public
Forked from academicpages/academicpages.github.ioGithub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
-
bert-intent-slot-detector Public
BERT-based intent and slots detector for chatbots.
-
CLUB Public
Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
-
Awesome-LLM-Robotics Public
Forked from GT-RIPL/Awesome-LLM-RoboticsA comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
-
LLM-with-RL-papers Public
Forked from floodsung/LLM-with-RL-papersA collection of LLM with RL papers
1 UpdatedApr 24, 2024 -
Awesome-LLM-RL Public
Forked from 123penny123/Awesome-LLM-RLA comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
UpdatedApr 24, 2024 -
Awesome-LLM-Reasoning Public
Forked from atfortes/Awesome-LLM-ReasoningReasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.
MIT License UpdatedApr 24, 2024 -
awesome-RLHF Public
Forked from opendilab/awesome-RLHFA curated list of reinforcement learning with human feedback resources (continually updated)
Apache License 2.0 UpdatedMar 18, 2024 -
DSP Public
Domain-specific preference (DSP) data and customized RM fine-tuning.
-
-
TC-estimation Public
Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators
-
alpaca-lora Public
Forked from tloen/alpaca-loraInstruct-tune LLaMA on consumer hardware
Jupyter Notebook Apache License 2.0 UpdatedMay 9, 2023 -
RLM Public
Code for the paper - Replacing Language Model for Style Transfer
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedFeb 23, 2023 -
emacs-init Public
My emacs init file for python coding in deep learning
Emacs Lisp UpdatedMar 7, 2022 -
BinarySentEmb Public
Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.
-
DetGP Public
Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.
-