  • SPAG Public

    Self-playing Adversarial Language Game Enhances LLM Reasoning

    Python 77 8 Apache License 2.0 Updated Jul 2, 2024
  • Updated Jun 4, 2024
  • APO Public

    Code for ACL2024 paper - Adversarial Preference Optimization (APO).

    Python 45 2 Apache License 2.0 Updated Jun 3, 2024
  • GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

    JavaScript 2 3 MIT License Updated May 27, 2024
  • BERT-based intent and slots detector for chatbots.

    Python 102 13 Updated May 10, 2024
  • CLUB Public

    Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

    Jupyter Notebook 297 38 Updated May 10, 2024
  • A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

    2 BSD 3-Clause "New" or "Revised" License Updated Apr 24, 2024
  • A collection of LLM with RL papers

    1 Updated Apr 24, 2024
  • A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

    Updated Apr 24, 2024
  • Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

    MIT License Updated Apr 24, 2024
  • A curated list of reinforcement learning with human feedback resources (continually updated)

    Apache License 2.0 Updated Mar 18, 2024
  • DSP Public

    Domain-specific preference (DSP) data and customized RM fine-tuning.

    Python 26 3 Apache License 2.0 Updated Mar 7, 2024
  • Linear95 Public

    My personal repository

    Updated Dec 25, 2023
  • Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators

    Jupyter Notebook 13 1 Updated Dec 15, 2023
  • alpaca-lora Public

    Forked from tloen/alpaca-lora

    Instruct-tune LLaMA on consumer hardware

    Jupyter Notebook Apache License 2.0 Updated May 9, 2023
  • RLM Public

    Code for the paper - Replacing Language Model for Style Transfer

    Python 3 Updated Apr 13, 2023
  • Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python Other Updated Feb 23, 2023
  • emacs-init Public

    My emacs init file for python coding in deep learning

    Emacs Lisp Updated Mar 7, 2022
  • Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.

    Python 44 7 Updated Sep 1, 2020
  • DetGP Public

    Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.

    Python 10 Updated Mar 26, 2020
  • The implementation of ECC classification

    Python 1 Updated Dec 12, 2017