Skip to content
View xiaosongyuan's full-sized avatar
  • Jilin University
  • Changchun City, Jilin Province, China
  • 06:35 (UTC -12:00)

Block or report xiaosongyuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 6,801 876 Updated Aug 27, 2024

Retrieval-Augmented Generation in 3 Lines of Code!

Python 22 3 Updated Aug 26, 2024

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Python 199 16 Updated Jun 6, 2024

Use PEFT or Full-parameter to finetune 300+ LLMs or 60+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Python 3,116 259 Updated Aug 29, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,634 385 Updated Aug 29, 2024

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…

Jupyter Notebook 1,956 166 Updated Aug 15, 2024

简单易懂的LLaMA微调指南。

Python 333 34 Updated Jul 5, 2023

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 391 47 Updated Apr 24, 2024

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

416 44 Updated Jul 10, 2024

This is a tutorial to connect the fundamental mathematics to a practical implementation addressing the continual learning problem of artificial intelligence

Jupyter Notebook 355 24 Updated Apr 17, 2023

Data and code of the paper Toward Unified Controllable Text Generation via Regular Expression Instruction

Python 5 1 Updated Nov 22, 2023

The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>

321 29 Updated Apr 25, 2024

Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models

Python 39 5 Updated Mar 13, 2022

Code for Paper: “Low-Resource” Text Classification: A Parameter-Free Classification Method with Compressors

Python 1,768 155 Updated Aug 7, 2023

GeDi: Generative Discriminator Guided Sequence Generation

Python 208 47 Updated Sep 28, 2022

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

1,896 126 Updated Oct 5, 2023

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 29,762 3,660 Updated Aug 29, 2024

Samples for users of the Yelp Academic Dataset

Python 1,218 614 Updated May 28, 2023

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Python 361 18 Updated May 17, 2024

An awesome paper list of Semi-Supervised Learning under realistic settings.

Shell 91 8 Updated Jun 16, 2024

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,581 266 Updated Aug 14, 2024
Python 76 1 Updated Nov 11, 2022

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,896 778 Updated Aug 20, 2024

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,222 700 Updated May 31, 2024

The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Generation

Python 25 2 Updated Nov 13, 2023

Official Code for the papers: "Controlled Text Generation as Continuous Optimization with Multiple Constraints" and "Gradient-based Constrained Sampling from LMs"

Python 59 5 Updated Mar 21, 2024
Python 31 4 Updated Sep 7, 2023

Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control

Python 62 4 Updated Nov 13, 2022
Python 43 2 Updated Nov 21, 2023

Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.

Python 1,127 202 Updated Feb 20, 2024
Next