shadowkiller33

🎯

Focusing

Lingfeng Shen shadowkiller33

I'm Lingfeng Shen, an NLPer on the road.

29 followers · 15 following

Johns Hopkins University
USA
03:24 (UTC -05:00)

Achievements

Stars

zhentingqi / rStar

Python 480 53 Updated Oct 14, 2024

suzgunmirac / BIG-Bench-Hard

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

424 28 Updated Jun 25, 2024

Zayne-sprague / MuSR

Python 33 Updated Aug 10, 2024

cnkyrpsgl / leetcode

All Python solutions for Leetcode

Python 343 171 Updated Apr 10, 2024

swarnaHub / System-1.x

PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models

Python 19 2 Updated Jul 22, 2024

leanprover / lean4

Lean 4 programming language and theorem prover

Lean 4,670 419 Updated Nov 6, 2024

FutureAGI / L3C

Lifelong Learning In Context

Python 8 Updated Oct 22, 2024

open-compass / T-Eval

[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step

Python 228 14 Updated Apr 3, 2024

volcengine / veScale

A PyTorch Native LLM Training Framework

Python 656 34 Updated Aug 25, 2024

cvangysel / pytrec_eval

pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.

C++ 290 32 Updated Oct 10, 2023

modelscope / data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷 Providing higher-quality, richer, and more "digestible" data for large models!

Python 2,860 174 Updated Nov 6, 2024

mistralai / cookbook

Jupyter Notebook 1,259 264 Updated Nov 5, 2024

hyp1231 / awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

1,553 122 Updated Sep 15, 2024

hymie122 / RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,238 86 Updated Aug 20, 2024

tianyi-lab / Superfiltering

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python 121 10 Updated Sep 6, 2024

tianyi-lab / Mosaic-IT

Mosaic IT: Enhancing Instruction Tuning with Data Mosaics

Python 14 3 Updated Jul 2, 2024

LC1332 / Chat-Haruhi-Suzumiya

Chat凉宫春日 (Chat-Haruhi-Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,826 162 Updated Aug 13, 2024

hkust-nlp / deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 493 27 Updated May 20, 2024

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python 785 65 Updated Nov 4, 2024

Zjh-819 / LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

2,584 168 Updated Nov 28, 2023

Minami-su / character_AI_open

Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.

Python 102 12 Updated Aug 30, 2024

Bauhinia-AI / evol-character

Based on the Evol-character framework and OpenAI API, enabling fine-grained role-playing data generation 🎭🧩.

23 Updated Feb 1, 2024

Shivanshu-Gupta / icl-coverage

Python 13 1 Updated Mar 5, 2024

teknium1 / GPTeacher

A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer

Python 1,618 169 Updated Sep 15, 2023

Shark-NLP / self-adaptive-ICL

self-adaptive in-context learning

Python 41 5 Updated May 5, 2023

databricks / lilac

Curate better data for LLMs

Python 953 89 Updated Mar 19, 2024

terarachang / DataICL

Data Valuation on In-Context Examples (ACL23)

Python 23 4 Updated Oct 19, 2024

StableFluffy / EasyLLMFeaturePorter

1-Click is all you need.

Jupyter Notebook 58 8 Updated Apr 29, 2024

SillyTavern / SillyTavern

LLM Frontend for Power Users.

JavaScript 8,156 2,397 Updated Nov 5, 2024

Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,019 742 Updated Nov 5, 2024