  • Johns Hopkins University
  • USA
Python 480 53 Updated Oct 14, 2024

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

424 28 Updated Jun 25, 2024
Python 33 Updated Aug 10, 2024

All Python solutions for Leetcode

Python 343 171 Updated Apr 10, 2024

PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models

Python 19 2 Updated Jul 22, 2024

Lean 4 programming language and theorem prover

Lean 4,670 419 Updated Nov 6, 2024

Lifelong Learning In Context

Python 8 Updated Oct 22, 2024

[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step

Python 228 14 Updated Apr 3, 2024

A PyTorch Native LLM Training Framework

Python 656 34 Updated Aug 25, 2024

pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.

C++ 290 32 Updated Oct 10, 2023

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 2,860 174 Updated Nov 6, 2024
Jupyter Notebook 1,259 264 Updated Nov 5, 2024

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

1,553 122 Updated Sep 15, 2024

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,238 86 Updated Aug 20, 2024

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python 121 10 Updated Sep 6, 2024

Mosaic IT: Enhancing Instruction Tuning with Data Mosaics

Python 14 3 Updated Jul 2, 2024

Chat凉宫春日 (Chat-Haruhi-Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,826 162 Updated Aug 13, 2024

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 493 27 Updated May 20, 2024

Recipes to train reward model for RLHF.

Python 785 65 Updated Nov 4, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,584 168 Updated Nov 28, 2023

Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.

Python 102 12 Updated Aug 30, 2024

Based on the Evol-character framework and OpenAI API, enabling fine-grained role-playing data generation 🎭🧩.

23 Updated Feb 1, 2024

A collection of modular datasets generated by GPT-4: General-Instruct, Roleplay-Instruct, Code-Instruct, and Toolformer

Python 1,618 169 Updated Sep 15, 2023

self-adaptive in-context learning

Python 41 5 Updated May 5, 2023

Curate better data for LLMs

Python 953 89 Updated Mar 19, 2024

Data Valuation on In-Context Examples (ACL23)

Python 23 4 Updated Oct 19, 2024

1-Click is all you need.

Jupyter Notebook 58 8 Updated Apr 29, 2024

LLM Frontend for Power Users.

JavaScript 8,156 2,397 Updated Nov 5, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,019 742 Updated Nov 5, 2024