Skip to content
View KimJaehee0725's full-sized avatar
🥔
I'm a talking potato
🥔
I'm a talking potato

Highlights

  • Pro

Block or report KimJaehee0725

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM101n: Let's build a Storyteller

29,239 1,600 Updated Aug 1, 2024

Benchmarking library for RAG

Jupyter Notebook 95 8 Updated Oct 9, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,183 89 Updated Oct 8, 2024
135 Updated Dec 26, 2023

PyTorch native finetuning library

Python 4,101 383 Updated Oct 9, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 28,851 3,309 Updated Oct 8, 2024

Must-read Papers on Knowledge Editing for Large Language Models.

857 54 Updated Oct 9, 2024

The Universe of Data. All about data, data science, and data engineering

Python 507 52 Updated Jul 18, 2024

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

R 6,341 231 Updated Jul 11, 2024

Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification (EMNLP 2023 Findings)

Python 1 Updated Feb 1, 2024

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,233 249 Updated Jun 24, 2024

SILO Language Models code repository

Python 80 10 Updated Feb 23, 2024
Jupyter Notebook 24 22 Updated Nov 24, 2023
Jupyter Notebook 23 19 Updated Mar 19, 2024

Machine Learning Engineering Open Book

Python 11,337 685 Updated Oct 9, 2024

code and data for Hayati et al's paper on "How Far Can We Extract Diverse Perspectives from Large Language Models? Criteria-Based Diversity Prompting!"

JavaScript 5 Updated Oct 1, 2024

Robust recipes to align language models with human and AI preferences

Python 4,559 395 Updated Oct 7, 2024

Simple replication of DPR (Dense Passage Retrieval)

Python 33 3 Updated Nov 10, 2023

XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.

Python 629 109 Updated Jan 4, 2023

Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"

Jupyter Notebook 15 1 Updated Feb 16, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,365 940 Updated Oct 9, 2024

👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"

Python 50 3 Updated May 31, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 12,052 1,925 Updated Oct 9, 2024

Time-series-LLM

Python 20 3 Updated Oct 31, 2023

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 8,850 562 Updated Apr 16, 2024

🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.

Python 129 8 Updated Jun 23, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,494 161 Updated Nov 28, 2023

Examples and guides for using the OpenAI API

MDX 58,938 9,374 Updated Oct 9, 2024
Next