-
Korea University DSBA Lab.
- Seoul, Republic of Korea
- https://velog.io/@stapers
- https://jaehee-kim.notion.site/Unknown-NLP-Study-ff54da176c164c5aa01165a255370e8a?pvs=4
Highlights
- Pro
Stars
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Must-read Papers on Knowledge Editing for Large Language Models.
The Universe of Data. All about data, data science, and data engineering
Friends don't let friends make certain types of data visualization - What are they and why are they bad.
Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification (EMNLP 2023 Findings)
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Machine Learning Engineering Open Book
code and data for Hayati et al's paper on "How Far Can We Extract Diverse Perspectives from Large Language Models? Criteria-Based Diversity Prompting!"
Robust recipes to align language models with human and AI preferences
Simple replication of DPR (Dense Passage Retrieval)
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.
Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
Implementation of Nougat Neural Optical Understanding for Academic Documents
🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.
A quick guide (especially) for trending instruction finetuning datasets
Examples and guides for using the OpenAI API