
Efficient Triton Kernels for LLM Training

Python 3,135 166 Updated Oct 5, 2024

A curated list of papers and resources based on "Large Language Models on Graphs: A Comprehensive Survey"

719 43 Updated Oct 1, 2024

Fast and memory-efficient exact attention

Python 13,647 1,251 Updated Oct 6, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 3,065 231 Updated Aug 10, 2024

Implementation for MatMul-free LM.

Python 2,892 179 Updated Sep 19, 2024

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 766 47 Updated Sep 9, 2024

[SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous Decoding"

Python 20 2 Updated Apr 24, 2024

A collection of public Korean instruction datasets for training language models.

Python 340 26 Updated Jul 31, 2024

Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.

TypeScript 13,794 1,309 Updated Oct 3, 2024

A list of multi-vector retrieval resources

8 Updated May 29, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,215 1,013 Updated Oct 4, 2024

Longformer: The Long-Document Transformer

Python 2,035 271 Updated Feb 8, 2023

[WWW 2024] The official repo for paper "Scalable and Effective Generative Information Retrieval".

Python 49 5 Updated May 7, 2024

An open science effort to benchmark legal reasoning in foundation models

Python 331 43 Updated Aug 25, 2024
3 Updated Aug 16, 2024

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

Python 180 35 Updated Jun 15, 2023

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,652 370 Updated Oct 6, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,618 471 Updated Sep 23, 2024

A curated list of awesome LLM agents.

461 38 Updated Jul 1, 2024

LLM finetuned for medical question answering

Python 477 56 Updated Sep 7, 2023
Python 3,456 401 Updated May 17, 2024

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

911 47 Updated Sep 4, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 5,968 516 Updated Sep 6, 2024

Train transformer language models with reinforcement learning.

Python 9,616 1,207 Updated Oct 6, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,604 2,157 Updated Aug 12, 2024

LOMO: LOw-Memory Optimization

Python 975 68 Updated Jul 2, 2024

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

1,071 59 Updated Jan 4, 2024