Chat Templates for 🤗 HuggingFace Large Language Models

Jinja 364 33 Updated Jul 11, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,297 306 Updated Jul 16, 2024

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 587 91 Updated Mar 23, 2024

Recipes to train reward models for RLHF.

Python 458 35 Updated Jul 20, 2024

Benchmarking LLMs with Challenging Tasks from Real Users

Python 144 13 Updated Jul 21, 2024

Let ChatGPT teach your own chatbot in hours with a single GPU!

Python 3,147 277 Updated Mar 17, 2024

Dromedary: towards helpful, ethical and reliable LLMs.

Python 1,102 84 Updated Oct 26, 2023

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,180 112 Updated Mar 13, 2024
Jupyter Notebook 3,874 499 Updated Mar 28, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 230 27 Updated Jul 22, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 34,420 3,595 Updated Jul 16, 2024

Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens

Python 227 22 Updated Jun 11, 2024
Python 84 30 Updated Jul 21, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,391 366 Updated Jul 18, 2024

A set of scripts to grab public datasets from resources related to arXiv

Python 383 61 Updated May 20, 2024

A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

627 23 Updated Jul 21, 2024

RAG AutoML Tool - Find optimal RAG pipeline for your own data.

Python 1,181 102 Updated Jul 19, 2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Python 306 17 Updated May 29, 2024
Jupyter Notebook 12 1 Updated Mar 11, 2024

Official repo for "Make Your LLM Fully Utilize the Context"

Python 229 17 Updated May 15, 2024

CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

38 1 Updated Jul 4, 2024

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 77,683 7,092 Updated Jul 21, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 11,339 858 Updated May 23, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 241 9 Updated Jul 22, 2024

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 1,787 238 Updated Jul 20, 2024

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 470 23 Updated Jul 6, 2024

A very fast and expressive template engine.

Python 10,119 1,595 Updated Jul 10, 2024

Official implementation of ECCV2024 paper, "DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs".

Python 75 3 Updated Jul 9, 2024