Skip to content
View BinhangYuan's full-sized avatar
😊
😊

Organizations

@DS3Lab

Block or report BinhangYuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Collection of training data management explorations for large language models

256 24 Updated Aug 2, 2024

Course Material for the UG Course COMP4901Y

Python 44 2 Updated May 12, 2024

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Python 1,696 288 Updated Aug 20, 2024

Implementation of πŸ’ Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 447 25 Updated Aug 15, 2024

Scalable toolkit for efficient model alignment

Python 499 52 Updated Sep 1, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 29,958 3,696 Updated Sep 2, 2024

Data processing system for polyglot

Python 88 24 Updated Sep 5, 2023

Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Python 180 8 Updated Aug 19, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25,761 3,759 Updated Sep 2, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,360 1,311 Updated Aug 28, 2024

The official Meta Llama 3 GitHub site

Python 25,869 2,886 Updated Aug 12, 2024

[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".

Python 102 8 Updated May 14, 2024

Data compression in TensorFlow

Python 850 248 Updated Aug 7, 2024

DLRover: An Automatic Distributed Deep Learning System

Python 1,148 145 Updated Aug 30, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,202 999 Updated Sep 2, 2024

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 262 29 Updated Aug 19, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,940 526 Updated May 31, 2024

πŸ€– Chat with your SQL database πŸ“Š. Accurate Text-to-SQL Generation via LLMs using RAG πŸ”„.

Python 10,590 811 Updated Aug 28, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,483 2,059 Updated Aug 9, 2024

Summarize existing representative LLMs text datasets.

811 77 Updated Aug 29, 2024

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,691 335 Updated Jul 31, 2024

FlagPerf is an open-source software platform for benchmarking AI chips.

Python 294 99 Updated Sep 1, 2024

Microsoft Collective Communication Library

C++ 301 29 Updated Sep 20, 2023

πŸ’‘ All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Python 8,621 570 Updated Aug 26, 2024

Serving LLMs on heterogeneous decentralized clusters.

Python 13 1 Updated May 6, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,535 423 Updated Jun 22, 2024
Python 269 45 Updated Sep 2, 2024

[NeurIPS'23] Speculative Decoding with Big Little Decoder

Python 81 10 Updated Feb 6, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,084 96 Updated Sep 1, 2024

An open-source framework for training large multimodal models.

Python 3,630 278 Updated Aug 31, 2024
Next