Skip to content
View wuxb45's full-sized avatar
💘
💘

Highlights

  • Pro

Block or report wuxb45

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Native-PyTorch Library for LLM Fine-tuning

Python 3,930 354 Updated Sep 13, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,571 1,637 Updated Sep 11, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 82,120 22,078 Updated Sep 14, 2024

RCCL Performance Benchmark Tests

Cuda 41 37 Updated Sep 10, 2024

NCCL Tests

Cuda 816 230 Updated Jul 30, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,162 903 Updated Sep 10, 2024

A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)

C++ 352 62 Updated Aug 18, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,347 901 Updated Sep 11, 2024

Multi-GPU CUDA stress test

C++ 1,342 295 Updated Aug 20, 2024

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 6,765 757 Updated Sep 12, 2024

Attention is all you need implementation

Jupyter Notebook 542 231 Updated Jun 8, 2024

Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.

Python 23 6 Updated May 5, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,532 185 Updated Mar 8, 2024

Mamba SSM architecture

Python 12,525 1,053 Updated Aug 15, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,843 173 Updated Sep 11, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,506 503 Updated Sep 13, 2024

The most widely used Python to C compiler

Python 9,346 1,478 Updated Sep 12, 2024

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

1,459 80 Updated Sep 8, 2024

Dafny is a verification-aware programming language

C# 2,875 256 Updated Sep 13, 2024

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 47,672 4,646 Updated Sep 10, 2024

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 21,310 3,133 Updated Sep 13, 2024

LlamaIndex is a data framework for your LLM applications

Python 35,382 4,983 Updated Sep 14, 2024

Header-only C++/python library for fast approximate nearest neighbors

C++ 4,266 626 Updated Aug 11, 2024

Library for fast text representation and classification.

HTML 25,840 4,710 Updated Mar 22, 2024

Google Research

Jupyter Notebook 33,799 7,830 Updated Sep 12, 2024

Benchmarks of approximate nearest neighbor libraries in Python

Python 4,856 729 Updated Sep 2, 2024

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 23,146 4,268 Updated Aug 18, 2024

🎓 Path to a free self-taught education in Computer Science!

168,413 21,332 Updated Sep 10, 2024

TensorFlow code and pre-trained models for BERT

Python 37,827 9,561 Updated Jul 23, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 131,978 26,286 Updated Sep 13, 2024
Next