Skip to content
View chengscott's full-sized avatar

Highlights

  • Pro

Organizations

@tw-csie-sprout @pcshjq @pcshic @nthuion

Block or report chengscott

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM training in simple, raw C/CUDA

Cuda 23,840 2,666 Updated Oct 2, 2024

Transformers with Arbitrarily Large Context

Python 622 48 Updated Aug 12, 2024

Enjoy the magic of Diffusion models!

Python 6,405 575 Updated Oct 8, 2024

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,158 71 Updated Jul 16, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 464 21 Updated Oct 5, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,233 115 Updated Oct 8, 2024

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

Python 161 17 Updated May 23, 2024

😎 Awesome Cloudflare Workers

502 16 Updated Dec 22, 2021

Implementation for MatMul-free LM.

Python 2,895 179 Updated Sep 19, 2024

Smart pointers for the (GNU) C programming language

CMake 1,576 143 Updated Nov 2, 2022

Extending the HDF5 library to support intelligent I/O buffering for deep memory and storage hierarchy systems

C++ 34 17 Updated Sep 26, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,471 144 Updated Sep 25, 2024

TerminalTextEffects (TTE) is a terminal visual effects engine, application, and Python library.

Python 2,821 49 Updated Oct 2, 2024

ldd as a tree

C 2,651 60 Updated Jun 21, 2024

Powerful menu bar manager for macOS

Swift 12,850 237 Updated Oct 8, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 52,632 5,560 Updated Oct 8, 2024

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 16,596 1,145 Updated Oct 6, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,452 602 Updated Sep 27, 2024

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Cuda 183 15 Updated May 28, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,734 3,866 Updated Oct 2, 2024

Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, …

TypeScript 5,347 1,208 Updated Oct 7, 2024

Curated list of useful LLM / Analytics / Datascience resources

1,771 152 Updated Oct 3, 2024

LLM inference in C/C++

C++ 65,985 9,476 Updated Oct 8, 2024

Tools for merging pretrained large language models.

Python 4,610 413 Updated Oct 8, 2024

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

TypeScript 47,009 8,240 Updated Oct 8, 2024

Cybernetically enhanced web apps

JavaScript 78,685 4,124 Updated Oct 8, 2024

A simple code for plotting figure, colorbar, and cropping with python

Python 350 44 Updated Apr 13, 2022

NCCL tests for ALCF machines

Roff 7 Updated Jul 8, 2024
Next