Large Reasoning Models

Python 415 27 Updated Nov 7, 2024
Python 48 4 Updated Oct 28, 2024

Implementation of F5-TTS in MLX

Python 308 29 Updated Nov 1, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

5,007 279 Updated Nov 1, 2024

High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.

Python 4,745 612 Updated Aug 9, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 10 2 Updated Sep 8, 2024

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Supports Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 863 100 Updated Oct 7, 2024
Python 52 2 Updated May 31, 2024

Prune transformer layers

Python 63 9 Updated May 30, 2024

Awesome LLM compression research papers and tools.

1,173 74 Updated Nov 7, 2024

An Open Source Toolkit For LLM Distillation

Python 348 37 Updated Sep 17, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,186 1,113 Updated Oct 14, 2024

Distributed inference for MLX LLMs

Python 67 7 Updated Aug 1, 2024

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 1,003 91 Updated Sep 9, 2024

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 11,023 641 Updated Nov 6, 2024

Agentless🐱: an agentless approach to automatically solve software development problems

Python 707 84 Updated Oct 29, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,508 879 Updated Oct 22, 2024

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 6,698 664 Updated Oct 3, 2024

An AI agent that writes (actually useful) code for you

TypeScript 2,885 228 Updated Oct 22, 2024

LLM101n: Let's build a Storyteller

29,637 1,620 Updated Aug 1, 2024

Replit Desktop App

TypeScript 116 5 Updated Nov 1, 2024

xLSTM as Generic Vision Backbone

Python 429 30 Updated Nov 4, 2024

MLX implementation of xLSTM model by Beck et al. (2024)

Python 23 1 Updated Jun 5, 2024

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

Python 194 25 Updated Apr 23, 2024

Gemma 2B with 10M context length using Infini-attention.

Python 946 59 Updated May 12, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,679 1,093 Updated May 23, 2024

This is our own implementation of 'Layer Selective Rank Reduction'

Python 231 27 Updated May 26, 2024

This repository contains the experimental PyTorch native float8 training UX

Python 211 20 Updated Aug 1, 2024

Run GreenBitAI's Quantized LLMs on Apple Devices with MLX

Python 14 3 Updated Nov 5, 2024

Material for gpu-mode lectures

Jupyter Notebook 2,957 295 Updated Nov 4, 2024