fwtan
  • mlx Public

    Forked from ml-explore/mlx

    MLX: An array framework for Apple silicon

    C++ MIT License Updated Jun 16, 2024
  • lmquant Public

    Forked from mit-han-lab/lmquant
    Python Apache License 2.0 Updated Jun 12, 2024
  • relax Public

    Forked from mlc-ai/relax
    Python Apache License 2.0 Updated Jun 10, 2024
  • mlc-llm Public

    Forked from mlc-ai/mlc-llm

    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

    Python Apache License 2.0 Updated Jun 10, 2024
  • llama2.c Public

    Forked from karpathy/llama2.c

    Inference Llama 2 in one file of pure C

    C MIT License Updated Jun 10, 2024
  • Universal cross-platform tokenizers binding to HF and sentencepiece

    C++ Apache License 2.0 Updated Jun 10, 2024
  • gcc Public

    Forked from gcc-mirror/gcc
    GNU General Public License v2.0 Updated Jun 9, 2024
  • The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

    Python BSD 3-Clause "New" or "Revised" License Updated May 29, 2024
  • torch_int Public

    Python MIT License Updated May 27, 2024
  • A framework for few-shot evaluation of autoregressive language models.

    Python MIT License Updated May 27, 2024
  • cutlass Public

    Forked from NVIDIA/cutlass

    CUDA Templates for Linear Algebra Subroutines

    C++ Other Updated May 26, 2024
  • aimet Public

    Forked from quic/aimet

    AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

    Python Other Updated May 26, 2024
  • qserve Public

    Forked from mit-han-lab/qserve

    QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

    Python Apache License 2.0 Updated May 14, 2024
  • executorch Public

    Forked from pytorch/executorch

    On-device AI across mobile, embedded and edge for PyTorch

    C++ Other Updated Apr 19, 2024
  • nanotron Public

    Forked from huggingface/nanotron

    Minimalistic large language model 3D-parallelism training

    Python Apache License 2.0 Updated Apr 16, 2024
  • 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

    Python Apache License 2.0 Updated Apr 15, 2024
  • AutoGPTQ Public

    Forked from AutoGPTQ/AutoGPTQ

    An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

    Python MIT License Updated Apr 9, 2024
  • diffusers Public

    Forked from huggingface/diffusers

    🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

    Python Apache License 2.0 Updated Apr 6, 2024
  • Robust recipes to align language models with human and AI preferences

    Python Apache License 2.0 Updated Apr 2, 2024
  • OmniQuant Public

    Forked from OpenGVLab/OmniQuant

    OmniQuant is a simple and powerful quantization technique for LLMs.

    Python MIT License Updated Mar 26, 2024
  • Generative Models by Stability AI

    Python MIT License Updated Mar 25, 2024
  • llm-awq Public

    Forked from mit-han-lab/llm-awq

    AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python MIT License Updated Mar 25, 2024
  • trl Public

    Forked from huggingface/trl

    Train transformer language models with reinforcement learning.

    Python Apache License 2.0 Updated Mar 11, 2024
  • Python Apache License 2.0 Updated Feb 27, 2024
  • HTML Updated Feb 6, 2024
  • [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

    Python MIT License Updated Jan 31, 2024
  • stk Public

    Forked from stanford-futuredata/stk
    Python Apache License 2.0 Updated Jan 11, 2024
  • llama.cpp Public

    Forked from ggerganov/llama.cpp

    Port of Facebook's LLaMA model in C/C++

    C MIT License Updated Jan 10, 2024
  • TinyLlama Public

    Forked from jzhang38/TinyLlama

    The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

    Python Apache License 2.0 Updated Dec 28, 2023
  • High-Resolution Image Synthesis with Latent Diffusion Models

    Python MIT License Updated Dec 21, 2023