-
Samsung AI Center
- Cambridge, UK
- https://fwtan.github.io/
Block or Report
Block or report fwtan
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
mlx Public
Forked from ml-explore/mlxMLX: An array framework for Apple silicon
C++ MIT License UpdatedJun 16, 2024 -
-
-
mlc-llm Public
Forked from mlc-ai/mlc-llmEnable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Python Apache License 2.0 UpdatedJun 10, 2024 -
llama2.c Public
Forked from karpathy/llama2.cInference Llama 2 in one file of pure C
C MIT License UpdatedJun 10, 2024 -
tokenizers-cpp Public
Forked from mlc-ai/tokenizers-cppUniversal cross-platform tokenizers binding to HF and sentencepiece
C++ Apache License 2.0 UpdatedJun 10, 2024 -
-
ai-hub-models Public
Forked from quic/ai-hub-modelsThe Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 29, 2024 -
-
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of autoregressive language models.
Python MIT License UpdatedMay 27, 2024 -
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedMay 26, 2024 -
aimet Public
Forked from quic/aimetAIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Python Other UpdatedMay 26, 2024 -
qserve Public
Forked from mit-han-lab/qserveQServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Python Apache License 2.0 UpdatedMay 14, 2024 -
executorch Public
Forked from pytorch/executorchOn-device AI across mobile, embedded and edge for PyTorch
C++ Other UpdatedApr 19, 2024 -
nanotron Public
Forked from huggingface/nanotronMinimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedApr 16, 2024 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedApr 15, 2024 -
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQAn easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Python MIT License UpdatedApr 9, 2024 -
diffusers Public
Forked from huggingface/diffusers🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Python Apache License 2.0 UpdatedApr 6, 2024 -
alignment-handbook Public
Forked from huggingface/alignment-handbookRobust recipes for to align language models with human and AI preferences
Python Apache License 2.0 UpdatedApr 2, 2024 -
OmniQuant Public
Forked from OpenGVLab/OmniQuantOmniQuant is a simple and powerful quantization technique for LLMs.
Python MIT License UpdatedMar 26, 2024 -
generative-models Public
Forked from Stability-AI/generative-modelsGenerative Models by Stability AI
Python MIT License UpdatedMar 25, 2024 -
llm-awq Public
Forked from mit-han-lab/llm-awqAWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Python MIT License UpdatedMar 25, 2024 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedMar 11, 2024 -
-
-
LLM-Shearing Public
Forked from princeton-nlp/LLM-Shearing[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Python MIT License UpdatedJan 31, 2024 -
-
llama.cpp Public
Forked from ggerganov/llama.cppPort of Facebook's LLaMA model in C/C++
C MIT License UpdatedJan 10, 2024 -
TinyLlama Public
Forked from jzhang38/TinyLlamaThe TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Python Apache License 2.0 UpdatedDec 28, 2023 -
stablediffusion Public
Forked from Stability-AI/stablediffusionHigh-Resolution Image Synthesis with Latent Diffusion Models
Python MIT License UpdatedDec 21, 2023