- Copenhagen, Denmark
Highlights
- Pro
Block or Report
Block or report casper-hansen
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
AutoAWQ Public
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedMay 6, 2024 -
-
worker-vllm Public
Forked from runpod-workers/worker-vllmThe RunPod worker template for serving our large language model endpoints. Powered by vLLM.
-
DGQ Public
Forked from ilur98/DGQOfficial Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
-
text-generation-webui Public
Forked from oobabooga/text-generation-webuiA Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
Python GNU Affero General Public License v3.0 UpdatedDec 23, 2023 -
-
-
self-rag Public
Forked from emrgnt-cmplxty/self-ragThis includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
Python MIT License UpdatedOct 26, 2023 -
axolotl Public
Forked from OpenAccess-AI-Collective/axolotlGo ahead and axolotl questions
Python Apache License 2.0 UpdatedOct 25, 2023 -
smoothquant Public
Forked from AniZpZ/smoothquant[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
-
torch-int Public
Forked from AniZpZ/torch-intThis repository contains integer operators on GPUs for PyTorch.
C++ MIT License UpdatedSep 21, 2023 -
-
llm-awq Public
Forked from mit-han-lab/llm-awqAWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Python MIT License UpdatedSep 11, 2023 -
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQAn easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Python MIT License UpdatedJul 26, 2023 -
llm-foundry Public
Forked from mosaicml/llm-foundryLLM training code for MosaicML foundation models
Python Apache License 2.0 UpdatedJun 9, 2023 -
-
Flask-Stripe-MySQL-Bootstrapped Public archive
Flask template with microservices architecture. Fully integrated with Stripe 🚀
-
Web-Scraping-Reddit Public archive
Web scraping Reddit without using Reddit API, and making a dataset, and using the dataset for a machine learning project.
-
model-stacking Public archive
Model stacking example on toy dataset using XGBoost, LightGBM and more, combined with mlxtend model stacking.
-
-
-
-
-
Neural-Network-From-Scratch Public archive
NumPy - PyTorch - TensorFlow (+Keras)
-
CNN-From-Scratch Public archive
Building A CNN From Scratch in NumPy
-
-
-