- gradient.ai @Preemo-Inc
- San Francisco
- UTC -07:00
- michaelfeil.eu
- in/michael-feil
- @feilsystem
-
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
C++ MIT License Updated Jul 3, 2024 -
infinity Public
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
-
hf-hub-ctranslate2 Public
Connecting Transformers on HuggingFace Hub with CTranslate2
-
easyinference Public
A stable and easy-to-use inference library with a focus on a sync-to-async API
MIT License Updated Jun 23, 2024 -
skyjo_rl Public
Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB
-
nlm-ingestor Public
Forked from nlmatics/nlm-ingestor
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
-
sentence-transformers Public
Forked from UKPLab/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
-
Verba Public
Forked from weaviate/Verba
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
Python BSD 3-Clause "New" or "Revised" License Updated Jun 10, 2024 -
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
Python Updated May 20, 2024 -
fastembed Public
Forked from qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Python Apache License 2.0 Updated Apr 26, 2024 -
shellingham Public
Forked from sarugaku/shellingham
Tool to Detect Surrounding Shell
Python ISC License Updated Apr 9, 2024 -
ragas Public
Forked from explodinggradients/ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
Python Apache License 2.0 Updated Apr 2, 2024 -
referencing Public
Forked from python-jsonschema/referencing
Cross-specification JSON referencing (JSON Schema, OpenAPI, and the one you just made up!)
Python MIT License Updated Apr 1, 2024 -
hqq Public
Forked from mobiusml/hqq
Official implementation of Half-Quadratic Quantization (HQQ)
Python Apache License 2.0 Updated Mar 29, 2024 -
gpt-fast Public
Forked from pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Python BSD 3-Clause "New" or "Revised" License Updated Mar 13, 2024 -
UniTS Public
Forked from mims-harvard/UniTS
A unified time series model.
MIT License Updated Mar 4, 2024 -
langchain Public
Forked from langchain-ai/langchain
⚡ Building applications with LLMs through composability ⚡
Python MIT License Updated Feb 22, 2024 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
-
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Feb 18, 2024 -
kernl Public
Forked from ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Jupyter Notebook Apache License 2.0 Updated Feb 16, 2024 -
AutoAWQ Public
Forked from casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
Python MIT License Updated Feb 5, 2024 -
foundation-model-stack Public
Forked from foundation-model-stack/foundation-model-stack
Python Apache License 2.0 Updated Feb 4, 2024 -
m2-bert Public
Forked from HazyResearch/m2
M2-with-wheels. Python wheels for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
Assembly Apache License 2.0 Updated Jan 15, 2024 -
flash-fft-conv Public
Forked from HazyResearch/flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
C++ Apache License 2.0 Updated Jan 12, 2024 -
optimum-neuron Public
Forked from huggingface/optimum-neuron
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
Jupyter Notebook Apache License 2.0 Updated Jan 11, 2024 -
deploy-weblink Public
Forked from soylent/deploy-weblink
Deploy your own weblink server to Heroku
Dockerfile Updated Jan 9, 2024 -
proxxy Public
Forked from soylent/proxxy
https and socks5 proxy server
Ruby MIT License Updated Jan 9, 2024