michaelfeil

Michael Feil michaelfeil

AI@Gradient | building infinity | M.Sc. Robotics@TU-Munich

135 followers · 10 following

gradient.ai @Preemo-Inc
San Francisco
01:09 (UTC -07:00)
michaelfeil.eu
in/michael-feil
@feilsystem

Achievements

x3 x2 x3 x3

Achievements

x3 x2 x3 x3

Highlights

Developer Program Member
Pro

infinity Public

Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip

bert-embeddings text-embeddings llm

Python 1,206 82 MIT License 6 issues need help Updated Aug 31, 2024
llama-recipes Public
Forked from meta-llama/llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook Updated Aug 30, 2024
hf-hub-ctranslate2 Public

Connecting Transformers on HuggingFace Hub with CTranslate2

Python 32 2 MIT License Updated Aug 27, 2024
btp-generative-ai-hub-use-cases Public
Forked from SAP-samples/btp-generative-ai-hub-use-cases

Samples on how to build industry solution leveraging generative AI capabilities on top of SAP BTP and integrated with SAP S/4HANA Cloud.

Jupyter Notebook Apache License 2.0 Updated Aug 19, 2024
flash-deberta Public

Deberta, but Flash

Python 1 Updated Aug 1, 2024
datachain Public
Forked from iterative/datachain

DataChain 🔗 Process and curate unstructured data using local ML models and LLM calls

Python Apache License 2.0 Updated Jul 24, 2024
embed Public

A stable, fast and easy-to-use inference library with a focus on a sync-to-async API

Python 43 1 MIT License Updated Jul 23, 2024
qdrant-client Public
Forked from qdrant/qdrant-client

Python client for Qdrant vector search engine

Python Apache License 2.0 Updated Jul 20, 2024
fastembed Public
Forked from qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Python Apache License 2.0 Updated Jul 20, 2024
pylabrobot Public
Forked from PyLabRobot/pylabrobot

An interactive & hardware agnostic interface for lab automation

Python MIT License Updated Jul 20, 2024
BentoInfinity Public
Forked from bentoml/BentoInfinity

Python Updated Jul 9, 2024
triton Public
Forked from triton-lang/triton

Development repository for the Triton language and compiler

C++ MIT License Updated Jul 3, 2024
skyjo_rl Public

Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB

reinforcement-learning rllib pettingzoo mutli-agent

Jupyter Notebook 7 MIT License Updated Jun 20, 2024
nlm-ingestor Public
Forked from nlmatics/nlm-ingestor

This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.

Python 1 Apache License 2.0 Updated Jun 17, 2024
sentence-transformers Public
Forked from UKPLab/sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Python 1 Apache License 2.0 Updated Jun 11, 2024
Verba Public
Forked from weaviate/Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Python BSD 3-Clause "New" or "Revised" License Updated Jun 10, 2024
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attention

Ring attention implementation with flash attention

Python Updated May 20, 2024
shellingham Public
Forked from sarugaku/shellingham

Tool to Detect Surrounding Shell

Python ISC License Updated Apr 9, 2024
ragas Public
Forked from explodinggradients/ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python Apache License 2.0 Updated Apr 2, 2024
referencing Public
Forked from python-jsonschema/referencing

Cross-specification JSON referencing (JSON Schema, OpenAPI, and the one you just made up!)

Python MIT License Updated Apr 1, 2024
onefiveeight Public
Forked from Preemo-Inc/onefiveeight

Python Other Updated Apr 1, 2024
hqq Public
Forked from mobiusml/hqq

Official implementation of Half-Quadratic Quantization (HQQ)

Python Apache License 2.0 Updated Mar 29, 2024
gpt-fast Public
Forked from pytorch-labs/gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 1 BSD 3-Clause "New" or "Revised" License Updated Mar 13, 2024
UniTS Public
Forked from mims-harvard/UniTS

A unified time series model.

MIT License Updated Mar 4, 2024
langchain Public
Forked from langchain-ai/langchain

⚡ Building applications with LLMs through composability ⚡

Python MIT License Updated Feb 22, 2024
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 1 MIT License Updated Feb 18, 2024
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Feb 18, 2024
gritlm Public
Forked from ContextualAI/gritlm

Python MIT License Updated Feb 16, 2024
kernl Public
Forked from ELS-RD/kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook Apache License 2.0 Updated Feb 16, 2024
AutoAWQ Public
Forked from casper-hansen/AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Python MIT License Updated Feb 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Michael Feil michaelfeil

Achievements

Achievements

Highlights

Block or report michaelfeil

infinity Public

llama-recipes Public

hf-hub-ctranslate2 Public

btp-generative-ai-hub-use-cases Public

flash-deberta Public

datachain Public

embed Public

qdrant-client Public

fastembed Public

pylabrobot Public

BentoInfinity Public

triton Public

skyjo_rl Public

nlm-ingestor Public

sentence-transformers Public

Verba Public

ring-flash-attention Public

shellingham Public

ragas Public

referencing Public

onefiveeight Public

hqq Public

gpt-fast Public

UniTS Public

langchain Public

lm-evaluation-harness Public

vllm Public

gritlm Public

kernl Public

AutoAWQ Public