- San Francisco, CA
Highlights
- Pro
Block or Report
Block or report seme0021
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
🎓 Path to a free self-taught education in Computer Science!
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Instruct-tune LLaMA on consumer hardware
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
This project is a Python script that scrapes data from a Gumroad site, generates a colorful and well-designed HTML page using OpenAI's GPT-4 model, and deploys the generated page to Vercel.
Open-source vector similarity search for Postgres
Adding guardrails to large language models.
Code samples and instructions for implementing group messaging in WhatsApp with the Twilio Conversations API
Universal personal search engine, powered by a full text search algorithm written in pure Ink, indexing Linus's blogs and private note archives, contacts, tweets, and over a decade of journals.
the AI-native open-source embedding database
The simplest, fastest repository for training/finetuning medium-sized GPTs.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
The Schema-Guided Dialogue Dataset
🐦 Quickly annotate data from the comfort of your Jupyter notebook
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
An open-source NLP research library, built on PyTorch.
PyTorch implementation of the NIPS-17 paper "Poincaré Embeddings for Learning Hierarchical Representations"