Skip to content
View aburkov's full-sized avatar

Block or report aburkov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 15,176 1,103 Updated Sep 30, 2024

real time face swap and one-click video deepfake with only a single image

Python 37,334 5,276 Updated Sep 28, 2024

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,392 153 Updated Feb 24, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,302 929 Updated Sep 30, 2024

An implementation of Shazam's song recognition algorithm.

Go 1,829 113 Updated Sep 28, 2024

A vector search SQLite extension that runs anywhere!

C 3,862 131 Updated Sep 26, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,164 88 Updated Sep 24, 2024

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,126 139 Updated Sep 30, 2024

Apps Script samples for Google Workspace products.

JavaScript 4,510 1,834 Updated Sep 24, 2024

Use Large Language Models (LLM) in Google Sheets

JavaScript 19 2 Updated Jul 20, 2024

🔥Highlighting the top ML papers every week.

9,999 583 Updated Sep 23, 2024

Data validation using Python type hints

Python 20,637 1,853 Updated Sep 30, 2024

The platform for building AI from enterprise data

Python 26,336 4,788 Updated Sep 30, 2024

📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.

HTML 442 39 Updated Jun 5, 2024

âš¡FlashRAG: A Python Toolkit for Efficient RAG Research

Python 1,158 86 Updated Sep 30, 2024

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Jupyter Notebook 2,127 200 Updated Aug 4, 2024

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,172 219 Updated Sep 19, 2024

High-quality datasets, tools, and concepts for LLM fine-tuning.

1,739 165 Updated Aug 18, 2024

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,522 271 Updated Sep 30, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 4,050 370 Updated Sep 30, 2024

Create web-based user interfaces with Python. The nice way.

Python 8,928 542 Updated Sep 30, 2024

An awesome repository of local AI tools

1,169 96 Updated Jun 21, 2024

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 3,832 360 Updated Sep 30, 2024

Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.

Python 681 65 Updated Aug 22, 2024

Pydantic extension for annotating autocorrecting fields.

Python 205 3 Updated Jun 20, 2024

Training LLMs with QLoRA + FSDP

Jupyter Notebook 1,396 187 Updated Sep 23, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 91,481 7,197 Updated Sep 29, 2024

The official Python library for the OpenAI API

Python 22,220 3,078 Updated Sep 27, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,929 502 Updated Sep 30, 2024

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,230 399 Updated Sep 13, 2024
Next