Skip to content
View zoidburg's full-sized avatar

Block or report zoidburg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 12,779 904 Updated Sep 4, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 6,104 524 Updated Sep 4, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 10,731 793 Updated Sep 4, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 25,505 2,753 Updated Aug 30, 2024

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 4,877 405 Updated Aug 29, 2024

Stable Diffusion web UI

Python 139,040 26,394 Updated Sep 4, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 16,637 1,569 Updated Sep 4, 2024

OpenRefine is a free, open source power tool for working with messy data and improving it

Java 10,745 1,939 Updated Sep 2, 2024

Omnivore is a complete, open source read-it-later solution for people who like reading.

TypeScript 12,236 615 Updated Sep 2, 2024

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 4,660 365 Updated Sep 3, 2024

Open source platform for the machine learning lifecycle

Python 18,269 4,131 Updated Sep 4, 2024

Parallel computing with task scheduling

Python 12,369 1,691 Updated Sep 4, 2024

Python-based Low-code ETL for data manipulation and transformation. Generates Python code you can deploy anywhere.

TypeScript 731 31 Updated Sep 1, 2024

Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.

Python 26,687 3,543 Updated Sep 4, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 4,483 348 Updated Sep 4, 2024

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Python 10,627 816 Updated Aug 28, 2024

structured outputs for llms

Python 7,371 595 Updated Sep 4, 2024

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 448 53 Updated Sep 4, 2024

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,361 723 Updated Sep 4, 2024

Open source annotation tool for machine learning practitioners.

Python 9,363 1,706 Updated Sep 3, 2024

Toolkit for creating, sharing and using natural language prompts.

Python 2,639 346 Updated Oct 23, 2023

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Python 2,465 151 Updated Sep 4, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,345 4,473 Updated Sep 4, 2024

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 3,507 294 Updated Aug 29, 2024

🦄 Record your terminal and generate animated gif images or share a web player

JavaScript 15,224 496 Updated Aug 29, 2024

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,372 672 Updated Sep 4, 2024

OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constr…

Java 581 68 Updated Sep 4, 2024

Termux - a terminal emulator application for Android OS extendible by variety of packages.

Java 34,511 3,617 Updated Aug 27, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 16,260 1,654 Updated Sep 4, 2024

A formatter for Python files

Python 13,725 885 Updated Sep 2, 2024
Next