Skip to content
View zoidburg's full-sized avatar
Block or Report

Block or report zoidburg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 3,503 263 Updated Jul 29, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 23,623 2,454 Updated Jul 28, 2024

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 4,436 358 Updated Jul 29, 2024

Stable Diffusion web UI

Python 136,895 26,053 Updated Jul 29, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 13,287 1,127 Updated Jul 29, 2024

OpenRefine is a free, open source power tool for working with messy data and improving it

Java 10,664 1,933 Updated Jul 29, 2024

Omnivore is a complete, open source read-it-later solution for people who like reading.

TypeScript 11,889 602 Updated Jul 29, 2024

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 3,914 324 Updated Jul 29, 2024

Open source platform for the machine learning lifecycle

Python 18,028 4,070 Updated Jul 29, 2024

Parallel computing with task scheduling

Python 12,286 1,691 Updated Jul 29, 2024

Low-code ETL for structured and unstructured data. Generates Python code you can deploy anywhere.

TypeScript 669 23 Updated Jul 29, 2024

⛓️ Langflow is a visual framework for building multi-agent and RAG applications. It's open-source, Python-powered, fully customizable, model and vector store agnostic.

JavaScript 22,679 3,215 Updated Jul 29, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 3,962 289 Updated Jul 27, 2024

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Python 10,033 742 Updated Jul 28, 2024

structured outputs for llms

Python 6,880 555 Updated Jul 29, 2024

Scalable toolkit for data curation

Jupyter Notebook 376 39 Updated Jul 29, 2024

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,175 715 Updated Jul 29, 2024

Open source annotation tool for machine learning practitioners.

Python 9,277 1,691 Updated Jul 28, 2024

Toolkit for creating, sharing and using natural language prompts.

Python 2,610 342 Updated Oct 23, 2023

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Python 1,830 128 Updated Jul 29, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,938 4,419 Updated Jul 29, 2024

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 3,285 262 Updated Jul 23, 2024

🦄 Record your terminal and generate animated gif images or share a web player

JavaScript 15,156 494 Updated Jul 12, 2024

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,263 666 Updated Jul 13, 2024

OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constr…

Java 529 66 Updated Jul 23, 2024

Termux - a terminal emulator application for Android OS extendible by variety of packages.

Java 33,506 3,531 Updated Jul 25, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 12,745 1,251 Updated Jul 29, 2024

A formatter for Python files

Python 13,701 888 Updated Jul 15, 2024

Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.

TypeScript 2,620 298 Updated Jul 29, 2024

🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 4,964 450 Updated Jul 29, 2024
Next