Skip to content
View e06084's full-sized avatar

Block or report e06084

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,050 748 Updated Nov 7, 2024

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 16,108 4,119 Updated Nov 9, 2024

The hardware design for AgiBot X1.

644 204 Updated Nov 5, 2024

The inference module for AgiBot X1.

C++ 1,169 368 Updated Oct 28, 2024

The reinforcement learning training code for AgiBot X1.

Python 1,090 343 Updated Oct 23, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 29,839 4,504 Updated Nov 9, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 15,736 1,535 Updated Oct 15, 2024
TypeScript 2 Updated Oct 7, 2024

The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.

Python 1,259 185 Updated Nov 9, 2024

☁️ Build multimodal AI applications with cloud-native stack

Python 21,122 2,217 Updated Nov 8, 2024

A language model programming library.

Python 5,212 304 Updated Nov 5, 2024

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 6,142 765 Updated Nov 4, 2024

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,613 127 Updated Sep 19, 2023

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 2,164 888 Updated Nov 8, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 9,381 580 Updated Nov 6, 2024

Alpaca dataset from Stanford, cleaned and curated

Python 1,515 152 Updated Apr 14, 2023

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,029 144 Updated Oct 31, 2024

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

Python 436 104 Updated Jul 4, 2022

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Go 25,005 2,655 Updated Nov 9, 2024

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 5,099 515 Updated Nov 7, 2024

Data-Driven Evaluation for LLM-Powered Applications

Python 446 29 Updated Sep 2, 2024

The Open-Source Data Annotation Platform

TypeScript 557 44 Updated Nov 6, 2024

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

Jupyter Notebook 132 4 Updated Sep 6, 2024
Python 349 26 Updated Jul 26, 2024

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform…

Python 2,201 191 Updated Aug 18, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,400 362 Updated Oct 24, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,201 80 Updated Aug 13, 2024

GRUtopia: Dream General Robots in a City at Scale

Python 504 24 Updated Sep 5, 2024
Next