Skip to content
View ZanePoe's full-sized avatar
Block or Report

Block or report ZanePoe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

data

329 repositories

the AI-native open-source embedding database

Rust 14,107 1,184 Updated Aug 15, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,296 227 Updated Aug 6, 2024

A cloud-native vector database, storage for next generation AI applications

Go 28,887 2,778 Updated Aug 15, 2024

Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 19,460 1,318 Updated Aug 15, 2024

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

JavaScript 32,668 3,939 Updated Aug 13, 2024

Fast and secure standalone server for resizing and converting remote images

Go 8,679 621 Updated Aug 14, 2024

Selenium plugin to manage multi level shadow DOM elements on web page.

Python 46 9 Updated Feb 11, 2023

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 10,375 910 Updated Aug 15, 2024

为ChatGLM设计的微调数据集生成工具,速来制作自己的猫娘。

Python 587 71 Updated Feb 25, 2024

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Python 1,167 64 Updated Jul 11, 2024

CommonMark/Markdown Java parser with source level AST. CommonMark 0.28, emulation of: pegdown, kramdown, markdown.pl, MultiMarkdown. With HTML to MD, MD to PDF, MD to DOCX conversion modules.

Java 2,239 263 Updated Aug 6, 2024

A revolutionary ORM framework for both java and kotlin.

Java 688 70 Updated Aug 14, 2024

Captcha solver extension for humans, available for Chrome, Edge and Firefox

JavaScript 7,686 580 Updated Jun 4, 2024

承影 - 一款安全工具箱,集成了目录扫描、JWT、Swagger 测试、编/解码、轻量级 BurpSuite、杀软辅助功能

Go 354 25 Updated Jun 11, 2023

Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis.

TypeScript 135 22 Updated Jul 25, 2024

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 888 34 Updated Jun 11, 2024

A Chinese serif font derived from IPAmj Mincho. 一款衍生于「IPAmj明朝」的中文宋体字型。

615 5 Updated Aug 15, 2024

User input simulation for Buster

Go 249 36 Updated Dec 15, 2022

A caching library for Python

Python 415 44 Updated Dec 22, 2023

已迁移新仓库,此版本将不再维护

8,353 1,293 Updated Sep 7, 2023

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 12,943 1,691 Updated Aug 15, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 2,684 255 Updated Aug 1, 2024

Segment Anything in High Quality [NeurIPS 2023]

Python 3,603 217 Updated Jul 7, 2024

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。

Python 8,309 1,308 Updated Jul 7, 2024
HTML 25 28 Updated Jul 3, 2023

Parallel programming with Python

Python 410 62 Updated Jul 17, 2024

团子翻译器 —— 个人兴趣制作的一款基于OCR技术的翻译器

Python 6,857 519 Updated Jul 8, 2024

🐤 Kiwi-国际化翻译全流程解决方案

TypeScript 2,514 228 Updated Apr 18, 2024

🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.

Java 14,585 1,631 Updated Aug 15, 2024

Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon

C++ 8,778 487 Updated Aug 15, 2024