Pythonic file-system interface for TOS (Tinder Object Storage) https://tosfs.readthedocs.io/en/latest/

Python 9 Updated Nov 26, 2024

Distributed data engine for Python/SQL designed for the cloud, powered by Rust

Rust 2,357 169 Updated Nov 27, 2024

All-in-one text de-duplication

Python 623 71 Updated May 21, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 34,178 5,803 Updated Nov 27, 2024
Java 3 3 Updated Nov 4, 2024

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

Java 1 Updated Oct 8, 2024

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 1 Updated Nov 26, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 135,527 27,129 Updated Nov 27, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 99,558 7,932 Updated Nov 27, 2024

Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.

Java 797 187 Updated Nov 14, 2024
C++ 3 Updated Jun 20, 2023

StarRocks is a next-gen sub-second MPP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.

Java 1 Updated Sep 26, 2024

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,191 1,826 Updated Nov 27, 2024

Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution

Java 139 68 Updated Jan 3, 2023

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,042 28,346 Updated Nov 27, 2024

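Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English approximation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English string segmentation, various Chinese word vectors, comprehensive list of company names, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …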

Python 69,370 14,534 Updated May 10, 2024

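A production-grade, high-performance, modular, and extensible Chinese NLP toolkit. (Chinese word segmentation, averaged perceptron, fastText, pinyin, new word discovery, segmentation error correction, BM25, personal name recognition, named entities, custom dictionaries)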

Java 678 89 Updated Dec 21, 2023

Research and applications of three major technologies: natural language processing, knowledge graphs, and dialogue systems.

1,641 362 Updated May 21, 2023

Open source platform for the machine learning lifecycle

Python 18,878 4,247 Updated Nov 27, 2024