Skip to content
View SecretSun's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report SecretSun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.

Rust 4,362 146 Updated Aug 17, 2024

For recording and retrieving metadata associated with ML developer and data scientist workflows.

C++ 610 139 Updated Aug 15, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 24,965 3,604 Updated Aug 19, 2024

markdown docs

Shell 62 58 Updated Aug 18, 2024

Complete container management platform

Go 23,108 2,939 Updated Aug 18, 2024

LlamaIndex is a data framework for your LLM applications

Python 34,504 4,871 Updated Aug 19, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 42,098 5,796 Updated Aug 18, 2024

Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 172 24 Updated Aug 5, 2024

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,109 2,222 Updated Aug 1, 2024

The reference implementation of the Linux FUSE (Filesystem in Userspace) interface

C 5,178 1,123 Updated Aug 14, 2024

Universal LLM Deployment Engine with ML Compilation

Python 18,422 1,471 Updated Aug 18, 2024

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch…

Python 1,776 285 Updated Dec 2, 2023

Implementation of a local in-memory cache for Triton Inference Server's TRITONCACHE API

C++ 4 1 Updated Aug 11, 2024
Python 66 38 Updated Aug 15, 2024

Export s3fs for aliyun oss.

C++ 733 153 Updated Jun 6, 2024

本项目是一个 Redis 成本优化工程沉淀的工具集,包含了 Redis 的常用操作,比如 Redis 流量复制回放、Redis 数据在线压缩解压缩、Redis 数据清理TTL设置等工具集。

Java 27 8 Updated Jul 19, 2024

Kubernetes-native Job Queueing

Go 1,295 227 Updated Aug 18, 2024

A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC

Go 1,075 265 Updated May 22, 2023

Example TensorFlow codes and Caicloud TensorFlow as a Service dev environment.

Jupyter Notebook 2,936 2,080 Updated Aug 13, 2019

TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data.

Python 180 33 Updated May 26, 2022
Go 157 2 Updated Jul 10, 2024

Read and write Tensorflow TFRecord data from Apache Spark.

Scala 284 57 Updated Apr 22, 2024

Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO

C++ 696 284 Updated Aug 14, 2024

alibabacloud-pai-dsw-cn-demo

Python 7 1 Updated Mar 13, 2020

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 4,419 347 Updated Aug 18, 2024

A FastAPI Middleware of https://github.com/joerick/pyinstrument to check your service performance.

Python 224 13 Updated May 17, 2024

KubeMQ is a Kubernetes native message queue broker

Go 654 48 Updated Feb 18, 2023

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 4,726 396 Updated Aug 3, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 15,340 1,398 Updated Aug 18, 2024
Next