Skip to content
View pepesi's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report pepesi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".

Python 20 Updated Jun 21, 2024

Cloud Native API Gateway | 云原生API网关

Go 2,515 411 Updated Jul 3, 2024

OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…

JavaScript 16,054 3,722 Updated Jul 3, 2024

A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.

Go 1,245 315 Updated Jul 3, 2024

Kubernetes-native Job Queueing

Go 1,249 222 Updated Jul 3, 2024

3D Visualization of an GPT-style LLM

TypeScript 3,200 359 Updated Apr 11, 2024

Descheduler for Kubernetes

Go 4,230 645 Updated Jul 2, 2024

HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container

C 37 14 Updated May 7, 2024

OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow applications to access larger memory space than its physical ca…

Go 437 85 Updated May 21, 2024

基于d4nst/RotNet的使用,实现模拟完成旋转拖动验证码

Python 158 61 Updated Nov 21, 2022
Go 6 2 Updated Jan 22, 2024

A Huggingface proxy deployed on Cloudflare Workers, tailored for Chinese users. 🌐🚀

TypeScript 18 2 Updated Jul 3, 2024

Meshery, the cloud native manager

JavaScript 5,100 1,590 Updated Jul 3, 2024

CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资…

Vue 1,462 183 Updated Jul 3, 2024

Unify Efficient Fine-Tuning of 100+ LLMs

Python 25,515 3,157 Updated Jul 3, 2024

OpenID Connect (OIDC) identity and OAuth 2.0 provider with pluggable connectors

Go 9,193 1,666 Updated Jul 3, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 31,749 3,803 Updated Jul 2, 2024

RDMA core userspace libraries and daemons

C 1,404 659 Updated Jul 2, 2024

Mellanox libibverbs

C++ 48 13 Updated Aug 28, 2019

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4

C++ 2,807 325 Updated Jun 24, 2024

Inference code for CodeLlama models

Python 15,411 1,785 Updated May 21, 2024

Hamibot遥控器

C# 31 5 Updated Oct 5, 2023

NVIDIA NCCL Tests for Distributed Training

Shell 39 13 Updated Jun 26, 2024

A conda-forge distribution.

Shell 5,735 303 Updated Jul 1, 2024

A Cloud Native Batch System (Project under CNCF)

Go 3,909 902 Updated Jul 3, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 9,685 794 Updated Jun 10, 2024

Cloud native networking and network security

Go 5,691 1,268 Updated Jul 3, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 163,589 43,414 Updated Jul 3, 2024

K8s 集群证书过期处理,更新 kubeadm 生成的证书有效期为 10 年。支持全部版本。

Shell 484 269 Updated May 15, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 33,661 3,954 Updated Jul 2, 2024
Next