starsy
  • Cisco Systems
  • Shanghai, China


Convert PDF to markdown quickly with high accuracy

Python 16,280 916 Updated Sep 7, 2024

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 4,732 370 Updated Sep 10, 2024

LLM inference in C/C++

C++ 64,728 9,275 Updated Sep 10, 2024

🍦 ChatTTS-Forge is a project developed around a TTS generation model, implementing an API server and a Gradio-based WebUI.

Python 643 81 Updated Sep 8, 2024

Officially recommended ChatTTS resource collection project, gathering related resources from across the web and frequently asked questions

991 66 Updated Jul 3, 2024

Infinity is a high-throughput, low-latency REST API for serving text-embedding, reranking, and CLIP models

Python 1,236 85 Updated Sep 3, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 30,525 3,754 Updated Sep 9, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 26,223 3,846 Updated Sep 10, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,011 1,041 Updated May 23, 2024

OCR, layout analysis, reading order, line detection in 90+ languages

Python 9,789 633 Updated Aug 26, 2024

A Python library to extract tabular data from PDFs

Python 2,908 462 Updated Aug 19, 2024

Go ahead and axolotl questions

Python 7,502 809 Updated Sep 7, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 44,669 6,272 Updated Sep 10, 2024

Use PEFT or Full-parameter to finetune 300+ LLMs or 80+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Python 3,335 279 Updated Sep 10, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,319 174 Updated Jul 16, 2024

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 4,311 459 Updated Aug 19, 2024

Security and compliance proxy for LLM APIs

JavaScript 43 9 Updated Jul 21, 2023

Python SDK, Proxy Server to call 100+ LLM APIs using the OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 12,092 1,397 Updated Sep 10, 2024

A ChatGPT demo web page built with Express and Vue3

Vue 31,249 11,215 Updated Aug 16, 2024

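OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, and deploys with one click, ready to use out of the box. OpenAI key management & redistributi…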

JavaScript 17,799 4,030 Updated Aug 21, 2024

Container plugin for Slurm Workload Manager

C 275 31 Updated Jul 31, 2024

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

C++ 23,171 1,753 Updated Sep 10, 2024

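🚀 A knowledge-base question-answering system built on large language models and RAG. Ready to use out of the box, model-neutral, and flexibly orchestrated, with support for quick embedding into third-party business systems.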

Python 9,847 1,307 Updated Sep 10, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 11,717 821 Updated Aug 31, 2024

This repo includes ChatGPT prompt curation to use ChatGPT better.

HTML 110,335 15,016 Updated Sep 3, 2024

Minimal keyword extraction with BERT

Python 3,421 342 Updated Jul 16, 2024

structured outputs for llms

Python 7,454 594 Updated Sep 8, 2024

Joplin - the privacy-focused note taking app with sync capabilities for Windows, macOS, Linux, Android and iOS.

TypeScript 45,043 4,899 Updated Sep 9, 2024

Fork of turndown-plugin-gfm for Joplin

JavaScript 12 5 Updated Jun 27, 2021

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 32,300 3,717 Updated Sep 10, 2024