Stars
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT4.0/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
PyTorch code and models for the DINOv2 self-supervised learning method.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A programming framework for agentic AI 🤖
🦜🔗 Build context-aware reasoning applications
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
sunkx109 / llama
Forked from meta-llama/llamaInference code for LLaMA models
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
Fast and memory-efficient exact attention
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A high-throughput and memory-efficient inference and serving engine for LLMs
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
The Triton TensorRT-LLM Backend
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc,…
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程