Skip to content
View qy1026's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report qy1026

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,533 177 Updated Aug 7, 2024

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 4,067 330 Updated Aug 7, 2024

Agentic components of the Llama Stack APIs

Python 2,840 268 Updated Aug 6, 2024

The memory layer for Personalized AI

Python 19,172 1,804 Updated Aug 7, 2024

Ollama-friendly OpenAI Embeddings Proxy. This script bridges the gap between OpenAI's embedding API and Ollama, making it compatible with the current version of Graphrag.

Python 9 Updated Jul 8, 2024

Ollama Python library

Python 3,448 283 Updated Aug 2, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 14,194 1,246 Updated Aug 7, 2024

Fast and memory-efficient exact attention

Python 12,812 1,149 Updated Aug 6, 2024

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut …

Python 838 71 Updated Apr 29, 2024

The official implementation of Self-Play Preference Optimization (SPPO)

Python 420 55 Updated Aug 4, 2024

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 358 39 Updated Jul 31, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 40,690 5,593 Updated Aug 7, 2024

This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.

Python 2,831 267 Updated Jul 26, 2024

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,363 209 Updated Aug 2, 2024

Robust recipes to align language models with human and AI preferences

Python 4,315 371 Updated Aug 1, 2024

GoMate:RAG Framework within Reliable input,Trusted output

Python 349 29 Updated Jul 26, 2024

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 49,640 11,181 Updated Aug 7, 2024

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 18,024 2,464 Updated Aug 7, 2024

The official implementation of "Aligning Large Language Models by On-Policy Self-Judgment"

4 Updated Mar 5, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 28,588 3,498 Updated Aug 7, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 586 32 Updated Aug 6, 2024

Open Multilingual Chatbot for Everyone

1,218 69 Updated May 4, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,737 389 Updated Aug 5, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 12,926 1,043 Updated Jul 30, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,192 1,581 Updated Aug 6, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,285 225 Updated Aug 6, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 36,885 3,218 Updated May 7, 2024

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Python 5,164 544 Updated Aug 7, 2024

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Python 20,110 2,089 Updated Aug 7, 2024

🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision…

TypeScript 36,451 8,636 Updated Aug 7, 2024
Next