qy1026

🎯

Focusing

qy1026

🎯

Focusing

1 follower · 5 following

Block or Report

Block or report qy1026

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

casper-hansen / AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,533 177 Updated Aug 7, 2024

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 4,067 330 Updated Aug 7, 2024

meta-llama / llama-agentic-system

Agentic components of the Llama Stack APIs

Python 2,840 268 Updated Aug 6, 2024

mem0ai / mem0

The memory layer for Personalized AI

Python 19,172 1,804 Updated Aug 7, 2024

MaxPyx / ollama_embeddings_proxy

Ollama-friendly OpenAI Embeddings Proxy. This script bridges the gap between OpenAI's embedding API and Ollama, making it compatible with the current version of Graphrag.

Python 9 Updated Jul 8, 2024

ollama / ollama-python

Ollama Python library

Python 3,448 283 Updated Aug 2, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 14,194 1,246 Updated Aug 7, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 12,812 1,149 Updated Aug 6, 2024

yuchenlin / LLM-Blender

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut …

Python 838 71 Updated Apr 29, 2024

uclaml / SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

Python 420 55 Updated Aug 4, 2024

lm-sys / arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 358 39 Updated Jul 31, 2024

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 40,690 5,593 Updated Aug 7, 2024

dnhkng / GlaDOS

This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.

Python 2,831 267 Updated Jul 26, 2024

tatsu-lab / alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,363 209 Updated Aug 2, 2024

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,315 371 Updated Aug 1, 2024

gomate-community / GoMate

GoMate：RAG Framework within Reliable input,Trusted output

Python 349 29 Updated Jul 26, 2024

youngyangyang04 / leetcode-master

《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀

Shell 49,640 11,181 Updated Aug 7, 2024

crewAIInc / crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 18,024 2,464 Updated Aug 7, 2024

oddqueue / self-judge

The official implementation of "Aligning Large Language Models by On-Policy Self-Judgment"

4 Updated Mar 5, 2024

hiyouga / LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 28,588 3,498 Updated Aug 7, 2024

princeton-nlp / SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 586 32 Updated Aug 6, 2024

OpenBuddy / OpenBuddy

Open Multilingual Chatbot for Everyone

1,218 69 Updated May 4, 2024

QwenLM / Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,737 389 Updated Aug 5, 2024

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 12,926 1,043 Updated Jul 30, 2024

meta-llama / llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,192 1,581 Updated Aug 6, 2024

esbatmop / MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,285 225 Updated Aug 6, 2024

LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 36,885 3,218 Updated May 7, 2024

weaviate / Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Python 5,164 544 Updated Aug 7, 2024

danielmiessler / fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Python 20,110 2,089 Updated Aug 7, 2024

lobehub / lobe-chat

🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision…

TypeScript 36,451 8,636 Updated Aug 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly