Skip to content
View yao5461's full-sized avatar
  • NULL
  • Hangzhou, Zhejiang, China
Block or Report

Block or report yao5461

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing"

Python 164 21 Updated Jul 5, 2024

雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)

206 6 Updated Mar 28, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

1,237 63 Updated Jul 3, 2024

Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.

Python 17 1 Updated Jun 13, 2024

Continual Learning of Large Language Models: A Comprehensive Survey

158 11 Updated Jul 2, 2024

FuseAI Project

Python 72 28 Updated Jun 12, 2024
Python 3 Updated May 20, 2024
Python 197 8 Updated Mar 26, 2024

👫 A curated list of Model Merging methods.

50 4 Updated Mar 9, 2024

Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind

Python 145 9 Updated Feb 15, 2024

A framework for merging models solving different tasks with different initializations into one multi-task model without any additional training

Python 264 19 Updated Jan 18, 2024

Code for "Merging Text Transformers from Different Initializations"

Python 15 Updated Mar 28, 2024

Turn expensive prompts into cheap fine-tuned models

TypeScript 2,428 124 Updated May 25, 2024
69 Updated Mar 29, 2024

Codebase for Merging Language Models (ICML 2024)

Python 689 40 Updated May 5, 2024

Tools for merging pretrained large language models.

Python 4,042 350 Updated Jul 6, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 17,878 1,834 Updated Apr 30, 2024

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 12,834 1,170 Updated Jul 2, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 36,436 4,928 Updated Jul 6, 2024

Drag & drop UI to build your customized LLM flow

TypeScript 27,287 14,089 Updated Jul 6, 2024

Official inference library for Mistral models

Jupyter Notebook 9,145 801 Updated Jun 22, 2024

Official Kaggle API

Python 6,016 1,060 Updated Jul 6, 2024

[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".

Python 96 7 Updated May 14, 2024

This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that auton…

Python 126 12 Updated Jun 26, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 3,554 291 Updated Jul 6, 2024

Fast and memory-efficient exact attention

Python 11,886 1,054 Updated Jul 6, 2024

[ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"

Python 61 8 Updated Jun 6, 2024

Aligning Large Language Models on Information Extraction

Python 23 1 Updated May 10, 2024

TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Python 80 6 Updated Jun 27, 2024
Next