Skip to content
View hupidong's full-sized avatar
Block or Report

Block or report hupidong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 8,027 783 Updated Jul 17, 2024

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,322 505 Updated Jul 2, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 234 27 Updated Jul 22, 2024

Structured Text Generation

Python 7,279 374 Updated Jul 22, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 27,435 3,380 Updated Jul 22, 2024
Python 5 1 Updated Jan 8, 2024

雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)

228 7 Updated Mar 28, 2024

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Python 1,796 134 Updated Jul 17, 2024

structured outputs for llms

Python 6,790 544 Updated Jul 23, 2024

ACL 2023 research paper

Python 8 Updated Sep 4, 2023

Autoregressive Entity Retrieval

Python 750 95 Updated Jul 6, 2023

A full spaCy pipeline and models for scientific/biomedical documents.

Python 1,653 223 Updated Mar 30, 2024

Official implementation of our LREC-COLING 2024 paper "Generative Multimodal Entity Linking".

Python 25 3 Updated Apr 19, 2024

🦙 Integrating LLMs into structured NLP pipelines

Python 1,027 81 Updated Jul 11, 2024

code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》

Python 28 2 Updated Jan 9, 2024

Tevatron - A flexible toolkit for neural retrieval research and development.

Python 443 88 Updated Jun 26, 2024

This is the repo for the survey of LLM4IR.

373 32 Updated Jul 9, 2024
Python 2,593 297 Updated Jul 17, 2024

Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]

Python 474 41 Updated Mar 10, 2024

Awesome-LLM: a curated list of Large Language Model

16,278 1,304 Updated Jul 22, 2024

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,453 321 Updated Jul 16, 2024

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,317 477 Updated Jul 16, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,151 442 Updated Jul 14, 2024

The official Meta Llama 3 GitHub site

Python 23,461 2,534 Updated Jul 17, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 12,753 1,028 Updated Jun 27, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,428 366 Updated Jul 18, 2024

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Python 4,053 547 Updated Jul 8, 2024

Grok open release

Python 49,190 8,311 Updated May 29, 2024
Python 4,062 508 Updated Mar 19, 2024
Next