Block or Report
Block or report hupidong
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
A full spaCy pipeline and models for scientific/biomedical documents.
Official implementation of our LREC-COLING 2024 paper "Generative Multimodal Entity Linking".
🦙 Integrating LLMs into structured NLP pipelines
code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》
Tevatron - A flexible toolkit for neural retrieval research and development.
Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]
Awesome-LLM: a curated list of Large Language Model
MiniCPM-2B: An end-side LLM outperforming Llama2-13B.
Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Retrieval and Retrieval-augmented LLMs
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.