Block or Report
Block or report countback
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
Retrieval and Retrieval-augmented LLMs
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Code for the paper ''SH2: Self-Highlighted Hesitation Helps You Decode More Truthfully''
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking
[WWW2022] Geometric Graph Representation Learning via Maximizing Rate Reduction
Instruction Tuning with GPT-4
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
Fine-grained Angular Contrastive Learning with Coarse Labels - Official Repository
A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.
code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
This is the official repository for the IBKD knowledge distillation method, as described in the paper .
[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton
A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks
LlamaIndex is a data framework for your LLM applications
The Official Implementation of CFCD. Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval
[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
Knowledge distillation methods implemented with Tensorflow (now there are 11 (+1) methods, and will be added more.)
Knowledge distillation in text classification with pytorch. 知识蒸馏,中文文本分类,教师模型BERT、XLNET,学生模型biLSTM。
SGPT: GPT Sentence Embeddings for Semantic Search
古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard