chenchongthu

Follow

Chong Chen chenchongthu

Follow

Tsinghua University

201 followers · 5 following

Tsinghua University
https://chenchongthu.github.io

Achievements

Achievements

Stars

OFA-Sys / Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,446 460 Updated Aug 6, 2024

THUIR / T2Ranking

T2Ranking: A large-scale Chinese benchmark for passage ranking.

Python 148 9 Updated Jul 3, 2023

PhoebusSi / Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,602 246 Updated Dec 12, 2023

miiiiiko / ce_pretrain

预训练中英文混合bert模型

Python 1 Updated Feb 6, 2023

Arslan-Z / zuowen-dataset-pt1

:paper: 作文数据集 - 第 1 部分

11 Updated Apr 9, 2020

wuyaoxuehun / colbert

colbert for dense retrieval, including multi view version, dureader-retrieval as an example

Python 6 Updated Jun 16, 2022

OpenMatch / OpenMatch

An Open-Source Package for Information Retrieval

Python 149 20 Updated Oct 8, 2024

yashprakash13 / haystack-search-engine

A Semantic Search Engine Built on Arxiv dataset from Kaggle.

Jupyter Notebook 7 2 Updated May 7, 2021

deepset-ai / haystack

🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your da…

Python 17,232 1,887 Updated Oct 21, 2024

staoxiao / RetroMAE

Codebase for RetroMAE and beyond.

Python 232 18 Updated Jun 7, 2024

facebookresearch / contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 668 59 Updated Apr 7, 2023

IDEA-CCNL / Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系，成为中文AIGC和认知智能的基础设施。

Python 4,011 376 Updated Aug 13, 2024

tamlhp / awesome-machine-unlearning

Awesome Machine Unlearning (A Survey of Machine Unlearning)

Jupyter Notebook 718 51 Updated Sep 18, 2024

thunlp / LEVEN

Source code and dataset for ACL2022 Findings Paper "LEVEN: A Large-Scale Chinese Legal Event Detection dataset"

Python 103 26 Updated Aug 4, 2023

Wenorm / BERT-PLI

bert-pli应用于LeCaRD

Python 15 4 Updated Nov 14, 2021

Guangxuan-Xiao / Case-Search

Jupyter Notebook 4 1 Updated Jun 7, 2022

thunlp / LegalPLMs

Source code and checkpoints for legal pre-trained language models.

Python 173 25 Updated May 9, 2021

AtomEcho / WebTable

A python package that takes tables from a web page and processes them to get high quality tables

Python 53 2 Updated Aug 30, 2022

jeykigung / P5

Python 309 43 Updated Oct 9, 2023

myx666 / LeCaRD

A Chinese legal case retrieval dataset.

Python 118 15 Updated Jan 2, 2024

THUwangcy / DirectAU

KDD'2022: Towards Representation Alignment and Uniformity in Collaborative Filtering

Python 65 5 Updated Oct 27, 2022

thunlp / PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

4,068 379 Updated Jul 17, 2023

evison / XAI

Towards Explainable Artificial Intelligence

5 Updated Jun 20, 2020

evison / Human-XAI

Human-centered Explainable AI

5 1 Updated Mar 19, 2022

chenchongthu / Recommendation-Unlearning

Python 38 10 Updated Nov 24, 2023

tuna / thuthesis

LaTeX Thesis Template for Tsinghua University

TeX 4,558 1,075 Updated Sep 27, 2024

jindi-tju / U-GCN

Source code of "NeurIPS21 - Universal Graph Convolutional Networks"

Python 19 5 Updated Nov 18, 2021

gr8joo / MVTCAE

Python 13 4 Updated Oct 27, 2021

samihaija / isvd

Official implementation of NeurIPS'21: Implicit SVD for Graph Representation Learning

Python 19 Updated Nov 4, 2021

Gabe-YHLee / NRAE-public

Python 28 2 Updated Mar 30, 2023