Skip to content
View chenchongthu's full-sized avatar

Block or report chenchongthu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,446 460 Updated Aug 6, 2024

T2Ranking: A large-scale Chinese benchmark for passage ranking.

Python 148 9 Updated Jul 3, 2023

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,602 246 Updated Dec 12, 2023

预训练中英文混合bert模型

Python 1 Updated Feb 6, 2023

:paper: 作文数据集 - 第 1 部分

11 Updated Apr 9, 2020

colbert for dense retrieval, including multi view version, dureader-retrieval as an example

Python 6 Updated Jun 16, 2022

An Open-Source Package for Information Retrieval

Python 149 20 Updated Oct 8, 2024

A Semantic Search Engine Built on Arxiv dataset from Kaggle.

Jupyter Notebook 7 2 Updated May 7, 2021

🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your da…

Python 17,232 1,887 Updated Oct 21, 2024

Codebase for RetroMAE and beyond.

Python 232 18 Updated Jun 7, 2024

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 668 59 Updated Apr 7, 2023

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,011 376 Updated Aug 13, 2024

Awesome Machine Unlearning (A Survey of Machine Unlearning)

Jupyter Notebook 718 51 Updated Sep 18, 2024

Source code and dataset for ACL2022 Findings Paper "LEVEN: A Large-Scale Chinese Legal Event Detection dataset"

Python 103 26 Updated Aug 4, 2023

bert-pli应用于LeCaRD

Python 15 4 Updated Nov 14, 2021
Jupyter Notebook 4 1 Updated Jun 7, 2022

Source code and checkpoints for legal pre-trained language models.

Python 173 25 Updated May 9, 2021

A python package that takes tables from a web page and processes them to get high quality tables

Python 53 2 Updated Aug 30, 2022
Python 309 43 Updated Oct 9, 2023

A Chinese legal case retrieval dataset.

Python 118 15 Updated Jan 2, 2024

KDD'2022: Towards Representation Alignment and Uniformity in Collaborative Filtering

Python 65 5 Updated Oct 27, 2022

Must-read papers on prompt-based tuning for pre-trained language models.

4,068 379 Updated Jul 17, 2023

Towards Explainable Artificial Intelligence

5 Updated Jun 20, 2020

Human-centered Explainable AI

5 1 Updated Mar 19, 2022

LaTeX Thesis Template for Tsinghua University

TeX 4,558 1,075 Updated Sep 27, 2024

Source code of "NeurIPS21 - Universal Graph Convolutional Networks"

Python 19 5 Updated Nov 18, 2021
Python 13 4 Updated Oct 27, 2021

Official implementation of NeurIPS'21: Implicit SVD for Graph Representation Learning

Python 19 Updated Nov 4, 2021
Python 28 2 Updated Mar 30, 2023
Next