Skip to content
View xuanyuan14's full-sized avatar

Highlights

  • Pro

Block or report xuanyuan14

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

RWKV in nanoGPT style

Python 176 11 Updated Jun 9, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,629 858 Updated Oct 31, 2024
Python 2 Updated Aug 15, 2023
Python 7 4 Updated Jan 26, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,614 191 Updated Mar 8, 2024

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Python 1,289 106 Updated Nov 7, 2024

[SIGIR 2023] This is the official PyTorch implementation for the paper: "EulerNet: Adaptive Feature Interaction Learning via Euler’s Formula for CTR Prediction".

Python 15 5 Updated Jul 31, 2024

This is the official PyTorch implementation for the paper: "EulerNet: Adaptive Feature Interaction Learning via Euler’s Formula for CTR Prediction".

Python 25 3 Updated Jul 31, 2024

Code for AAAI 2024 paper Wikiformer

Python 16 Updated Dec 21, 2023

A series of large language models developed by Baichuan Intelligent Technology

Python 4,085 295 Updated Jun 22, 2024

🕹️ A basic gameboy emulator with terminal "Cloud Gaming" support

Go 4,678 233 Updated Jan 16, 2024

LexiLaw - 中文法律大模型

Python 715 94 Updated Jul 31, 2023

Build, evaluate, understand, and fix LLM-based apps

Jupyter Notebook 484 33 Updated Jan 16, 2024

deepspeed+trainer简单高效实现多卡微调大模型

Python 116 10 Updated May 27, 2023

T2Ranking: A large-scale Chinese benchmark for passage ranking.

Python 150 9 Updated Jul 3, 2023

Code to reproduce THUIR‘s submissions for COLIEE 2023 Task1 and Task2

Python 23 1 Updated May 12, 2023

The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval

Python 24 2 Updated Jun 7, 2023

The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval

Python 73 5 Updated May 9, 2023

A Large-Scale Chinese Legal Case Retrieval Dataset

47 Updated Oct 7, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,651 5,211 Updated Jun 27, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,632 2,219 Updated Jul 29, 2024

Making large AI models cheaper, faster and more accessible

Python 38,777 4,342 Updated Nov 6, 2024

Our code for WSDM Cup 2023 Task 1 and 2

Python 9 Updated Jan 16, 2023

an unbias-learning-to-rank dataset of Baidu

Python 60 8 Updated Aug 3, 2024

THUIR website

HTML 8 17 Updated Sep 6, 2024

An easy-to-use python toolkit for flexibly adapting various neural ranking models to any target domain.

Python 59 5 Updated May 17, 2023

SIGIR'2022, Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction

Python 24 3 Updated Nov 8, 2022

程序员延寿指南 | A programmer's guide to live longer

29,954 2,100 Updated Jan 30, 2024

SimCSE在中文任务上的简单实验

Python 591 83 Updated Aug 7, 2023

EMNLP 2021 - Pre-training architectures for dense retrieval

Python 244 23 Updated Mar 18, 2022
Next