-
Tsinghua University
- Beijing, China
- https://xuanyuan14.github.io/
Highlights
- Pro
Stars
BlinkDL / nanoRWKV
Forked from karpathy/nanoGPTRWKV in nanoGPT style
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
[SIGIR 2023] This is the official PyTorch implementation for the paper: "EulerNet: Adaptive Feature Interaction Learning via Euler’s Formula for CTR Prediction".
RUCAIBox / EulerNet
Forked from Ethan-TZ/EulerNetThis is the official PyTorch implementation for the paper: "EulerNet: Adaptive Feature Interaction Learning via Euler’s Formula for CTR Prediction".
A series of large language models developed by Baichuan Intelligent Technology
🕹️ A basic gameboy emulator with terminal "Cloud Gaming" support
Build, evaluate, understand, and fix LLM-based apps
deepspeed+trainer简单高效实现多卡微调大模型
T2Ranking: A large-scale Chinese benchmark for passage ranking.
Code to reproduce THUIR‘s submissions for COLIEE 2023 Task1 and Task2
The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval
The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Instruct-tune LLaMA on consumer hardware
Making large AI models cheaper, faster and more accessible
an unbias-learning-to-rank dataset of Baidu
An easy-to-use python toolkit for flexibly adapting various neural ranking models to any target domain.
SIGIR'2022, Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction
程序员延寿指南 | A programmer's guide to live longer
EMNLP 2021 - Pre-training architectures for dense retrieval