Skip to content
View hzhwcmhf's full-sized avatar

Organizations

@thu-coai @QwenLM
Block or Report

Block or report hzhwcmhf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python 240 21 Updated Apr 20, 2024

ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.

Python 21 Updated Jun 24, 2024

[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models".

Python 17 Updated May 29, 2024

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 300 15 Updated Jul 18, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,420 366 Updated Jul 18, 2024
Python 40 2 Updated Apr 2, 2024

[ACL'24 Oral] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python 314 13 Updated Jul 9, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,392 333 Updated May 28, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,107 181 Updated Jul 22, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 12,749 1,028 Updated Jun 27, 2024

隐藏miui剪贴板对话框

Kotlin 12 2 Updated Jul 24, 2022

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 3,916 540 Updated May 23, 2024

搜索所有中文NLP数据集,附常用英文NLP数据集

Python 4,025 603 Updated Nov 21, 2022

记录本人整理的一些数据集

967 131 Updated Jun 16, 2022

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 17,990 1,896 Updated Apr 4, 2024

GPT4 & LangChain Chatbot for large PDF docs

TypeScript 14,764 2,999 Updated Mar 25, 2024

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,045 318 Updated Jul 21, 2024

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,494 93 Updated Feb 16, 2024

a large-scale Chinese parabank via machine translation

1 Updated Oct 30, 2022

Improving Non-autoregressive Generation with Mixup Training

Python 7 1 Updated Sep 5, 2022

Official Implementation for the ICML2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation"

Python 118 15 Updated Sep 10, 2023

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,407 376 Updated Jul 20, 2024

SimCSE在中文任务上的简单实验

Python 586 85 Updated Aug 7, 2023

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

305 29 Updated Mar 15, 2023

The entmax mapping and its loss, a family of sparse softmax alternatives.

Python 400 43 Updated Jun 22, 2024

CUDA kernels for generalized matrix-multiplication in PyTorch

Jupyter Notebook 78 13 Updated Oct 11, 2021

Development repository for the Triton language and compiler

C++ 12,053 1,434 Updated Jul 23, 2024

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,538 154 Updated Jul 12, 2024

pytorch memory track code

Python 975 153 Updated May 4, 2021
Next