Starred repositories
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
Pytorch version of BERT-whitening
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
[ ECCV 2020 Spotlight ] Pytorch implementation for "Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets"
SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc
1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition
EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
Guideline following Large Language Model for Information Extraction
雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)
Unified Structure Generation for Universal Information Extraction
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
Jupyter Notebooks to help you get hands-on with Pinecone vector databases
Chinese Vision-Language Understanding Evaluation
PyTorch implementation for "Few-Shot Learning with Class Imbalance"