Stars
A list of medical imaging datasets (An Index for Medical Imaging Datasets)
Retrieval and Retrieval-augmented LLMs
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
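A summary of the knowledge that natural language processing (NLP) engineers need to build up, including interview questions, fundamentals, and engineering skills, to strengthen core competitiveness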
An annotated implementation of the Transformer paper.
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
A high-throughput and memory-efficient inference and serving engine for LLMs
PyTorch distributed tutorials
First-place (top-1) solution for the Tianchi algorithm competition "BetterMixture - Large Model Data Mixing Challenge"
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
OpenAI CLIP text encoders for multiple languages!
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023)
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
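A text-to-image project: based on the open-source model Stable Diffusion V1.5, it produces models that can run on a phone's CPU and NPU, together with the accompanying model runtime framework.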
Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
Code and documentation to train Stanford's Alpaca models, and generate the data.
Code samples used on cloud.google.com
A library for efficient similarity search and clustering of dense vectors.
Code to evaluate LibreTranslate (I must improve the code)