Block or Report
Block or report lvchigo
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
Research Code for Multimodal-Cognition Team in Ant Group
Enjoy the magic of Diffusion models!
ImageNet dataset downloader. Creates a custom dataset by specifying the required number of classes and images in a class.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Fast and simple stream processing of files in tar files, useful for deep learning, big data, and many other applications.
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
An open source implementation of CLIP.
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Code examples and resources for DBRX, a large language model developed by Databricks
澎湃新闻,新浪新闻,腾讯新闻,搜狐新闻,新闻联播,泰晤士报,纽约时报,BBCNews,旨在爬取所有新闻门户网站的新闻,禁止将所得数据商用!
API接口大全不断更新中~欢迎Fork和Star(✎ 1.一言(古诗句版)api ✎ 2.必应每日一图api ✎ 3.在线ip查询 ✎ 4.m3u8视频在线解析api ✎ 5.随机生成二次元图片api ✎ 6.快递查询api-支持国内百家快递 ✎ 7.flv视频在线解析api ✎ 8.抖音视频无水印解析api✎ 9.一句话随机图片api✎ 10.QQ用户信息获取api✎11.哔哩哔哩封面图获…
Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Metadata and versioning details for the Common Voice dataset
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)