Skip to content
View lvchigo's full-sized avatar
Block or Report

Block or report lvchigo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Research Code for Multimodal-Cognition Team in Ant Group

Python 65 2 Updated Jul 11, 2024

Enjoy the magic of Diffusion models!

Python 5,858 524 Updated Jul 12, 2024

ImageNet dataset downloader. Creates a custom dataset by specifying the required number of classes and images in a class.

Python 18 7 Updated Jul 13, 2021

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,006 418 Updated Nov 29, 2023

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,296 2,055 Updated Jan 23, 2024

Fast and simple stream processing of files in tar files, useful for deep learning, big data, and many other applications.

Go 117 15 Updated Dec 10, 2023

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,102 162 Updated Jul 11, 2024

Robust fine-tuning of zero-shot models

Python 602 62 Updated Apr 29, 2022

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,392 881 Updated Jul 16, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 30,801 4,642 Updated Jul 15, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,027 140 Updated Jul 12, 2024

An open source implementation of CLIP.

Python 9,229 917 Updated Jul 4, 2024

一个超级轻量的百度图片爬虫

Python 855 387 Updated May 29, 2023

Instant voice cloning by MyShell.

Python 27,315 2,648 Updated Jul 6, 2024

Boost.org regex module

C++ 82 90 Updated Apr 19, 2024

The official Meta Llama 3 GitHub site

Python 23,265 2,490 Updated Jul 17, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,097 277 Updated May 4, 2024

端到端语音唤醒工具箱,从模型训练到模型推理。

Python 54 8 Updated Apr 29, 2024

商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.

C++ 464 57 Updated May 15, 2024

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,126 222 Updated Jun 28, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,482 231 Updated May 1, 2024

澎湃新闻,新浪新闻,腾讯新闻,搜狐新闻,新闻联播,泰晤士报,纽约时报,BBCNews,旨在爬取所有新闻门户网站的新闻,禁止将所得数据商用!

Python 319 56 Updated Oct 18, 2022

API接口大全不断更新中~欢迎Fork和Star(✎ 1.一言(古诗句版)api ✎ 2.必应每日一图api ✎ 3.在线ip查询 ✎ 4.m3u8视频在线解析api ✎ 5.随机生成二次元图片api ✎ 6.快递查询api-支持国内百家快递 ✎ 7.flv视频在线解析api ✎ 8.抖音视频无水印解析api✎ 9.一句话随机图片api✎ 10.QQ用户信息获取api✎11.哔哩哔哩封面图获…

PHP 1,592 340 Updated May 27, 2023

Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"

Python 40 1 Updated Apr 14, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

1,994 135 Updated Jul 16, 2024

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python 4,541 448 Updated Jul 16, 2024

Metadata and versioning details for the Common Voice dataset

JavaScript 133 15 Updated Jul 1, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,158 751 Updated Jul 10, 2024

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Python 2,402 222 Updated Jul 17, 2024

Gorilla: An API store for LLMs

Python 10,861 871 Updated Jul 17, 2024
Next