HelloTheWholeWorld

pyhou1999 HelloTheWholeWorld

加油！

10 followers · 27 following

Achievements

Starred repositories

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,203 851 Updated Sep 13, 2024

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,000 439 Updated Oct 10, 2024

shibing624 / parrots

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成，支持多语言，准确率高

Python 462 84 Updated Mar 10, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,238 664 Updated Oct 10, 2024

lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 12,186 1,003 Updated Jul 5, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 9,974 961 Updated Oct 9, 2024

lyswhut / lx-music-desktop

一个基于 electron 的音乐软件

TypeScript 39,928 5,932 Updated Sep 24, 2024

KaiyangZhou / CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,705 192 Updated May 20, 2024

yekeren / VCR-shortcut-effects-study

Code and data of our AAAI2021 paper "A Case Study of the Shortcut Effects in Visual Commonsense Reasoning"

Python 8 1 Updated Mar 15, 2021

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,747 954 Updated Aug 23, 2024

shenweichen / DeepCTR-Torch

【PyTorch】Easy-to-use,Modular and Extendible package of deep-learning based CTR models.

Python 2,990 702 Updated Jul 2, 2024

limanling / clip-event

Python 98 7 Updated Apr 11, 2022

MLNLP-World / Paper-Writing-Tips

MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips

3,532 457 Updated May 29, 2022

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,766 2,515 Updated Oct 10, 2024

jacobgil / pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,349 1,547 Updated Oct 7, 2024