Stars
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂 Text Classification, 🔍 Neural Search…
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Open-source simulator for autonomous driving research.
A virtual gamepad that lets a mobile phone control a racing game on a computer, implemented over sockets.
Graphical User Interface for creating and running Scratch 3.0 projects.
People Counting in Real-Time with an IP camera.
MobileNetV1-SSD + SORT based Real-Time Tracking and Counting on Jetson Nano
Counting the number of people in a video.
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system in 275+ supported cars.
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
SoftVC VITS Singing Voice Conversion
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Official PyTorch implementation for the paper High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions (ECCV 2022).
Using Low-rank adaptation to quickly fine-tune diffusion models.
Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.
Image-to-Image Translation in PyTorch
Image-to-image translation with conditional adversarial nets