-
Tau Motors
- Redwood City, CA
Block or Report
Block or report nickvazz
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse🛹 skate 🛹
A deep learning approach of learning how to kickflip a skateboard
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Resources (papers, datasets, rendering methods) in the domain of object pose estimation.
A toolbox for skeleton-based action recognition.
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
Classifying skateboard tricks from video clips using DeepMind's I3D model and an audio feature extractor..
Code for FLAVR: A fast and efficient frame interpolation technique.
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"
PyTorch code and models for the DINOv2 self-supervised learning method.
Inpaint anything using Segment Anything and inpainting models.
[CVPR 2023] Code for "VisFusion: Visibility-aware Online 3D Scene Reconstruction from Videos"
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximu…
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
iDisc: Internal Discretization for Monocular Depth Estimation [CVPR 2023]
Code for "OnePose: One-Shot Object Pose Estimation without CAD Models", CVPR 2022
Easy & Modular Computer Vision Detectors, Trackers & SAM - Run YOLOv9,v8,v7,v6,v5,R,X in under 10 lines of code.
Mask-Free Video Instance Segmentation [CVPR 2023]
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities