Skip to content
View kylemcdonald's full-sized avatar

Highlights

  • Pro

Organizations

@fatlab @scratchml @ITPNYU

Block or report kylemcdonald

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Encode and decode audio samples to/from compressed latent representations!

Python 96 4 Updated Aug 16, 2024

Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!

Python 422 37 Updated Aug 11, 2024

Fast running Live Portrait with TensorRT and ONNX models

Python 110 9 Updated Jul 30, 2024

Official Pytorch implementation of "Visual Style Prompting with Swapping Self-Attention"

Python 404 30 Updated Jun 24, 2024

Fast Gaussian Blur algorithm

C++ 89 16 Updated Aug 18, 2024
Python 154 5 Updated Feb 14, 2024

This list contains the airport codes of IATA airport code and ICAO airport code together with country code and region name supported in IP2Location geolocation database.

147 57 Updated Jul 31, 2024

The subtitles and translations are generated in real-time and displayed as pop-ups.

Python 107 19 Updated Jun 8, 2023

An AppleScript that extracts the presenter notes from an open Keynote presentation and save them to a text file on the Desktop. It also copies the presenter notes to the clipboard.

AppleScript 3 Updated Jun 14, 2023

Source code from our RecSys 2020 paper: "Making neural network interpretable with attribution: application to implicit signals prediction" (D. Afchar, R. Hennequin)

Jupyter Notebook 14 2 Updated Oct 2, 2020
Jupyter Notebook 86 3 Updated Jun 18, 2024

Interactive visualization and analytics on ADS-B data with ClickHouse

HTML 221 6 Updated Jun 13, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,181 409 Updated Jul 30, 2024
Jupyter Notebook 7,169 512 Updated Jun 16, 2024

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python 2,138 112 Updated Aug 20, 2024

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 27,473 3,448 Updated Aug 6, 2024

Tiny AutoEncoder for Stable Diffusion

Python 519 27 Updated Aug 10, 2024

🎛 🔊 A Python library for audio.

C++ 5,092 258 Updated Aug 23, 2024

✈️🗄 2023 Historical data for all aircrafts traces known to adsb.lol. Openly licensed.

Shell 22 1 Updated Jan 2, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 3,784 379 Updated Aug 22, 2024

Talk to GPT-4 and create a story together.

Python 74 16 Updated Dec 2, 2023

HeadGAN - Official PyTorch Implementation (ICCV 2021)

Python 71 6 Updated Aug 4, 2023
Python 111 16 Updated Jul 12, 2023
Jupyter Notebook 2,384 449 Updated Dec 16, 2023

Port of OpenAI's Whisper model in C/C++

C 33,990 3,438 Updated Aug 21, 2024

Streaming MP3 decoder for Python

Python 27 2 Updated Apr 24, 2023

With Twilio Media Streams, you can now extend the capabilities of your Twilio-powered voice application with real time access to the raw audio stream of phone calls. This project transcribes speech…

JavaScript 45 22 Updated Mar 16, 2023

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 13,302 1,682 Updated Aug 21, 2024

A simple HTML5/JS demo that uses Recorder.js to record audio as uncompressed pcm (wav) and POST it to a server side script.

JavaScript 405 228 Updated Jan 21, 2022
Next