tmoroney


Cross-platform audio/video downloader

TypeScript 27 1 Updated Jun 22, 2024

Tauri & ReactJS boilerplate for a modern desktop application. Not a project nor a substitute for my Tauri video tutorials.

JavaScript 187 38 Updated Jul 21, 2024

Rapidly scaffold out a new Tauri app project.

Rust 1,013 86 Updated Oct 8, 2024

A Tauri Python sidecar example, using PyInstaller.

CSS 67 6 Updated Aug 27, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 43,045 7,721 Updated Oct 8, 2024

Pure LuaJIT FFI socket bindings for Unix and Windows

Lua 38 5 Updated Sep 1, 2022

Learning to cut end-to-end pretrained modules

Python 27 3 Updated Jul 16, 2024

This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing"

Python 45 1 Updated Nov 28, 2022

Robust Speech Recognition via Large-Scale Weak Supervision

Python 69,055 8,125 Updated Sep 30, 2024

Deploying a React App (created using create-react-app) to GitHub Pages

TypeScript 6,504 916 Updated Sep 28, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,487 142 Updated Oct 4, 2024

Open Source API and interchange format for editorial timeline information.

Python 1,448 287 Updated Oct 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,010 4,136 Updated Oct 8, 2024

MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.

Python 314 24 Updated Oct 6, 2024

Transcription with speaker diarization pipeline

Python 83 17 Updated Apr 27, 2023

Speaker diarization model

Python 20 6 Updated Apr 1, 2023

A PyTorch-based Speech Toolkit

Python 8,680 1,375 Updated Oct 7, 2024

A python package to analyze and compare voices with deep learning

Python 2,752 425 Updated Oct 12, 2023

Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO

Jupyter Notebook 57 15 Updated Oct 30, 2022

Papers, code and datasets about deep learning and multi-modal learning for video analysis

753 171 Updated Oct 10, 2021

This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.

OpenEdge ABL 732 149 Updated Mar 15, 2023

🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Choose your transcription deity: MLX Whisper (local), Groq (spee…

Python 61 8 Updated Aug 31, 2024

Examples in the MLX framework

Python 5,967 847 Updated Oct 8, 2024

An extremely fast implementation of Whisper optimized for Apple Silicon using MLX.

Python 548 26 Updated May 8, 2024

Transcribe and summarize videos using Whisper and LLMs on Apple's MLX framework

Python 71 6 Updated Jan 28, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,553 281 Updated Jul 12, 2024

Caesium is an image compression software that helps you store, send and share digital pictures, supporting JPG, PNG, WebP and TIFF formats. You can quickly reduce the file size (and resolution, if …

C++ 3,569 214 Updated Sep 27, 2024

Distribute and run LLMs with a single file.

C++ 19,547 986 Updated Oct 8, 2024

"EasyRec: Simple yet Effective Language Model for Recommendation"

Python 84 11 Updated Sep 12, 2024

A plotting tool that outputs Line Rider maps, so you can watch a man on a sled scoot down your loss curves. 🎿

Python 310 5 Updated Aug 23, 2024