Stars
Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g. gpt4) or private, local LLMs (e.g. llama3). Self-host locally or use our c…
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generator using multiple LLMs, with no GPU needed. The user can ask a question and the system will make a mu…
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
A PyTorch quantization backend for Optimum
Official PyTorch implementation of Revisiting Image Pyramid Structure for High Resolution Salient Object Detection (ACCV 2022)
Real-time, YOLO-like object detection using the Florence-2-base-ft model with a user-friendly GUI.
#1 Locally hosted web application that allows you to perform various operations on PDF files
PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.
GroqNotes: Generate organized notes from audio using Groq, Whisper, and Llama3
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
ControlNet++: All-in-one ControlNet for image generations and editing!
4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022
Solution for 2nd place in Visual Product Recognition Challenge 2023
2nd place solution to Google Universal Image Embedding Challenge!
1st Place Solution in Google Universal Image Embedding
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
[CVPR 2024] Official implementation for "SVGDreamer: Text Guided SVG Generation with Diffusion Model" https://arxiv.org/abs/2312.16476
[NeurIPS 2023] Official implementation for "DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models" https://arxiv.org/abs/2306.14685
[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
ImageSlider custom component for Gradio.
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.