Stars
Step-by-step description: receives voice input from the user > saves it as an audio file > passes the audio file to Whisper (OpenAI's voice-to-text API) > receives the transcribed text from the Whisper API > use t…
This project converts written material into speech by using Google AI (Gemini) for text creation or internet searches.
This Python script lets you communicate with Google's Gemini from the Python terminal using your microphone.
A Streamlit application to generate code from images
Making large AI models cheaper, faster and more accessible
Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.
MTxm / Automatic-Youtube-Reddit-Text-To-Speech-Video-Generator-and-Uploader
Forked from HA6Bots/Automatic-Youtube-Reddit-Text-To-Speech-Video-Generator-and-Uploader
A series of three programs that automatically receive scripts from Reddit, let the user edit them, and send them to a video generator, after which the videos are uploaded to YouTube automatically.
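Note: this is a contracted project that has passed through many hands. It is open-sourced for learning purposes only; commercial use is prohibited, and you bear sole responsibility for any legal issues that arise!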
This page is for SlimYOLOv3: Narrower, Faster and Better for UAV Real-Time Applications
Drone-based RGB-Infrared Cross-Modality Vehicle Detection via Uncertainty-Aware Learning
Deep Learning Autonomous Car based on Raspberry Pi, SunFounder PiCar-V Kit, TensorFlow, and Google's EdgeTPU Co-Processor
A car detection model implemented in TensorFlow.
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
Detect a mobile device's model using JavaScript
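A small, elegant tool focused on batch push messaging. Currently supports: template messages (WeChat Official Accounts), template messages (WeChat Mini Programs), WeChat customer service messages, WeChat Enterprise Account/WeCom messages, Alibaba Cloud SMS, Alidayu template SMS, Tencent Cloud SMS, Yunpian SMS, E-Mail, HTTP requests, DingTalk, Huawei Cloud SMS, Baidu Cloud SMS, UPYUN SMS, Qiniu Cloud SMS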
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
All Algorithms implemented in Python
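WeChat bot, Soul bot, and Douyin bot. Supports checking whether a WeChat contact is still a friend, Soul soul matching, chatbot conversation, command-line replies, and more. No phone root required; implemented using AI APIs.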
A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.
License Plate Detection and Recognition in Unconstrained Scenarios