- Redwood City, CA
-
17:04
(UTC -08:00) - https://www.andrewroberts.blog
- @andrew_roberts
- in/andrewr
Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Collection of custom elements that appear hand drawn. Great for wireframes or a fun look.
React UI + elegant infrastructure for AI Copilots, in-app AI agents, AI chatbots, and AI-powered Textareas 🪁
A simple Python script to turn non-OCRed PDFs into searchable, OCRed PDFs under an enterprise-friendly, open source license.
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capa…
Drop in replacement for the OpenAI Assistants API
This sample project demonstrate the OpenAI Assistants API’s ability to manage single-threaded multi-user interactions through a full-stack app using Node.js, Vue.js, and socket.io for server-client…
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Inference and training library for high-quality TTS models.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
A secure authentication module to manage user access in a Streamlit application.
Examples and guides for using the OpenAI API
Enhanced ChatGPT Clone: Features Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, langchain, D…
The official Python API for ElevenLabs Text to Speech.
🔊 Text-Prompted Generative Audio Model
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
A multi-voice TTS system trained with an emphasis on quality
A Python library that can apply: darth vader, echo, radio, robotic, and ghost effects to audio samples.
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
This project is a digital human that can talk and listen to you. It uses OpenAI's GPT to generate responses, OpenAI's Whisper to transcript the audio, Eleven Labs to generate voice and Rhubarb Lip …
Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)
Come join the best place on the internet to learn AI skills. Use code "chatbotui" for an extra 20% off.