Starred repositories
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Convert any PDF into a podcast episode!
Model components of the Llama Stack APIs
Speech To Speech: an effort for an open-sourced and modular GPT-4o
Represent, send, store and search multimodal data
Drag & drop UI to build your customized LLM flow
An AI agent powered by LLMs that streamlines the entire process of data analysis. 🚀
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
Minimalistic web app designed for sending private and secure notes.
Code and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
Hunt down social media accounts by username across social networks
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
The Security Toolkit for LLM Interactions
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for text data.
The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Python code for part 2 of the book Causal Inference: What If, by Miguel Hernán and James Robins
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs