Stars
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Meta has built accurate population density maps using satellite imagery. Meta also provides a variety of open source tools we used to build them in order to assist others working in this field. Lea…
React app for inspecting, building and debugging with the Realtime API
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
[Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enlarged hidden dimension to build super frontier vision languag…
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
A tool that enhances language learning by displaying two subtitles simultaneously on YouTube videos.
A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!
nivo provides a rich set of dataviz components, built on top of the awesome d3 and React libraries
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
Open source implementation of AlphaFold3
Collection of Summer 2025 tech internships!
AI-Powered Photos App for the Decentralized Web 🌈💎✨
High-resolution models for human tasks.
Fast drag and drop for any experience on any tech stack
Bring portraits to life via webcam!
A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.
A modular graph-based Retrieval-Augmented Generation (RAG) system
[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs