Starred repositories
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on tasks like multi-label classification, named entity recognition,…
Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
✨✨Latest Advances on Multimodal Large Language Models
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Open-Sora: Democratizing Efficient Video Production for All
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
Official implementation of NCL-IML (Pre-training-free Image Manipulation Localization through Non-Mutually Contrastive Learning), ICCV 2023
A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use.
Llama3-Tutorial (XTuner, LMDeploy, OpenCompass)
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
A lightweight and high-performance reverse proxy for NAT traversal, written in Rust. An alternative to frp and ngrok.
Video+code lecture on building nanoGPT from scratch
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 30+ benchmarks
GLM-4 series: Open Multilingual Multimodal Chat LMs
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
RAG AutoML Tool - Find optimal RAG pipeline for your own data.
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
A lightweight, optionally typed expression language with a custom grammar for matching arbitrary Python objects.
SigLIP-based Aesthetic Score Predictor