Stars
OCR, layout analysis, reading order, line detection in 90+ languages
Chai-1, SOTA model for biomolecular structure prediction
An introduction to theorem proving in Lean for the impatient.
Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon
The code of a graph neural network (GNN) for molecules, which is based on learning representations of r-radius subgraphs (i.e., fingerprints) in molecules.
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Easy-to-use beamformers for multi-channel speech separation/enhancement
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
An MLX port of FLUX based on the Hugging Face Diffusers implementation.
This repo contains the code for our paper "An Image is Worth 32 Tokens for Reconstruction and Generation"
A formalized proof of Carleson's theorem in Lean
Reference implementations of several LangChain agents as Streamlit apps
Convert PDF to markdown quickly with high accuracy
SpeechGPT Series: Speech Large Language Models
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Fast & Simple repository for pre-training and fine-tuning T5-style models
Ongoing Lean formalisation of the proof of Fermat's Last Theorem
A carefully designed OCR pipeline for universal bordered table recognition and reconstruction.
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.
Official Repository for the Uni-Mol Series Methods
This is a PyTorch implementation of the paper "Self-Supervised Graph Transformer on Large-Scale Molecular Data"
A lightweight OCR rebuilt from PaddleOCR that does not depend on the PaddlePaddle deep learning training framework, with very fast inference
SimPO: Simple Preference Optimization with a Reference-Free Reward