Block or Report
Block or report jiamingkong
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (1)
Sort Last updated
Stars
Language
Sort by: Recently starred
A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.
Official Repository for the Uni-Mol Series Methods
This is a Pytorch implementation of the paper: Self-Supervised Graph Transformer on Large-Scale Molecular Data
A Comprehensive Toolkit for High-Quality PDF Content Extraction
基于PaddleOCR重构,并且脱离PaddlePaddle深度学习训练框架的轻量级OCR,推理速度超快
SimPO: Simple Preference Optimization with a Reference-Free Reward
simple delaysum, MVDR and CGMM-MVDR
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Code for ACL 2024 findings paper "CTC-based Non-autoregressive Textless Speech-to-Speech Translation"
Ready to run PyTorch implementation of Data2Vec 2.0: Highly efficient self-supervised representation learning for vision, speech and text.
Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Expressive Anechoic Recordings of Speech (EARS)
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Repository for paper scMulan: a multitask generative pre-trained language model for single-cell analysis.
Lecture notes for a course on writing proofs, on paper and in the Lean proof assistant
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
Tools for merging pretrained large language models.
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
Towards a general-purpose foundation model for computational pathology - Nature Medicine