A carefully designed OCR pipeline for universal bordered table recognition and reconstruction.

C++ 166 43 Updated Jan 10, 2023

The memory layer for Personalized AI

Python 17,641 1,680 Updated Jul 26, 2024

Ultra-low-bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.

Python 88 3 Updated Jul 15, 2024
Python 1 1 Updated Jul 26, 2024

Official Repository for the Uni-Mol Series Methods

Python 624 115 Updated Jul 24, 2024

This is a PyTorch implementation of the paper: Self-Supervised Graph Transformer on Large-Scale Molecular Data

Python 318 67 Updated Apr 19, 2022

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 3,210 198 Updated Jul 26, 2024

A lightweight OCR toolkit rebuilt from PaddleOCR that runs without the PaddlePaddle deep learning training framework, with very fast inference

Python 526 51 Updated Jun 30, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 559 31 Updated Jul 20, 2024

Simple delay-and-sum, MVDR, and CGMM-MVDR

Python 221 73 Updated Jan 19, 2019

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,597 98 Updated Jul 26, 2024

Code for ACL 2024 findings paper "CTC-based Non-autoregressive Textless Speech-to-Speech Translation"

6 Updated Jun 11, 2024

Ready to run PyTorch implementation of Data2Vec 2.0: Highly efficient self-supervised representation learning for vision, speech and text.

Python 10 2 Updated Mar 29, 2023

Efficient Low-Memory Aligner

C 135 29 Updated Jun 20, 2024

Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"

Python 24 6 Updated Dec 14, 2022

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

918 17 Updated Jul 10, 2024

LLM101n: Let's build a Storyteller

25,587 1,356 Updated Jul 21, 2024

All-in-one text de-duplication

Python 555 68 Updated May 21, 2024

Expressive Anechoic Recordings of Speech (EARS)

Python 100 5 Updated Jun 25, 2024

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 781 45 Updated Jul 25, 2024

Repository for paper scMulan: a multitask generative pre-trained language model for single-cell analysis.

Jupyter Notebook 33 4 Updated May 30, 2024
Python 415 37 Updated Jun 7, 2024

Lecture notes for a course on writing proofs, on paper and in the Lean proof assistant

HTML 156 63 Updated May 30, 2024
Python 228 25 Updated Jul 5, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,401 436 Updated May 3, 2024

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,760 339 Updated May 8, 2024

Tools for merging pretrained large language models.

Python 4,175 362 Updated Jul 26, 2024

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 292 32 Updated Jul 25, 2024

Towards a general-purpose foundation model for computational pathology - Nature Medicine

Jupyter Notebook 254 32 Updated Jul 16, 2024