jiamingkong

Follow

jiamingkong

Follow

18 followers · 28 following

Achievements

Achievements

Block or Report

Block or report jiamingkong

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Lists (1)

Sort

datasets

Datasets that I like

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

lovemefan / telespeech-asr-python

Python 25 2 Updated Jul 17, 2024

MrZilinXiao / Hyper-Table-OCR

A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.

C++ 166 43 Updated Jan 10, 2023

mem0ai / mem0

The memory layer for Personalized AI

Python 17,641 1,681 Updated Jul 26, 2024

haoheliu / SemantiCodec-inference

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 88 3 Updated Jul 15, 2024

AGENDD / RWKV-PEFT

Forked from JL-er/RWKV-PEFT

Python 1 1 Updated Jul 26, 2024

deepmodeling / Uni-Mol

Official Repository for the Uni-Mol Series Methods

Python 624 115 Updated Jul 24, 2024

tencent-ailab / grover

This is a Pytorch implementation of the paper: Self-Supervised Graph Transformer on Large-Scale Molecular Data

Python 318 67 Updated Apr 19, 2022

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 3,210 198 Updated Jul 26, 2024

jingsongliujing / OnnxOCR

基于PaddleOCR重构，并且脱离PaddlePaddle深度学习训练框架的轻量级OCR，推理速度超快

Python 526 51 Updated Jun 30, 2024

princeton-nlp / SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 559 31 Updated Jul 20, 2024

AkojimaSLP / Beamforming-for-speech-enhancement

simple delaysum, MVDR and CGMM-MVDR

Python 221 73 Updated Jan 19, 2019

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,597 98 Updated Jul 26, 2024

ictnlp / CTC-S2UT

Code for ACL 2024 findings paper "CTC-based Non-autoregressive Textless Speech-to-Speech Translation"

6 Updated Jun 11, 2024

ashutosh1919 / data2vec-pytorch

Ready to run PyTorch implementation of Data2Vec 2.0: Highly efficient self-supervised representation learning for vision, speech and text.

Python 10 2 Updated Mar 29, 2023

robertostling / eflomal

Efficient Low-Memory Aligner

C 135 29 Updated Jun 20, 2024

ghchen18 / cdalign

Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"

Python 24 6 Updated Dec 14, 2022

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

918 17 Updated Jul 10, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

25,586 1,356 Updated Jul 21, 2024

ChenghaoMou / text-dedup

All-in-one text de-duplication

Python 555 68 Updated May 21, 2024

facebookresearch / ears_dataset

Expressive Anechoic Recordings of Speech (EARS)

Python 100 5 Updated Jun 25, 2024

sustcsonglin / flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 781 45 Updated Jul 25, 2024

SuperBianC / scMulan

Repository for paper scMulan: a multitask generative pre-trained language model for single-cell analysis.

Jupyter Notebook 33 4 Updated May 30, 2024

Tele-AI / TeleSpeech-ASR

Python 415 37 Updated Jun 7, 2024

hrmacbeth / math2001

Lecture notes for a course on writing proofs, on paper and in the Lean proof assistant

HTML 156 63 Updated May 30, 2024

huggingface / dataspeech

Python 228 25 Updated Jul 5, 2024

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,401 436 Updated May 3, 2024

facebookresearch / ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,760 339 Updated May 8, 2024

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 4,175 362 Updated Jul 26, 2024

facebookresearch / SONAR

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 292 32 Updated Jul 25, 2024

mahmoodlab / UNI

Towards a general-purpose foundation model for computational pathology - Nature Medicine

Jupyter Notebook 254 32 Updated Jul 16, 2024