INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…

619 42 Updated Aug 9, 2024

facebookresearch / fairseq2

FAIR Sequence Modeling Toolkit 2

Python 653 69 Updated Aug 17, 2024

google-research / perch

Python 161 37 Updated Aug 14, 2024

ivy-llc / ivy

Convert Machine Learning Code Between Frameworks

Python 14,020 5,797 Updated Aug 16, 2024

microsoft / CLAP

Learning audio concepts from natural language supervision

Python 446 36 Updated May 27, 2024

bojone / rerope

Rectified Rotary Position Embeddings

Python 327 27 Updated May 20, 2024

YuanGongND / whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 303 25 Updated Feb 21, 2024

MontaEllis / Pytorch-Medical-Segmentation

This repository is an unoffical PyTorch implementation of Medical segmentation in 2D and 3D.

Python 838 196 Updated Feb 29, 2024

MLNLP-World / MyArxiv

Arxiv个性化定制化模版，实现对特定领域的相关内容、作者与学术会议的有效跟进。

CSS 226 19 Updated Aug 16, 2024

symless / synergy-core

Open source core of Synergy, the cross-platform keyboard and mouse sharing tool (Windows, macOS, Linux)

C++ 10,185 3,623 Updated Aug 16, 2024

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,014 1,052 Updated Aug 16, 2024

microsoft / torchscale

Foundation Architecture for (M)LLMs

Python 2,988 202 Updated Apr 11, 2024

Jamie-Stirling / RetNet

An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"

Python 1,152 99 Updated Oct 22, 2023

InternLM / InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,079 431 Updated Aug 14, 2024

OFA-Sys / OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,383 248 Updated Apr 24, 2024

alinlab / ifseg

IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023)

Python 79 9 Updated Sep 5, 2023

Long-Kai / ADV_CE

Source code for paper "Improving Task-Specific Generalization in Few-Shot Learning via Adaptive Vicinal Risk Minimization"

Python 4 Updated Mar 1, 2023

bigscience-workshop / bigscience

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 971 100 Updated Jul 29, 2024

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,341 4,011 Updated Aug 16, 2024

yeyupiaoling / Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 779 122 Updated Jul 18, 2024

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

26,145 2,179 Updated Jun 18, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,374 2,464 Updated Aug 12, 2024

Previous Next

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TianHao Zhang Zth9730

Achievements

Achievements

Highlights

Block or report Zth9730

Stars

pliang279 / awesome-multimodal-ml

MingLunHan / CIF-HieraDist

YuanGongND / cav-mae

MontaEllis / SSL-For-Medical-Segmentation

facebookresearch / WavAugment

BradyFU / Awesome-Multimodal-Large-Language-Models

lucidrains / flamingo-pytorch

jasonppy / PromptingWhisper

DmitryRyumin / INTERSPEECH-2023-24-Papers