Features

Dataset

  • Aishell
  • Librispeech
  • THCHS30
  • TIMIT

Speech Recognition

Language Model

  • Ngram

Decoder

  • CTC greedy
  • CTC prefix beam search
  • greedy
  • beam search
  • attention rescore
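
As an illustration of the simplest decoder listed above, here is a minimal sketch of CTC greedy decoding. It is not this project's implementation: the function name `ctc_greedy_decode`, the `blank_id` parameter, and the plain nested-list input are all assumptions made for the example.

```python
from itertools import groupby
from typing import List, Sequence


def ctc_greedy_decode(
    frame_scores: Sequence[Sequence[float]], blank_id: int = 0
) -> List[int]:
    """Hypothetical helper: decode per-frame scores with the CTC greedy rule."""
    # 1. Pick the highest-scoring token index for every frame.
    best_path = [
        max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores
    ]
    # 2. Merge consecutive repeats of the same token.
    collapsed = [token for token, _ in groupby(best_path)]
    # 3. Drop the blank symbol.
    return [token for token in collapsed if token != blank_id]


# Five frames over a vocabulary of three symbols (index 0 is the blank).
scores = [
    [0.1, 0.8, 0.1],
    [0.1, 0.8, 0.1],
    [0.9, 0.05, 0.05],
    [0.1, 0.8, 0.1],
    [0.1, 0.1, 0.8],
]
decoded = ctc_greedy_decode(scores)
```

The blank between the two runs of token 1 is what keeps them from being merged into one, so repeated labels survive decoding.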

Deployment

  • Paddle Inference

Alignment

  • MFA
  • CTC Alignment

Speech Frontend

  • Audio
    • Auto Gain
  • Feature
    • kaldi fbank
    • kaldi mfcc
    • linear
    • delta delta
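
For readers unfamiliar with the delta and delta-delta features in the list above, the following sketch shows the commonly used regression formula, applied once for delta and twice for delta-delta. The function name `delta`, the `window` parameter, and the edge handling (repeating the first and last frames) are assumptions for illustration and may differ from what this project does.

```python
from typing import List, Sequence


def delta(features: Sequence[Sequence[float]], window: int = 2) -> List[List[float]]:
    """Hypothetical helper: regression-based delta over time.

    d[t] = sum_{n=1..window} n * (c[t+n] - c[t-n]) / (2 * sum_{n} n^2),
    with out-of-range frames replaced by the nearest edge frame.
    """
    num_frames = len(features)
    denom = 2 * sum(n * n for n in range(1, window + 1))
    result = []
    for t in range(num_frames):
        row = []
        for dim in range(len(features[t])):
            acc = 0.0
            for n in range(1, window + 1):
                ahead = features[min(t + n, num_frames - 1)][dim]
                behind = features[max(t - n, 0)][dim]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        result.append(row)
    return result


# One-dimensional toy features that rise by 1 per frame.
ramp = [[float(i)] for i in range(6)]
first_order = delta(ramp)          # delta
second_order = delta(first_order)  # delta-delta
```

In a typical pipeline the static, delta, and delta-delta features are concatenated per frame before being passed to the model.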

Speech Augmentation

  • Audio
    • Volume Perturbation
    • Speed Perturbation
    • Shifting Perturbation
    • Online Bayesian normalization
    • Noise Perturbation
    • Impulse Response
  • Spectrum
    • SpecAugment
    • Adaptive SpecAugment

Tokenizer

  • Chinese/English Character
  • English Word
  • Sentence Piece

Word Segmentation

Grapheme To Phoneme

  • syllable
  • phoneme