Changelog

0.8.0-rc (2024-08-13)

Added

  • ✨ add openvoice api #100 [0a7735a]
  • ✨ add openvoice model/dcls/downloader #100 [a14600d]
  • ✨ add webui ssml `from_subtitle` tab [3ab5d3f]
  • ✨ add chattts txt_smp eos #133 [854e477]
  • ✨ add asr_tab output download [26d9b0e]
  • ✨ add webui `asr` tab [a2b0670]
  • ✨ add webui `post process` tab [6a7d9de]
  • ✨ improve `model_zoo` [f04c1a4]
  • ✨ add `/v1/stt/transcribe` api [ccf21d3]
  • ✨ replace `openai.whisper` => `faster_whisper` [60d4d2c]
  • ✨ add faster whisper downloader [108d691]
  • ✨ add script `get_spk_emb.py` #128 [e6a58c5]
  • ✨ add stt api #92 [92b992f]
  • ✨ impl whisper model [ae3f416]
  • ✨ api support specify model_id [a486098]
  • ✨ ChatTTS supports reference inference #113 [ff4991e]
  • ✨ webui add Speaker Builder [c5b5a27]
  • ✨ update chattts download script [702d63d]
  • ✨ api support latest speaker structure [e210f03]
  • ✨ add `/text/normalize` api [ca0f7b2]
  • ✨ add excepthook [241bceb]
  • ✨ support more tts models #90 [e3d1b14]
  • ✨ add `CosyVoiceModel` #90 [3adb574]
  • ✨ webui `tts_generate_with_history` [58fe589]
  • ✨ webui min_temp `0.1` => `0.000000001` [3a41ef9]
  • ✨ add xtts_v2 api `no_cache` [a8d913d]
  • ✨ add cosyvoice spk file [2854902]
  • ✨ add spk edit script [837b49b]
  • ✨ add cosyvoice config [6556306]
  • ✨ add cosyvoice downloader [cc60120]
  • ✨ support fish speech model [1835565]
  • ✨ refactor style prompt [9c8c0de]
  • ✨ add fishspeech model dl [bdb3ee5]
  • ✨ improve tts handler [8b85878]
  • ✨ make TN compatible with Windows [0919b01]
  • ✨ webui in `tts_pipeline` [ea39d6b]
  • ✨ streaming encoder [50d41a9]
  • ✨ improve stream mode [3095d4c]
  • ✨ add InferCache [fd93e79]
  • ✅ pass TN test cases [f604f5c]
  • ✨ models load by zoo [7835eb6]
  • ✨ fit tests case to `dev_tts_pipeline` [b24cb17]
  • ✨ fit api => tts pipeline [c4baeb1]
  • ✨ add `--train_mse` [7e54bb7]

Changed

Breaking changes

Removed

  • 🔥 remove test files [b1d5d69]
  • 🔥 remove old speaker files [caf8787]

Fixed

Miscellaneous

  • Merge pull request #136 from fangd123/main [fbea889]
  • update the instruction of installing WeTextProcessing [c816d67]
  • replace chattts modelscope_repo with AI-ModelScope [0f76bce]
  • fix: ensure SpkEncoder tensor with dtype.float [b9491a6]
  • fix: topK and topP not valid #115 [3ce2424]
  • Merge pull request #115 from wenyangchou/main [f9cb075]
  • fix: topK and topP not valid [c0bfe02]
  • Merge pull request #102 from lenML/dev_tts_pipeline [4028da1]
  • merge main into dev_tts_pipeline [846d18f]
  • Merge pull request #93 from lenML/dependabot/pip/certifi-2024.7.4 [2f252df]

0.7.0 (2024-07-04)

Added

  • ✨ add stream mode to openai api [989c3c5]
  • ✨ add `PYTORCH_ENABLE_MPS_FALLBACK` [2024e0b]
  • ✨ add `--flash_attn` args [a449091]
  • ✅ add stream test cases [02fbb71]
  • ✨ improve refine ui [4676c71]
  • ✨ improve infer [f9b81aa]
  • ✨ improve generate [b1f13b5]
  • ✨ improve podcast tab [fa63491]
  • ✨ improve webui split_tab [9578cc7]
  • ✨ SentenceSplitter use tokenizer [d8b8596]
  • ✨ add warning and docs [7370ba2]
  • ✨ add adjuster to webui [01f09b4]
  • ✨ stream mode support adjuster [585d2dd]
  • ✨ improve xtts_v2 api [fec66c7]
  • ✨ improve normalize [d0da37e]
  • ✨ improve normalize/splitter [163b649]
  • ✨ add loudness equalization [bc8bda7]
  • ✨ support `--use_cpu=chattts,enhancer,trainer,all` [23023bc]
  • ✨ improve normalizetion.py [1a7c0ed]
  • ✨ ignore_useless_warnings [4b9a32e]
  • ✨ enhance logger, info => debug [73bc8e7]
  • ✨ add playground.stream page [31377b0]
  • ✨ tts api support stream #5 [15e0b2c]

Changed

  • ⬆️ sync upstream changes [5d174b3]
  • ⬆️ sync upstream changes [e49dbb8]
  • 🎨 format [f5f483a]
  • ⬆️ sync upstream changes for ALL [f7fb126]
  • ⬆️ sync upstream changes for `dvae.py` [cc3ca09]
  • 🎨 format [6af9e24]
  • ♻️ refactor SentenceSplitter [d90c862]
  • ♻️ refactor models_setup [ff9c7c0]
  • 🍱 add _p_en [56f1fbf]
  • 🍱 update prompt [4f95b31]
  • ⚡ Reduce popping sounds [2d0fd68]
  • ⚡ improve `apply_character_map` [ea7399f]

Fixed

  • 🐛 fix stream generate [7f19d4f]
  • 🐛 add map_location [a5f90cb]
  • 🐛 fix dvae indices.dtype [db1e571]
  • 🐛 fix missing `trange` [8577a53]
  • 🐛 fix Including `&` escape char causes normalization errors #77 [85c98f6]
  • 🐛 fix speaker loader #71 [e7b759f]
  • 🐛 fix load speaker from seed #69 [304c318]
  • 🐛 fix apply_prosody [7fa55d9]
  • 🐛 fix normalization lang detect [bd5e6eb]
  • 🐛 remove rubberband-cli dependencies #68 unit test [650a668]
  • 🐛 remove rubberband-cli dependencies #68 [1cd34c3]
  • 🐛 fix `apply_normalize` missing `sr` [2db6d65]
  • 🐛 fix sentence splitter [5d8937c]
  • 🐛 fix playground url_join [53e7cbc]
  • 🐛 fix generate_audio args [a7a698c]
  • 🐛 fix infer func [b0de527]
  • 🐛 fix webui logging format [4adc29e]
  • 🐛 fix webui speaker_tab missing progress [fafe242]

Miscellaneous

  • Merge pull request #84 from wenyangchou/main [f811e3d]
  • Update Dockerfile [e0cc31c]
  • optimize docker build [7b187b6]
  • 🔨 add download_audio_backend.py [8dd6925]
  • 💩 revert librosa => pyrubberband [4ead989]
  • Merge pull request #66 from WannaTen/main [9af0361]
  • fix port in api mode [40ab68b]
  • Windows not yet supported for torch.compile fix [74ac27d]
  • fix: replace mispronounced words in TTS [de66e6b]
  • feat: support stream mode [3da0f0c]
  • optimize: mps audio quality by contiguous scores [1e4d79f]

0.6.1 (2024-06-18)

Added

  • ✨ add `--preload_models` [73a41e0]
  • ✨ add webui progress [778802d]
  • ✨ add merger error [51060bc]
  • ✨ tts prompt => experimental [d3e6315]
  • ✨ add basic speaker finetune ui [5f68f19]
  • ✨ add speaker finetune [5ce27ed]
  • ✨ add `--no_half`, remove `--half` [5820e57]
  • ✨ add webui podcast default values [dd786a8]
  • ✨ add webui splitter config [589327b]
  • ✨ add `eos` params to all api [79c994f]

Changed

Fixed

Miscellaneous

0.6.0 (2024-06-12)

Added

  • ✨ add XTTSv2 api #42 [d1fc63c]
  • ✨ google api supports enhancer [14fecdb]
  • ✨ change the podcast script's default style [98186c2]
  • ✨ playground google api [4109adb]
  • ✨ add unload api [ed9d61a]
  • ✨ support api workers [babdada]
  • ✨ add ffmpeg version to webui footer [e9241a1]
  • ✨ support use internal ffmpeg [0e02ab0]
  • ✨ add debug_generate parameter [94e876a]
  • ✨ support running the api service alongside the webui [4901491]
  • ✨ refiner api support normalize [ef665da]
  • ✨ add webui voice timbre editor [fb4c7b3]
  • ✨ add localization [c05035d]
  • ✨ SSML supports enhancer [5c2788e]
  • ✨ add podcast tool tab to webui [b0b169d]
  • ✨ improve enhancer [205ebeb]

Changed

  • ⚡ improve synthesize_audio [759adc2]
  • ⚡ reduce enhancer chunk vram usage [3464b42]
  • ⚡ add default speakers [d702ad5]
  • 🍱 add `podcast` `podcast_p` style [2b9e5bf]
  • 🎨 improve code [317951e]
  • 🍱 update banner [dbc293e]
  • ⚡ enhance TN [092c1b9]
  • ⚡ enhancer supports off_tqdm [94d34d6]
  • ⚡ add git env [43d9c65]
  • ⚡ change the file format used when saving from webui [2da41c9]

Breaking changes

  • 💥 enhancer support --half [fef2ed6]

Removed

Fixed

  • 🐛 fix worker env loader [5b0bf4e]
  • 🐛 fix colab default lang missing [d4e5190]
  • 🐛 fix "reflection_pad1d" not implemented for 'Half' [536c19b]
  • 🐛 fix #33 [76e0b58]
  • 🐛 fix localization error [507dbe7]
  • 🐛 block main thread #30 [3a7cbde]
  • 🐛 fix webui skip no-translate [a8d595e]
  • 🐛 fix hf.space force abort [f564536]
  • 🐛 fix missing device #25 [07cf6c1]
  • 🐛 fix Chat.refiner_prompt() [0839863]
  • 🐛 fix --language type check [50d354c]
  • 🐛 fix hparams config #22 [61d9809]
  • 🐛 fix enhance download script [d2e14b0]
  • 🐛 fix 'trange' referenced [d1a8dae]
  • 🐛 fix ssml to mp3 error `bad sample width` [564b7eb]
  • 🐛 fix seed context exit behavior [d4e33c8]
  • 🐛 fix colab script [687cc2c]

Miscellaneous

0.5.5 (2024-06-08)

Added

  • ✨ add webui speaker creator [df26549]
  • ✨ webui speaker tab and merger [7ad71fd]
  • ✨ add enhance download script [37adec6]
  • ✨ add audio enhance/denoise [00cbc8e]
  • ✅ add speakers api test [fbe4304]
  • ✅ add unit test [e7f9385]

Changed

Removed

Fixed

  • 🐛 fix warning and hf.spaces error [f9700bb]
  • 🐛 fix model thread competition [0ade6ac]
  • 🐛 fix hf space error `ZeroGPU has not been initialized` [562e17c]
  • 🐛 fix openai api [49088c5]

Miscellaneous

  • 🐳 fix pip requirements [4256371]

0.5.2 (2024-06-06)

Changed

Fixed

  • 🐛 fix window proxy env [d0f9760]

Miscellaneous

0.5.1 (2024-06-06)

Fixed

  • 🐛 fix SynthesizeSegments seed [83b63bd]

0.5.0 (2024-06-06)

Added

Changed

Fixed

Miscellaneous