cndn
|
d80093f9f8
Import ggml to SC
|
1 rok temu |
Anna Sun
|
14b013315e
[streaming] add s2s + s2t expressive demo (#153)
|
1 rok temu |
Pierre Andrews
|
6be7fd3468
add missing prosody params (#141)
|
1 rok temu |
Kaushik Ram Sadagopan
|
fb59ee0a49
Enable M4T vocoder inference on the GPU in fp16. (#151)
|
1 rok temu |
Yilin Yang
|
64c0e73ac0
Adding watermarked PretsselVocoderAgent (#149)
|
1 rok temu |
Anna Sun
|
8373db2ee5
Enable joint s2t + s2s output for demo (#146)
|
1 rok temu |
Kaushik Ram Sadagopan
|
26bc428198
Clean up aligner assets, refactor aligner test to run on GPU with fp16 as well. (#148)
|
1 rok temu |
Kaushik Ram Sadagopan
|
e568857c64
Fix all mypy issues in streaming, and some minor bugs in tree pipeline. (#147)
|
1 rok temu |
Abinesh Ramakrishnan
|
b5b98699c6
Tree Pipeline (#140)
|
1 rok temu |
Yilin Yang
|
7d076869ef
update ProsodyUnitY checkpoint to fairseq2 format (#143)
|
1 rok temu |
Yilin Yang
|
26a020227f
bump watermarked vocoder's max_seq_len to allow longer input (#145)
|
1 rok temu |
Ning
|
fad6308372
Fix medium model loading (#144)
|
1 rok temu |
Abinesh Ramakrishnan
|
a1b5d918eb
Vocoder SimulEval Agent and getting online S2ST parity. (#135)
|
1 rok temu |
Yilin Yang
|
8d49dd0450
small afterwards fix due to conda env break yesterday (#142)
|
1 rok temu |
Ilia Kulikov
|
78d6dac3a9
UnitY2 aligner for release (#112)
|
1 rok temu |
Can Balioglu
|
f9608d8cbd
Various nit fixes (#138)
|
1 rok temu |
Anna Sun
|
3e673a7468
mixins come first (#137)
|
1 rok temu |
Kaushik Ram Sadagopan
|
6fadf9e320
Introduce online unit decoder SimulEval agent. (#115)
|
1 rok temu |
Anna Sun
|
d877073d7c
Bump simuleval version, override simuleval update_target to save memory (#133)
|
1 rok temu |
Tuan Tran
|
9f6ade6ee4
Make a public-facing watermarked vocoder (PretsselVocoder) (#97)
|
1 rok temu |
Anna Sun
|
2ccf28ad24
[streaming] Port changes for streaming demo (#130)
|
1 rok temu |
Yilin Yang
|
00118c21cc
Enabling 24khz vocoder for demo/OSS (#132)
|
1 rok temu |
Kaushik Ram Sadagopan
|
7537081d50
Return durations from the variance adaptor. (#134)
|
1 rok temu |
Abinesh Ramakrishnan
|
b1027e9858
Silero VAD Agent (#120)
|
1 rok temu |
Pierre Andrews
|
87e10d101b
mintox - Add option to consume pretranscribed text + log mintox for cloudwatch (#131)
|
1 rok temu |
Kaushik Ram Sadagopan
|
5a2d61655f
Make unit_extractor configurable by dtype. (#128)
|
1 rok temu |
Abinesh Ramakrishnan
|
bc88690d56
Ability to change tgt_lang dynamically during streaming inference. (#121)
|
1 rok temu |
Kaushik Ram Sadagopan
|
5dd9722b8d
Introduce MMASpeechToTextDecoderAgent and related agents for online_text_decoder. (#113)
|
1 rok temu |
Ruslan Mavlyutov
|
c9f611a0b2
Fix inconsistency between model vocab info and associated tokenizers (inherit directly from the tokenizers) (#126)
|
1 rok temu |
Yilin Yang
|
b9f101b2b7
Loose the PretsselModel test check by allowing one unit different b/t runs (#127)
|
1 rok temu |