소스 검색

Switching out seamlessM4T_v2_large to the multitask model with the text encoder. (#55)

Kaushik Ram Sadagopan 1 년 전
부모
커밋
f45bd3a4b9
2개의 변경된 파일2개의 추가작업 그리고 2개의 파일을 삭제
  1. 1 1
      src/seamless_communication/assets/cards/seamlessM4T_v2_large.yaml
  2. 1 1
      src/seamless_communication/models/unity/builder.py

+ 1 - 1
src/seamless_communication/assets/cards/seamlessM4T_v2_large.yaml

@@ -8,7 +8,7 @@ name: seamlessM4T_v2_large
 base: unity_nllb-100
 model_arch: base_v2
 char_tokenizer: "file://checkpoint/krs/unity2/spm_char_lang38_tc.model"
-checkpoint: "file://checkpoint/lpw/m4t_v2_final.pt"
+checkpoint: "file://large_experiments/seamless/ust/elbayadm/multitasking_models/m4t_v2_multitask_unity2.pt"
 num_units: 10000
 unit_langs:
   - arb

+ 1 - 1
src/seamless_communication/models/unity/builder.py

@@ -156,7 +156,7 @@ def _base_v2() -> UnitYConfig:
         w2v2_encoder_config=w2v2_chunk_encoder_config,
         mt_model_config=mt_model_config,
         t2u_config=t2u_config,
-        use_text_encoder=False,
+        use_text_encoder=True,
         use_conformer_adaptor=False,
         num_adaptor_layers=1,
         adaptor_kernel_size=8,