Spaces:

PogusTheWhisper
/

Audio-to-Blog-Summarizer

Sleeping

Naphat Sornwichai commited on 24 days ago

Commit

995e28f

1 Parent(s): b7496a7

update major files

Files changed (1) hide show

app.py CHANGED Viewed

@@ -117,11 +117,8 @@ def transcribe_and_summarize(audio_file: str, youtube_url: str, progress=gr.Prog
             sampling_rate=16000
         ).input_features.to(device, dtype=torch_dtype)
-        # Set the generation language and task for Thai transcription
-        decoder_prompt_ids = processor.get_decoder_prompt_ids(language="th", task="transcribe")
-        # Generate token IDs from the input features
-        predicted_ids = model.generate(input_features, forced_decoder_ids=decoder_prompt_ids)
         # Decode the token IDs to text
         transcribed_text = processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]

             sampling_rate=16000
         ).input_features.to(device, dtype=torch_dtype)
+        # Generate token IDs from the input features, passing task and language directly
+        predicted_ids = model.generate(input_features, language="th", task="transcribe")
         # Decode the token IDs to text
         transcribed_text = processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]