You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
Not sure if this is a bug. Trying do use alignment or refinement feature it processes the task up to 100%, but then instead of saving result files it aborts and displays an error message: "Error occured while doing alignment. can only concatenate str (not "int") to str".
When I uncheck "Remove repetitions" in Result Modification settings it will finish normally, but then files will be filled with hallucination repetitions. What can I do about it?
To Reproduce
Steps to reproduce the behavior:
Go to 'Tools' > 'Alignment'.
Start nalignment process with txt file and audio file.
Wait until processing finishes.
At 100%, when trying to save results error message shows up.
Expected behavior
Clean file without repetitions by using postprocessing feature "remove repetitions" as it worked before.
File "D:\Codes_Projects\Python\Speech-Translate\speech_translate\utils\audio\file.py", line 1229, in mod_result
File "D:\Codes_Projects\Python\Speech-Translate.venv\Lib\site-packages\stable_whisper\result.py", line 1632, in remove_repetition
TypeError: can only concatenate str (not "int") to str
2024-04-01 23:24:26.216 | INFO | main.py:2008 [Thread-105 (mod_result)] - Stopping alignment...
2024-04-01 23:24:26.249 | INFO | main.py:2023 [Thread-105 (mod_result)] - Stopped
Desktop (please complete the following information):
OS: Windows 10
App Installation version: 1.3.10 GPU-Version
App / Python version: ...
Additional context
It did work like a charm before. After playingf with repetition settings it stopped working and the behaviour I described appeared. Even after setting bacl all default settings, it won't work any more. Thanks for any help.
The text was updated successfully, but these errors were encountered:
Describe the bug
Not sure if this is a bug. Trying do use alignment or refinement feature it processes the task up to 100%, but then instead of saving result files it aborts and displays an error message: "Error occured while doing alignment. can only concatenate str (not "int") to str".
When I uncheck "Remove repetitions" in Result Modification settings it will finish normally, but then files will be filled with hallucination repetitions. What can I do about it?
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Clean file without repetitions by using postprocessing feature "remove repetitions" as it worked before.
Log
2024-04-01 23:23:19.852 | DEBUG | main.py:1964 [MainThread] - all_english: False
2024-04-01 23:23:19.860 | INFO | helper.py:37 [MainThread] - Checking model name
2024-04-01 23:23:19.860 | DEBUG | helper.py:38 [MainThread] - modelKey: 🐌 Large V3 [10GB VRAM] (Most Accurate), src_english: False
2024-04-01 23:23:19.860 | DEBUG | helper.py:43 [MainThread] - modelName: large-v3
2024-04-01 23:23:19.860 | DEBUG | main.py:1539 [MainThread] - Running disabler...
2024-04-01 23:23:22.095 | DEBUG | main.py:1617 [MainThread] - Running enabler...
2024-04-01 23:23:22.260 | INFO | file.py:981 [Thread-105 (mod_result)] - Start Process (MOD FILE)
2024-04-01 23:23:22.261 | DEBUG | load.py:366 [Thread-105 (mod_result)] - Mode load args get: {'device': 'cuda', 'cpu_preload': True, 'success': True}
2024-04-01 23:23:32.560 | DEBUG | load.py:439 [Thread-105 (mod_result)] - Pass kwarg:
2024-04-01 23:23:32.560 | DEBUG | load.py:440 [Thread-105 (mod_result)] - {'temperature': (0.0, 0.2, 0.4, 0.6, 0.8, 1.0), 'best_of': 3, 'beam_size': 3, 'patience': 1.0, 'compression_ratio_threshold': 2.4, 'logprob_threshold': -1.0, 'no_speech_threshold': 0.72, 'suppress_tokens': '', 'suppress_blank': True, 'initial_prompt': None, 'prefix': None, 'condition_on_previous_text': True, 'max_initial_timestamp': 1.0, 'fp16': True}
2024-04-01 23:23:32.560 | DEBUG | load.py:366 [Thread-105 (mod_result)] - Mode align args get: {'regroup': True, 'suppress_silence': True, 'suppress_word_ts': True, 'min_word_dur': 0.1, 'q_levels': 20, 'k_size': 5, 'vad': False, 'vad_threshold': 0.35, 'vad_onnx': False, 'demucs': False, 'demucs_output': None, 'demucs_options': None, 'only_voice_freq': False, 'prepend_punctuations': '"'“¿([{-', 'append_punctuations': '"'.。,,!!??::”)]}、', 'threads': 0, 'success': True}
2024-04-01 23:23:32.560 | INFO | file.py:996 [Thread-105 (mod_result)] - Model Args: {'device': 'cuda', 'cpu_preload': True, 'download_root': 'C:\Users\Ben\.cache\whisper'}
2024-04-01 23:23:32.560 | INFO | file.py:997 [Thread-105 (mod_result)] - Process Args: {'regroup': True, 'suppress_silence': True, 'suppress_word_ts': True, 'min_word_dur': 0.1, 'q_levels': 20, 'k_size': 5, 'vad': False, 'vad_threshold': 0.35, 'vad_onnx': False, 'demucs': False, 'demucs_output': None, 'demucs_options': None, 'only_voice_freq': False, 'prepend_punctuations': '"'“¿([{-', 'append_punctuations': '"'.。,,!!??::”)]}、'}
2024-04-01 23:23:32.584 | DEBUG | file.py:1111 [Thread-105 (mod_result)] - PROCESSING: ['C:/Transkription/Assets/audio/MSE600.mp3', 'C:/Transkription/Textdaten/Rohtext/MSE600.txt', 'german']
2024-04-01 23:23:32.584 | DEBUG | file.py:1140 [Thread-105 (mod_result)] - Save_name: MSE600 01-04-2024-23-23-32
2024-04-01 23:23:32.584 | DEBUG | language.py:269 [Thread-105 (mod_result)] - GETTING WHISPER LANGUAGE FROM SIMILAR LANGUAGE NAME
2024-04-01 23:23:32.584 | DEBUG | language.py:274 [Thread-105 (mod_result)] - Found key german while searching for german
2024-04-01 23:23:32.585 | DEBUG | language.py:275 [Thread-105 (mod_result)] - FULL KEY GET ['german']
2024-04-01 23:23:33.636 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 0% | ## | 0/837.41 [00:00<?, ?sec/s]
2024-04-01 23:23:35.679 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 1% | ## | 7.46/837.41 [00:02<03:47, 3.65sec/s]
2024-04-01 23:23:36.266 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 2% | ## | 18.74/837.41 [00:02<01:40, 8.18sec/s]
2024-04-01 23:23:36.837 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 5% | ## | 45.38/837.41 [00:03<00:41, 19.26sec/s]
2024-04-01 23:23:37.401 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 9% | ## | 73.16/837.41 [00:03<00:27, 27.93sec/s]
2024-04-01 23:23:37.987 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 12% | ### | 99.6/837.41 [00:04<00:22, 33.08sec/s]
2024-04-01 23:23:38.558 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 15% | ### | 126.04/837.41 [00:04<00:19, 36.98sec/s]
2024-04-01 23:23:39.129 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 18% | ### | 150.42/837.41 [00:05<00:17, 38.68sec/s]
2024-04-01 23:23:39.709 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 21% | ### | 179.42/837.41 [00:06<00:15, 42.09sec/s]
2024-04-01 23:23:40.280 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 24% | ### | 203.82/837.41 [00:06<00:14, 42.28sec/s]
2024-04-01 23:23:40.851 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 27% | ### | 230.02/837.41 [00:07<00:14, 43.36sec/s]
2024-04-01 23:23:41.415 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 30% | ### | 254.34/837.41 [00:07<00:13, 43.28sec/s]
2024-04-01 23:23:41.986 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 33% | ### | 277.44/837.41 [00:08<00:13, 42.43sec/s]
2024-04-01 23:23:42.573 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 36% | ### | 298.42/837.41 [00:08<00:13, 40.40sec/s]
2024-04-01 23:23:43.143 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 39% | ### | 324.7/837.41 [00:09<00:12, 42.08sec/s]
2024-04-01 23:23:43.723 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 41% | ### | 346.26/837.41 [00:10<00:12, 40.60sec/s]
2024-04-01 23:23:44.295 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 44% | ### | 371.22/837.41 [00:10<00:11, 41.52sec/s]
2024-04-01 23:23:44.865 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 47% | ### | 394.98/837.41 [00:11<00:10, 41.56sec/s]
2024-04-01 23:23:45.429 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 50% | ### | 417.96/837.41 [00:11<00:10, 41.31sec/s]
2024-04-01 23:23:46.000 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 53% | ### | 445.98/837.41 [00:12<00:08, 43.65sec/s]
2024-04-01 23:23:46.587 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 56% | ### | 468.24/837.41 [00:12<00:08, 41.91sec/s]
2024-04-01 23:23:47.157 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 59% | ### | 491.76/837.41 [00:13<00:08, 41.70sec/s]
2024-04-01 23:23:47.737 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 62% | ### | 520.48/837.41 [00:14<00:07, 44.06sec/s]
2024-04-01 23:23:48.302 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 65% | ### | 544.0/837.41 [00:14<00:06, 43.34sec/s]
2024-04-01 23:23:48.879 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 67% | ### | 564.68/837.41 [00:15<00:06, 41.09sec/s]
2024-04-01 23:23:49.465 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 70% | ### | 586.9/837.41 [00:15<00:06, 40.11sec/s]
2024-04-01 23:23:50.030 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 73% | ### | 611.96/837.41 [00:16<00:05, 41.38sec/s]
2024-04-01 23:23:50.616 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 76% | ### | 634.5/837.41 [00:16<00:05, 40.48sec/s]
2024-04-01 23:23:51.187 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 79% | ### | 663.3/837.41 [00:17<00:04, 43.45sec/s]
2024-04-01 23:23:51.767 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 82% | ### | 687.08/837.41 [00:18<00:03, 42.71sec/s]
2024-04-01 23:23:52.342 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 85% | ### | 710.84/837.41 [00:18<00:02, 42.30sec/s]
2024-04-01 23:23:52.909 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 88% | ### | 734.68/837.41 [00:19<00:02, 42.23sec/s]
2024-04-01 23:23:53.495 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 91% | ### | 761.2/837.41 [00:19<00:01, 43.14sec/s]
2024-04-01 23:23:54.060 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 94% | ### | 784.32/837.41 [00:20<00:01, 42.50sec/s]
2024-04-01 23:23:54.646 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 96% | ### | 807.92/837.41 [00:21<00:00, 41.81sec/s]
2024-04-01 23:23:55.186 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 97% | ### | 815.98/837.41 [00:21<00:00, 34.12sec/s]
2024-04-01 23:23:55.186 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 100% | #### | 837.41/837.41 [00:21<00:00, 38.86sec/s]
2024-04-01 23:23:57.959 | ERROR | file.py:1279 [Thread-105 (mod_result)] - Error occured while doing alignment
2024-04-01 23:23:57.959 | ERROR | file.py:1280 [Thread-105 (mod_result)] - can only concatenate str (not "int") to str
Traceback (most recent call last):
File "D:\Codes_Projects\Python\Speech-Translate\speech_translate\utils\audio\file.py", line 1229, in mod_result
File "D:\Codes_Projects\Python\Speech-Translate.venv\Lib\site-packages\stable_whisper\result.py", line 1632, in remove_repetition
TypeError: can only concatenate str (not "int") to str
2024-04-01 23:24:26.216 | INFO | main.py:2008 [Thread-105 (mod_result)] - Stopping alignment...
2024-04-01 23:24:26.249 | INFO | main.py:2023 [Thread-105 (mod_result)] - Stopped
Desktop (please complete the following information):
Additional context
It did work like a charm before. After playingf with repetition settings it stopped working and the behaviour I described appeared. Even after setting bacl all default settings, it won't work any more. Thanks for any help.
The text was updated successfully, but these errors were encountered: