Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] so slooow and large #89

Open
fireheart2008 opened this issue Oct 7, 2024 · 0 comments
Open

[BUG] so slooow and large #89

fireheart2008 opened this issue Oct 7, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@fireheart2008
Copy link

https://http-ws.bbc.co.uk.edgesuite.net/mp3/learningenglish/2012/01/bbc_6_minute_work_emails_web_120112_6min_english_work_emails_audio_au_bb.mp3

English Rewind - 6 Minute English_ Work emails

this 6 minute audio file
took almost 6 minutes to transcribe
and the srt is 135 kb

after converting it to lrc elsewhere
(it would have been great if .lrc could be generated by this app btw)

it looks like this

[ti:English Rewind - 6 Minute English_ Work emails (192kbit_AAC)]
[re: Subtitle Edit]
[ve: 4.0.7.0]
[00:00.34]Hello,
[00:00.72]
[00:01.28]Catherine here from BBC Learning English.
[00:01.80]Catherine here from BBC Learning English.
[00:02.14]Catherine here from BBC Learning English.
[00:02.66]Catherine here from BBC Learning English.
[00:03.22]Catherine here from BBC Learning English.
[00:03.86]
[00:04.44]Just so you know,
[00:04.76]Just so you know,
[00:04.98]Just so you know,
[00:05.14]Just so you know,
[00:05.70]
[00:05.94]this programme is from the BBC Learning English archive.
[00:06.06]this programme is from the BBC Learning English archive.
[00:06.44]this programme is from the BBC Learning English archive.
[00:06.78]this programme is from the BBC Learning English archive.
[00:07.04]this programme is from the BBC Learning English archive.
[00:07.14]this programme is from the BBC Learning English archive.
[00:07.70]this programme is from the BBC Learning English archive.
[00:08.30]this programme is from the BBC Learning English archive.
[00:08.76]this programme is from the BBC Learning English archive.
[00:09.14]
[00:10.06]It was originally broadcast in January 2012 on our website.
[00:10.12]It was originally broadcast in January 2012 on our website.
[00:10.40]It was originally broadcast in January 2012 on our website.
[00:11.00]It was originally broadcast in January 2012 on our website.
[00:11.44]It was originally broadcast in January 2012 on our website.
[00:11.68]It was originally broadcast in January 2012 on our website.
[00:12.20]It was originally broadcast in January 2012 on our website.
[00:12.52]It was originally broadcast in January 2012 on our website.
[00:12.66]It was originally broadcast in January 2012 on our website.
[00:13.32]It was originally broadcast in January 2012 on our website.
[00:13.54]It was originally broadcast in January 2012 on our website.
[00:13.80]It was originally broadcast in January 2012 on our website.
[00:14.04]It was originally broadcast in January 2012 on our website.
[00:14.32]
[00:15.24]Enjoy!
[00:15.46]
[00:16.24]This is Six Minute English from BBCLearningEnglish.com
[00:16.60]This is Six Minute English from BBCLearningEnglish.com
[00:16.88]This is Six Minute English from BBCLearningEnglish.com
[00:17.14]This is Six Minute English from BBCLearningEnglish.com
[00:17.32]This is Six Minute English from BBCLearningEnglish.com
[00:17.80]This is Six Minute English from BBCLearningEnglish.com
[00:18.20]This is Six Minute English from BBCLearningEnglish.com
[00:19.40]This is Six Minute English from BBCLearningEnglish.com
[00:19.98]
[00:20.46]Hello and welcome to Six Minute English from BBCLearningEnglish.
[00:20.74]Hello and welcome to Six Minute English from BBCLearningEnglish.
[00:21.12]Hello and welcome to Six Minute English from BBCLearningEnglish.
[00:21.44]Hello and welcome to Six Minute English from BBCLearningEnglish.
[00:21.74]Hello and welcome to Six Minute English from BBCLearningEnglish.
[00:22.04]Hello and welcome to Six Minute English from BBCLearningEnglish.
[00:22.22]Hello and welcome to Six Minute English from BBCLearningEnglish.
[00:22.62]Hello and welcome to Six Minute English from BBCLearningEnglish.

my question is how to make it shorter and less repetitive

anyways thank you for the great app and hope you can help

log

2024-10-07 12:55:30.284 | INFO    | main.py:2152 [MainThread] - App Version: 1.3.10 - TIME: 2024-10-07 12:55:30
2024-10-07 12:55:30.284 | INFO    | main.py:2153 [MainThread] - OS: Windows 10 10.0.22631 | CPU: Intel64 Family 6 Model 140 Stepping 1, GenuineIntel
2024-10-07 12:55:30.284 | DEBUG   | main.py:2154 [MainThread] - Sys args: ['C:\\Users\\hp14\\AppData\\Local\\Programs\\Speech Translate CPU\\SpeechTranslate.exe']
2024-10-07 12:55:30.284 | DEBUG   | main.py:2155 [MainThread] - Loading UI...
2024-10-07 12:55:30.361 | INFO    | main.py:208 [MainThread] - Tray created successfully
2024-10-07 12:55:30.736 | DEBUG   | main.py:281 [MainThread] - Available Theme to use: ['vista', 'sun-valley-light', 'sun-valley-dark', 'winnative', 'clam', 'alt', 'default', 'classic', 'xpnative']
2024-10-07 12:55:30.736 | DEBUG   | style.py:32 [MainThread] - Setting theme: sun-valley-dark
2024-10-07 12:55:30.771 | DEBUG   | style.py:66 [MainThread] - Setting custom dark theme style
2024-10-07 12:55:33.364 | INFO    | main.py:299 [Thread-3 (cuda_check)] - GPU: No GPU detected | CUDA: CUDA is not available! Using CPU instead
2024-10-07 12:55:34.191 | DEBUG   | main.py:713 [Thread-7 (check_ffmpeg)] - Checking ffmpeg...
2024-10-07 12:55:34.195 | DEBUG   | main.py:715 [Thread-7 (check_ffmpeg)] - Checking ffmpeg done
2024-10-07 12:55:38.543 | INFO    | about.py:123 [MainThread] - Checking for update on start
2024-10-07 12:55:38.881 | INFO    | about.py:150 [Thread-18 (req_update_check)] - Checking for update...
2024-10-07 12:55:40.881 | INFO    | about.py:167 [Thread-18 (req_update_check)] - No update available
2024-10-07 13:00:53.397 | INFO    | helper.py:37 [MainThread] - Checking model name
2024-10-07 13:00:53.397 | DEBUG   | helper.py:38 [MainThread] - modelKey: ⚡ Tiny [1GB VRAM] (Fastest), src_english: True
2024-10-07 13:00:53.397 | DEBUG   | helper.py:43 [MainThread] - modelName: tiny.en
2024-10-07 13:00:53.397 | DEBUG   | main.py:1539 [MainThread] - Running disabler...
2024-10-07 13:00:54.680 | DEBUG   | download.py:110 [MainThread] - Connecting to huggingface server to verify model
2024-10-07 13:00:55.309 | DEBUG   | main.py:1617 [MainThread] - Running enabler...
2024-10-07 13:00:57.593 | INFO    | file.py:624 [Thread-19 (process_file)] - Start Process (FILE)
2024-10-07 13:00:57.595 | DEBUG   | load.py:366 [Thread-19 (process_file)] - Mode load args get: {'device': 'cpu', 'success': True}
2024-10-07 13:00:57.595 | DEBUG   | load.py:489 [Thread-19 (process_file)] - Loading model for transcribe using faster-whisper
2024-10-07 13:01:00.269 | DEBUG   | load.py:539 [Thread-19 (process_file)] - Model loaded | Is Faster Whisper: True | Load Status:
2024-10-07 13:01:00.269 | DEBUG   | load.py:540 [Thread-19 (process_file)] - TC: Set
2024-10-07 13:01:00.269 | DEBUG   | load.py:541 [Thread-19 (process_file)] - TL: Not Set
2024-10-07 13:01:00.269 | DEBUG   | load.py:542 [Thread-19 (process_file)] - func_tc: Set
2024-10-07 13:01:00.269 | DEBUG   | load.py:543 [Thread-19 (process_file)] - func_tl: Not Set
2024-10-07 13:01:00.269 | DEBUG   | load.py:439 [Thread-19 (process_file)] - Pass kwarg:
2024-10-07 13:01:00.269 | DEBUG   | load.py:440 [Thread-19 (process_file)] - {'temperature': (0.0, 0.2, 0.4, 0.6, 0.8, 1.0), 'best_of': 3, 'beam_size': 3, 'patience': 1.0, 'compression_ratio_threshold': 2.4, 'logprob_threshold': -1.0, 'no_speech_threshold': 0.72, 'suppress_tokens': None, 'suppress_blank': True, 'initial_prompt': None, 'prefix': None, 'condition_on_previous_text': True, 'max_initial_timestamp': 1.0, 'fp16': True}
2024-10-07 13:01:00.270 | DEBUG   | load.py:366 [Thread-19 (process_file)] - Mode transcribe args get: {'word_timestamps': True, 'regroup': True, 'suppress_silence': True, 'suppress_word_ts': True, 'q_levels': 20, 'k_size': 5, 'demucs': False, 'demucs_output': None, 'demucs_options': None, 'vad': False, 'vad_threshold': 0.35, 'vad_onnx': False, 'min_word_dur': 0.1, 'only_voice_freq': False, 'beam_size': 3, 'best_of': 3, 'patience': 1.0, 'no_speech_threshold': 0.72, 'compression_ratio_threshold': 2.4, 'condition_on_previous_text': True, 'initial_prompt': None, 'prefix': None, 'suppress_blank': True, 'suppress_tokens': None, 'max_initial_timestamp': 1.0, 'threads': 0, 'success': True}
2024-10-07 13:01:00.270 | DEBUG   | language.py:269 [Thread-19 (process_file)] - GETTING WHISPER LANGUAGE FROM SIMILAR LANGUAGE NAME
2024-10-07 13:01:00.271 | DEBUG   | language.py:274 [Thread-19 (process_file)] - Found key english while searching for english
2024-10-07 13:01:00.271 | DEBUG   | language.py:275 [Thread-19 (process_file)] - FULL KEY GET ['english']
2024-10-07 13:01:00.273 | INFO    | file.py:652 [Thread-19 (process_file)] - Model Args: {'device': 'cpu', 'download_root': 'C:\\Users\\hp14\\.cache\\whisper'}
2024-10-07 13:01:00.273 | INFO    | file.py:653 [Thread-19 (process_file)] - Process Args: {'word_timestamps': True, 'regroup': True, 'suppress_silence': True, 'suppress_word_ts': True, 'q_levels': 20, 'k_size': 5, 'demucs': False, 'demucs_output': None, 'demucs_options': None, 'vad': False, 'vad_threshold': 0.35, 'vad_onnx': False, 'min_word_dur': 0.1, 'only_voice_freq': False, 'beam_size': 3, 'best_of': 3, 'patience': 1.0, 'no_speech_threshold': 0.72, 'compression_ratio_threshold': 2.4, 'condition_on_previous_text': True, 'initial_prompt': None, 'prefix': None, 'suppress_blank': True, 'suppress_tokens': None, 'max_initial_timestamp': 1.0, 'language': 'en'}
2024-10-07 13:01:00.361 | DEBUG   | file.py:802 [Thread-19 (process_file)] - FILE PROCESSING: D:/Downloads/Music/English Rewind - 6 Minute English_ Work emails (192kbit_AAC).m4a
2024-10-07 13:01:00.361 | DEBUG   | file.py:810 [Thread-19 (process_file)] - Save_name: 2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)/{task-lang}
2024-10-07 13:01:00.362 | DEBUG   | file.py:862 [Thread-19 (process_file)] - saved metadata
2024-10-07 13:01:00.405 | INFO    | file.py:294 [Thread-23 (cancellable_tc)] - --------------------------------------------------
2024-10-07 13:01:00.424 | INFO    | file.py:295 [Thread-23 (cancellable_tc)] - Transcribing
2024-10-07 13:01:00.424 | DEBUG   | file.py:296 [Thread-23 (cancellable_tc)] - Source Language: english
2024-10-07 13:01:00.425 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Running Whisper transcribe...
2024-10-07 13:01:06.490 | ERROR   | _logging.py:62 [Thread-24 (run_whisper)] - Detected Language: english
2024-10-07 13:01:06.490 | ERROR   | _logging.py:62 [Thread-24 (run_whisper)] - Transcribing with faster-whisper (tiny.en)...
2024-10-07 13:01:06.520 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:   0% | ## | 0/389.93 [00:00<?, ?sec/s]
2024-10-07 13:01:39.198 | ERROR   | _logging.py:62 [Thread-24 (run_whisper)] - Transcribe:   1%|          | 3.86/389.93 [00:32<54:28,  8.47s/sec]
2024-10-07 13:02:13.424 | ERROR   | _logging.py:62 [Thread-24 (run_whisper)] - Transcribe:   8%|7         | 31.12/389.93 [01:06<11:23,  1.91s/sec]
2024-10-07 13:02:41.809 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  16% | ### | 60.94/389.93 [01:35<07:22,  1.35s/sec]
2024-10-07 13:03:21.421 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  23% | ### | 90.38/389.93 [02:14<06:43,  1.35s/sec]
2024-10-07 13:04:01.193 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  29% | ### | 115.0/389.93 [02:54<06:36,  1.44s/sec]
2024-10-07 13:04:40.215 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  38% | ### | 147.94/389.93 [03:33<05:23,  1.34s/sec]
2024-10-07 13:05:22.095 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  45% | ### | 176.54/389.93 [04:15<04:54,  1.38s/sec]
2024-10-07 13:06:03.754 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  53% | ### | 204.9/389.93 [04:57<04:20,  1.41s/sec]
2024-10-07 13:06:39.920 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  59% | ### | 229.34/389.93 [05:33<03:49,  1.43s/sec]
2024-10-07 13:07:17.001 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  66% | ### | 258.9/389.93 [06:10<02:59,  1.37s/sec]
2024-10-07 13:07:45.294 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  73% | ### | 285.18/389.93 [06:38<02:14,  1.28s/sec]
2024-10-07 13:08:19.043 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  80% | ### | 310.38/389.93 [07:12<01:43,  1.30s/sec]
2024-10-07 13:08:44.903 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  84% | ### | 326.16/389.93 [07:38<01:27,  1.37s/sec]
2024-10-07 13:09:10.790 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  94% | ### | 364.94/389.93 [08:04<00:26,  1.08s/sec]
2024-10-07 13:09:19.860 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe:  99% | ### | 387.0/389.93 [08:13<00:02,  1.10sec/s]
2024-10-07 13:09:19.862 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Transcribe: 100% | #### | 389.93/389.93 [08:13<00:00,  1.27s/sec]
2024-10-07 13:09:22.449 | INFO    | _logging.py:55 [Thread-24 (run_whisper)] - Whisper transcribe done
2024-10-07 13:09:22.520 | DEBUG   | language.py:269 [Thread-23 (cancellable_tc)] - GETTING WHISPER LANGUAGE FROM SIMILAR LANGUAGE NAME
2024-10-07 13:09:22.521 | DEBUG   | language.py:274 [Thread-23 (cancellable_tc)] - Found key english while searching for english
2024-10-07 13:09:22.521 | DEBUG   | language.py:275 [Thread-23 (cancellable_tc)] - FULL KEY GET ['english']
2024-10-07 13:09:22.522 | ERROR   | _logging.py:62 [Thread-23 (cancellable_tc)] - Removed: [id:145] [06:24.460 --> 06:27.000] " Thank you."
2024-10-07 13:09:22.560 | DEBUG   | save.py:94 [Thread-23 (cancellable_tc)] - Saving to txt
2024-10-07 13:09:22.562 | DEBUG   | load.py:366 [Thread-23 (cancellable_tc)] - Mode save args get: {'filepath': 'C:\\Users\\hp14\\AppData\\Local\\Programs\\Speech Translate CPU\\lib\\speech_translate\\export\\2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)/transcribed english', 'strip': True, 'reverse_text': False, 'success': True}
2024-10-07 13:09:22.638 | ERROR   | _logging.py:62 [Thread-23 (cancellable_tc)] - Saved: C:\Users\hp14\AppData\Local\Programs\Speech Translate CPU\lib\speech_translate\export\2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)\transcribed english.txt
2024-10-07 13:09:22.639 | DEBUG   | save.py:94 [Thread-23 (cancellable_tc)] - Saving to srt
2024-10-07 13:09:22.640 | DEBUG   | load.py:366 [Thread-23 (cancellable_tc)] - Mode save args get: {'filepath': 'C:\\Users\\hp14\\AppData\\Local\\Programs\\Speech Translate CPU\\lib\\speech_translate\\export\\2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)/transcribed english', 'segment_level': True, 'word_level': True, 'tag': None, 'strip': True, 'reverse_text': False, 'success': True}
2024-10-07 13:09:22.733 | ERROR   | _logging.py:62 [Thread-23 (cancellable_tc)] - Saved: C:\Users\hp14\AppData\Local\Programs\Speech Translate CPU\lib\speech_translate\export\2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)\transcribed english.srt
2024-10-07 13:09:22.746 | DEBUG   | save.py:94 [Thread-23 (cancellable_tc)] - Saving to vtt
2024-10-07 13:09:22.748 | DEBUG   | load.py:366 [Thread-23 (cancellable_tc)] - Mode save args get: {'filepath': 'C:\\Users\\hp14\\AppData\\Local\\Programs\\Speech Translate CPU\\lib\\speech_translate\\export\\2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)/transcribed english', 'segment_level': True, 'word_level': True, 'tag': None, 'strip': True, 'reverse_text': False, 'success': True}
2024-10-07 13:09:22.826 | ERROR   | _logging.py:62 [Thread-23 (cancellable_tc)] - Saved: C:\Users\hp14\AppData\Local\Programs\Speech Translate CPU\lib\speech_translate\export\2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)\transcribed english.srt
2024-10-07 13:09:22.827 | DEBUG   | save.py:94 [Thread-23 (cancellable_tc)] - Saving to json
2024-10-07 13:09:23.073 | DEBUG   | save.py:94 [Thread-23 (cancellable_tc)] - Saving to ass
2024-10-07 13:09:23.081 | DEBUG   | load.py:366 [Thread-23 (cancellable_tc)] - Mode save args get: {'filepath': 'C:\\Users\\hp14\\AppData\\Local\\Programs\\Speech Translate CPU\\lib\\speech_translate\\export\\2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)/transcribed english', 'segment_level': True, 'word_level': True, 'tag': None, 'font': 'Arial', 'font_size': 48, 'strip': True, 'karaoke': False, 'reverse_text': False, 'success': True}
2024-10-07 13:09:23.154 | ERROR   | _logging.py:62 [Thread-23 (cancellable_tc)] - Saved: C:\Users\hp14\AppData\Local\Programs\Speech Translate CPU\lib\speech_translate\export\2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)\transcribed english.ass
2024-10-07 13:09:23.155 | DEBUG   | file.py:352 [Thread-23 (cancellable_tc)] - Transcribing Audio: 2024-10-07 361018 English Rewind - 6 Minute English_ Work emails (192kbit_AAC)/transcribed english | Time Taken: 502.75s
2024-10-07 13:09:23.156 | DEBUG   | file.py:365 [Thread-23 (cancellable_tc)] - Updated tc metadata
2024-10-07 13:09:23.730 | INFO    | file.py:908 [Thread-19 (process_file)] - End process (FILE) [Total time: 503.46s]
2024-10-07 13:10:09.523 | INFO    | main.py:1842 [Thread-19 (process_file)] - Stopping file import processing...
2024-10-07 13:10:09.664 | INFO    | main.py:1861 [Thread-19 (process_file)] - Stopped

@fireheart2008 fireheart2008 added the bug Something isn't working label Oct 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant