Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Maximum length? #4

Open
peteruhrig opened this issue Nov 18, 2018 · 2 comments
Open

Maximum length? #4

peteruhrig opened this issue Nov 18, 2018 · 2 comments

Comments

@peteruhrig
Copy link

peteruhrig commented Nov 18, 2018

Dear Alberto,

thanks a lot for this great list! Even though you do not list this property in the table, I would be interested to hear if you have any insights as to the maximum length of recordings and transcripts for the various tools. At the Distributed Little Red Hen Lab, we need to align one-hour recordings with their transcripts. This works great with Gentle for English, but for instance the Montreal Forced Aligner is only happy with relatively short passages. I'd be grateful for any pointers you have.

Best regards,
Peter

@pettarin
Copy link
Owner

I agree on the usefulness of a "maximum length" column. On the other hand, it is difficult to be objective there: the "max processable length" might depend on multiple factors (like: RAM available, acoustic/linguistic models being used, etc.) so it would be difficult to give a fair assessment without spending a considerable amount of time or without setting up a rigorous test environment.
Also, some algorithms might trade execution time, space/memory required and alignment accuracy.

Probably a qualitative/approximate description ("5 minutes" vs "10 hours" with 8GB RAM) would suffice.

@pettarin
Copy link
Owner

Ah, for clarity: I do not have a data point for each of the aligners currently in the table. In fact, I wiped out the linux laptop where I had most of them installed...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants