-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Duplicate languages and inconsistencies in languagelookup.json #30
Comments
related TODO: make sure TED language lookup is mappable to LE language names:
|
In the choice of This seems to be because the language list is used in the wrong way... is this the misunderstanding?
Maybe we should maintain a separate list of unique language codes without the ambiguity? |
The The recommended approach for looking up language codes is using the lang_obj = getlang_by_name('Swahili')
lang_obj.code
## 'swa'
lang_obj = getlang_by_native_name('Kiswahili')
lang_obj.code
## 'swa' There is also
The language codes in le-utils are referred to as the "internal representation" — a far-from-perfect, and far-from-consisten convention that is used by Ricecooker, Studio, and Kolibri. All external language codes must be mapped to one of the internal codes upon "entering" the Kolibri ecosystem. We maintain le-utils language codes partially out of compatibility (for channels already out there) can learn all about the different work and helpers and utils for doing this mapping of external-to-internal the docs, the examples, or the tests where you'll see various non-standard external language codes we handle [KA lang codes, youtube, ISOs, native name variations, etc]. Next steps on this issue:
Some numbersThe
let's hope none of them have been published otherwise we'll have to keep For
|
Looking through
languagelookup.json
I found some inconsistencies to langauges:"name":"Panjabi, Punjabi", should be "name":"Panjabi; Punjabi"
These can be fixed manually (change , to ; )
Some languages have two different internal representation codes:
Might be a good idea to remove duplicates --- check if exist in CCServer before removing.
The text was updated successfully, but these errors were encountered: