
[website] Alternative pronunciation sources #41

@kylebgorman


As part of the multi-source g2p project, we hope to have generated pronunciations for many more sources (including PronLex, CELEX, and NETTalk) and formats (e.g., DISC, ARPAbet, and NETTalk). These will be read from some remote location by populate.py and will live in the database rather than being generated on the fly. We will also want separate "source" checkboxes for pronunciations generated automatically and for those actually found in the DB. If you select PronLex and WikiPron US, this might look a bit like the following:

...

  • PronLex (CC BY 4.0)
    - [x] ARPAbet (generated)
    - [ ] DISC (generated)
    - [ ] IPA (generated)
    - [ ] NETTalk (generated)
  • WikiPron US (Apache 2.0)
    - [ ] ARPAbet (generated)
    - [ ] DISC (generated)
    - [x] IPA
    - [ ] IPA (generated)
    - [ ] NETTalk (generated)
    ...

(NB: PronLex itself is proprietary, but my plan was to distribute generated forms for it under a CC license, obscuring whether or not a given form was in the original.)
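
One way the checkbox layout above could map onto the database is sketched below. This is only an illustration under assumptions: the `pron` table, its column names, and the `lookup` helper are all hypothetical and are not taken from the existing code, and the inserted rows are toy placeholders rather than real PronLex or WikiPron data. Each ticked checkbox is treated as a (source, notation, generated) triple, so "IPA" and "IPA (generated)" for the same source stay distinct.

```python
import sqlite3

# Hypothetical schema: one row per pronunciation, with a flag recording
# whether the form was generated automatically or found in the source.
SCHEMA = """
CREATE TABLE pron (
    source    TEXT NOT NULL,     -- e.g. "PronLex", "WikiPron US"
    notation  TEXT NOT NULL,     -- e.g. "ARPAbet", "DISC", "IPA", "NETTalk"
    generated INTEGER NOT NULL,  -- 1 = generated, 0 = found in the source
    word      TEXT NOT NULL,
    pron      TEXT NOT NULL
)
"""


def lookup(conn, word, selections):
    """Return rows for `word` matching the ticked checkboxes.

    `selections` is an iterable of (source, notation, generated) triples,
    one per ticked checkbox (a hypothetical representation of the form).
    """
    results = []
    for source, notation, generated in selections:
        rows = conn.execute(
            "SELECT source, notation, generated, pron FROM pron "
            "WHERE word = ? AND source = ? AND notation = ? AND generated = ?",
            (word, source, notation, int(generated)),
        ).fetchall()
        results.extend(rows)
    return results


conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
# Toy placeholder rows for illustration only.
conn.executemany(
    "INSERT INTO pron VALUES (?, ?, ?, ?, ?)",
    [
        ("PronLex", "ARPAbet", 1, "cat", "K AE1 T"),
        ("WikiPron US", "IPA", 0, "cat", "k æ t"),
        ("WikiPron US", "IPA", 1, "cat", "k æ t"),
    ],
)

# The two boxes ticked in the mock-up: PronLex ARPAbet (generated)
# and WikiPron US IPA (found in the DB).
selected = [("PronLex", "ARPAbet", True), ("WikiPron US", "IPA", False)]
hits = lookup(conn, "cat", selected)
```

Keeping `generated` as a column, rather than using separate tables, would let populate.py load both kinds of rows the same way, though whether that suits the existing models is an open question.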


Labels: enhancement (New feature or request)
