Proofread the character-pronunciation source table (勘校字音源码表) #342

Open
wants to merge 18 commits into base: master



@kkhkl kkhkl commented Dec 27, 2024

[Image: 2024-12-28_110948]
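This PR changes the priority of the common readings of polyphonic characters (多音字), adds readings and rule phrases for polyphonic characters that were not yet annotated, and proofreads the character-pronunciation source table and the rule table.
Rules are regrouped and sorted by single character, so that later edits to the polyphonic-character rules are easy to see at a glance. Only the less common readings are annotated explicitly, which simplifies the rules and widens the range of text that is read correctly.
It also fixes missing pinyin codes.
Special notes:
【串行】 In the Contemporary Chinese Dictionary (现代汉语词典), 7th edition, this word has only the reading hang, in the sense of rows or side-by-side arrangement. The meaning is the same in the telecommunications and electronics industries, so it is changed back to the correct reading hang. 并行 has several readings and meanings, so it is not added to the rule phrases.
【咯血】 In this word 血 is a colloquial adjective, so it takes the correct reading xie.
【一行行】 The descriptive sense is used, so both 行 are read hang. The tongue-twister readings of 一行行,行行行 are not adopted, because they are not everyday readings.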
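The approach described above (the common reading as the default, with only the less common readings annotated through rule phrases) can be sketched as follows. This is a minimal illustration, not the project's actual implementation: the table names `DEFAULT_READING` and `PHRASE_RULES`, the function `annotate`, and the toneless pinyin spellings are all assumptions made for the example.

```python
# Hypothetical default table: the most common reading of each character.
DEFAULT_READING = {"串": "chuan", "行": "xing", "一": "yi", "血": "xue"}

# Hypothetical rule phrases: only the less common readings are listed here.
PHRASE_RULES = {
    "串行": ["chuan", "hang"],
    "一行行": ["yi", "hang", "hang"],
    "咯血": ["ka", "xie"],
}


def annotate(text):
    """Return one reading per character, preferring the longest rule phrase."""
    max_len = max(len(phrase) for phrase in PHRASE_RULES)
    readings = []
    i = 0
    while i < len(text):
        # Try the longest candidate phrase first, down to two characters.
        for size in range(min(max_len, len(text) - i), 1, -1):
            phrase = text[i:i + size]
            if phrase in PHRASE_RULES:
                readings.extend(PHRASE_RULES[phrase])
                i += size
                break
        else:
            # No rule phrase matched: fall back to the default reading,
            # or keep the character itself if it is not in the table.
            ch = text[i]
            readings.append(DEFAULT_READING.get(ch, ch))
            i += 1
    return readings


print(annotate("串行"))    # ['chuan', 'hang']
print(annotate("行"))      # ['xing']
```

With this layout, a word such as 并行 that has more than one valid reading simply gets no rule phrase and falls through to the defaults.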

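Sync ChineseCode: change the reading priority of common polyphonic characters, simplify the polyphonic-character rules, and improve polyphonic-character recognition and pronunciation accuracy.
Change the priority of the common readings of some polyphonic characters, and supplement and correct the readings of others.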
@kkhkl kkhkl marked this pull request as ready for review December 27, 2024 10:03
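Add rule phrases. In this change 【一行行】 takes the descriptive sense, so both 行 are read hang; the tongue-twister readings of 一行行,行行行 are not adopted, because they are not everyday readings.
Adjust common readings in the WordPinyin rules.
In the polyphonic-character rule phrases, entries are now grouped and sorted side by side by individual character, so that later additions and edits are easier to review. This commit adds more rule phrases and annotates only the less common readings explicitly, with the common reading as the default, which simplifies the annotation rules and improves accuracy.
Resolve a conflict from tumuyan's earlier commits, where the transliterated word '南无' clashed with a place name ending in 南 followed by 无线, and address the annotation of linked readings in classical poems.
Proofread, against dictionary definitions, the annotation errors and common-reading adjustments for all polyphonic characters with a character frequency greater than 1.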
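The '南无' conflict mentioned in the commits above can be illustrated with a small sketch. The commits do not say how the conflict was resolved, so the negative lookahead used here is only one possible guard and is an assumption; the names `NAMO` and `find_namo` are hypothetical.

```python
import re

# Match the transliterated word 南无 only when it is NOT followed by 线,
# so that a place name ending in 南 followed by 无线 is left alone.
NAMO = re.compile(r"南无(?!线)")


def find_namo(text):
    """Return the start offsets where 南无 should get its special reading."""
    return [match.start() for match in NAMO.finditer(text)]


print(find_namo("南无阿弥陀佛"))  # [0]
print(find_namo("河南无线电"))    # []
```

A plain substring rule for 南无 would also fire inside 河南无线电, which is the kind of misreading the commit describes.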
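@kkhkl kkhkl changed the title from "Improve polyphonic-character recognition" (增强多音字识别) to "Proofread the character-pronunciation source table" (勘校字音源码表) Dec 29, 2024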
kkhkl commented Dec 30, 2024

Proofreading of the codes is complete; the configuration changes are still pending.
