Skip to content

fix #11: preserve Chinese punctuation (pattern 42)#19

Open
cjxzdzh wants to merge 1 commit into
op7418:mainfrom
cjxzdzh:fix/11-chinese-punctuation
Open

fix #11: preserve Chinese punctuation (pattern 42)#19
cjxzdzh wants to merge 1 commit into
op7418:mainfrom
cjxzdzh:fix/11-chinese-punctuation

Conversation

@cjxzdzh
Copy link
Copy Markdown

@cjxzdzh cjxzdzh commented May 18, 2026

Fix #11: Chinese punctuation being replaced with English equivalents

Problem

After humanizer-zh processes Chinese text, Chinese punctuation marks (。,、;:?!「」『』《》) are replaced with English equivalents (.,,;:?!""'').

Changes

  • Core rule 6: Add explicit instruction to preserve Chinese punctuation
  • Pattern 42: New pattern "Punctuation Replacement — Chinese → English" with full punctuation table
  • Quick checklist: Add Chinese punctuation check item
  • Test example: Add example demonstrating Chinese punctuation preservation
  • Version: Bump to 2.1.0

Chinese punctuation to preserve

Punctuation Description
句号 (period)
逗号 (comma)
顿号 (enumeration comma)
分号 (semicolon)
冒号 (colon)
问号
感叹号
「」 中文引号
『』 中文引号(嵌套)
《》 书名号
() 括号
—— 破折号
…… 省略号

Closes #11

- Add core rule 6: preserve Chinese punctuation (。,、;:?!「」『』《》)
- Add new pattern 42: Punctuation Replacement — Chinese → English
- Add Chinese punctuation check to quick checklist
- Add test example demonstrating Chinese punctuation preservation
- Update version to 2.1.0
- Update pattern count references from 41 to 42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

优化后的中文标点不正确,均为英文标点

1 participant