Conjuncts are not selected as a single unit when styling initials #111

Open
r12a opened this issue Feb 5, 2020 · 1 comment
Labels
doc:gujr gap i:initials Styling initials l:gu Gujurati language & script p:basic s:gujr Gurajati script

Comments

r12a commented Feb 5, 2020

This issue is applicable to most languages that form conjuncts from consonant clusters using an invisible virama.

Because of the problems associated with grapheme cluster boundaries (see above), first-letter selection in CSS doesn't work well for conjuncts. For example, when ::first-letter is used in a selector, Chrome fails to style the whole conjunct in પ્રૌદ્યોગીકી: it styles only પ્ instead of પ્રૌ. Internet Explorer behaves similarly and styles only પ. This is problematic for many words in a script such as Gujarati, and forces content authors to use explicit spans rather than the proper mechanism for selecting the initial letter.
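To make the mismatch concrete, here is a small Python sketch that lists the code points behind each of the three selections mentioned above. The slice lengths (1, 2 and 4 code points) are read off the example word as written in this comment, and the helper name `describe` is only illustrative.

```python
import unicodedata

# The example word, written with escapes so the code points are explicit:
# PA, VIRAMA, RA, VOWEL SIGN AU, then the rest of the word.
word = "\u0AAA\u0ACD\u0AB0\u0ACC\u0AA6\u0ACD\u0AAF\u0ACB\u0A97\u0AC0\u0A95\u0AC0"

def describe(s):
    """Return a 'U+XXXX NAME' string for every code point in s."""
    return [f"U+{ord(c):04X} {unicodedata.name(c)}" for c in s]

selections = {
    "Internet Explorer (consonant only)": word[:1],
    "Chrome (consonant + virama)": word[:2],
    "expected (whole conjunct + vowel sign)": word[:4],
}

for label, part in selections.items():
    print(label)
    for line in describe(part):
        print("   ", line)
```

The expected initial spans four code points, with the virama sitting between the two consonants of the conjunct.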

Indian Layout Requirements provides a grammar for Indian orthographic syllable boundaries that works for Gujarati. CSS uses the concept of a 'typographic character unit', rather than a grapheme cluster, in its specs, with the explanation that these cases are beyond the scope of the grapheme cluster concept and that implementations should provide appropriate support. In addition, a modification to the concept of grapheme cluster is currently in development at the Unicode Consortium, which is likely to resolve the problem for a script like Gujarati.
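As a rough illustration of what selecting an orthographic syllable involves, the following Python sketch extracts the first one from a Gujarati string. It is a simplified approximation written for this comment, not the grammar from Indian Layout Requirements and not what any browser implements: the function names are hypothetical, the code point ranges are taken from the Gujarati Unicode block, and cases such as ZWJ/ZWNJ or leading punctuation are ignored.

```python
VIRAMA = "\u0ACD"
NUKTA = "\u0ABC"

def is_consonant(ch):
    # Gujarati consonants KA..HA (the range also covers a few unassigned
    # code points, which do not occur in valid text).
    return "\u0A95" <= ch <= "\u0AB9"

def is_dependent_sign(ch):
    # Vowel signs AA..AU, vocalic L/LL signs, candrabindu, anusvara, visarga.
    return "\u0ABE" <= ch <= "\u0ACC" or ch in "\u0A81\u0A82\u0A83\u0AE2\u0AE3"

def first_orthographic_syllable(text):
    """Return the leading consonant cluster plus any dependent signs."""
    if not text:
        return ""
    n = len(text)
    i = 1  # always take the first code point
    if is_consonant(text[0]):
        if i < n and text[i] == NUKTA:
            i += 1
        # Extend across each virama that joins another consonant.
        while i + 1 < n and text[i] == VIRAMA and is_consonant(text[i + 1]):
            i += 2
            if i < n and text[i] == NUKTA:
                i += 1
        # Keep a trailing virama that is not followed by a consonant.
        if i < n and text[i] == VIRAMA:
            i += 1
    # Attach vowel signs and other dependent marks.
    while i < n and is_dependent_sign(text[i]):
        i += 1
    return text[:i]

word = "\u0AAA\u0ACD\u0AB0\u0ACC\u0AA6\u0ACD\u0AAF\u0ACB\u0A97\u0AC0\u0A95\u0AC0"
initial = first_orthographic_syllable(word)
print(initial, len(initial))
```

For the example word this yields the four code points PA, VIRAMA, RA and VOWEL SIGN AU, which is the unit an author would currently have to wrap in an explicit span.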

In addition, the alignment of styled initial-letter character glyphs with the rest of the text is not clearly specified or implemented.

For more details, see this GitHub issue, which is being used to track this gap.

r12a commented Feb 5, 2020

The first comment in this issue contains text that will automatically appear in one or more gap-analysis documents as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point.

Relevant gap analysis documents include:
Bengali, Devanagari, Gujarati

@r12a r12a changed the title from "Incorrect segmentation for styling initials" to "Consonant clusters with a visible virama should only have the first consonant+virama highlighted" Aug 12, 2021
@r12a r12a changed the title from "Consonant clusters with a visible virama should only have the first consonant+virama highlighted" to "Conjuncts are not selected as a single unit when styling initials" Aug 12, 2021
@r12a r12a added l:gu Gujurati language & script l:hi Hindi, Devanagari script l:bn Bengali language & script labels May 1, 2024
@r12a r12a added s:gujr Gurajati script s:beng Bengali script s:deva Devanagari script and removed l:hi Hindi, Devanagari script l:bn Bengali language & script s:beng Bengali script s:deva Devanagari script labels Jul 2, 2024