You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since this library works on Jupyter notebook (and behind the scene js being used is an implementation detail), it would be great if we can get text length in Python friendly way; so that we can do the further processing on the same notebook without any issues.
The text was updated successfully, but these errors were encountered:
Thank you for developing/maintaining this tool 🙏
I've encountered seemingly wrong entity span information when I have emojis in a text.
How to reproduce the issue
First in a jupyter notebook,
Annotate "John" as NAME in all 3 texts.
Spans are set like this
Expect slicing texts with position information all give "John", but actually not when emojis present before "John"s.
Expected behaviour
It prints below (all the same)
Possible cause
It seems to be related to the difference in how Python and JavaScript count string length. c.f. JavaScript vs Python emoji length
Since this library works on Jupyter notebook (and behind the scene js being used is an implementation detail), it would be great if we can get text length in Python friendly way; so that we can do the further processing on the same notebook without any issues.
The text was updated successfully, but these errors were encountered: