Define normalization of full-width numeric characters in <input type=number> user input #11616

kyouhei-horizumi · 2025-09-02T18:42:30Z

Summary

This patch adds guidance that user agents should accept and normalize full-width characters commonly produced by CJK input methods in fields. This includes:

Full-width digits (U+FF10–U+FF19)
Full-width hyphen-minus (U+FF0D)
Prolonged sound mark (U+30FC)
Unicode minus sign (U+2212)
Full-width full stop (U+FF0E)

These characters must be normalized to their ASCII equivalents before applying floating-point parsing rules.

This normalization applies only to user input (keyboard or IME), not to script-assigned values via the .value IDL attribute, which must remain ASCII-only.

Motivation

In Japanese environments, users frequently stay in full-width input modes while tabbing through forms. Requiring an IME/keyboard mode switch just to enter numbers is poor UX.
Popular Japanese IMEs can emit U+30FC when a leading minus is added after digits in full-width mode; accepting it at edit time avoids failed entries and matches user expectations.
The change keeps the wire format locale-neutral and aligns with I18N guidance to localize display via Intl.NumberFormat.

Non-goals

No interpretation on scripted assignment (input.value = ...).
No locale-dependent grouping/formatting beyond the enumerated characters.
This PR intentionally limits acceptance to the listed characters; broader sets can be discussed in the linked issue.

Related issue

Fixes/Refs: Define optional normalization of full-width characters in <input type="number"> #11395

Checklist

At least two implementers are interested (and none opposed):
- …
- …
Tests are written and can be reviewed and commented upon at:
- …
Implementation bugs are filed:
- Chromium: …
- Gecko: …
- WebKit: …
- Deno (only for timers, structured clone, base64 utils, channel messaging, module resolution, web workers, and web storage): …
- Node.js (only for timers, structured clone, base64 utils, channel messaging, and module resolution): …
Corresponding HTML AAM & ARIA in HTML issues & PRs:
MDN issue is filed: …
The top of this comment includes a clear commit message to use.

(See WHATWG Working Mode: Changes for more details.)

…number> user input This patch adds guidance that user agents should accept and normalize full-width characters commonly produced by CJK input methods in <input type="number"> fields. This includes: - Full-width digits (U+FF10–U+FF19) - Full-width hyphen-minus (U+FF0D) - Prolonged sound mark (U+30FC) - Unicode minus sign (U+2212) - Full-width full stop (U+FF0E) These characters must be normalized to their ASCII equivalents before applying floating-point parsing rules. This normalization applies only to user input (keyboard or IME), not to script-assigned values via the `.value` IDL attribute, which must remain ASCII-only. See issue: whatwg#11395

annevk · 2025-09-15T14:35:57Z

source

+    <li>the minus sign (U+2212), and</li>
+    <li>full-width full stop (U+FF0E).</li>
+  </ul>
+  These characters must be interpreted according to their Unicode numeric meaning and normalized to their ASCII equivalents


This is a good start, but I think we want to spell out what they end up mapping to.

We can partially explain it with NFKC I think, but some substitutions might be needed as well, for instance for U+30FC.

@annevk Thanks, I agree — spelling out the exact mappings makes sense, and NFKC alone won’t cover everything.

I’ll be giving a talk at a conference in Japan next Sunday, where I expect to gather input from Japanese developers who actively face these issues.
I’d like to incorporate their feedback before updating the PR, so I plan to follow up in about 2–3 weeks.

Partial NFKC explanation probably makes the situation less clear than giving an explicit mapping for this handful of characters.

(I think it generally makes sense to do a mapping like this.)

@whatwg/i18n is there anything in Unicode we could borrow for this that's better suited than normalization? It's not directly web-exposed, but it still seems unfortunate to have mappings of characters maintained outside of Unicode.

The numeric property does a better job for digits than normalization. The other symbols that can appear in number formats (decimal separators, grouping separators, plus/minus signs, etc) can be found in CLDR data (noting that the meaning of the symbols depends on the locale). To @hsivonen's point, there are just a handful of wide/narrow equivalents. CLDR doesn't list these and it would be better to make that list than to introduce NFKC (even thought NFKC for the characters in question is identical). I'd probably push on the CLDR folks to address this, so that it percolates downstream into ICU, Intl/JS, etc. (and not just HTML), but I don't believe that there is a ready-made mapping.

PS> I added this to I18N's agenda for 2025-09-25

kyouhei-horizumi mentioned this pull request Sep 2, 2025

Define optional normalization of full-width characters in <input type="number"> #11395

Open

annevk reviewed Sep 15, 2025

View reviewed changes

annevk added normative change topic: forms i18n-tracker Group bringing to attention of Internationalization, or tracked by i18n but not needing response. i18n-jlreq Notifies Japanese script experts of relevant issues labels Sep 15, 2025

w3cbot mentioned this pull request Sep 16, 2025

Define normalization of full-width numeric characters in <input type=number> user input w3c/i18n-activity#2032

Open

xfq added the i18n-clreq Notifies Chinese script experts of relevant issues label Sep 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Define normalization of full-width numeric characters in <input type=number> user input #11616

Define normalization of full-width numeric characters in <input type=number> user input #11616

Uh oh!

kyouhei-horizumi commented Sep 2, 2025

Uh oh!

annevk Sep 15, 2025

Uh oh!

kyouhei-horizumi Sep 15, 2025 •

edited

Loading

Uh oh!

hsivonen Sep 22, 2025

Uh oh!

annevk Sep 22, 2025

Uh oh!

aphillips Sep 22, 2025

Uh oh!

aphillips Sep 22, 2025

Uh oh!

Uh oh!

Define normalization of full-width numeric characters in <input type=number> user input #11616

Are you sure you want to change the base?

Define normalization of full-width numeric characters in <input type=number> user input #11616

Uh oh!

Conversation

kyouhei-horizumi commented Sep 2, 2025

Summary

Motivation

Non-goals

Related issue

Checklist

Uh oh!

annevk Sep 15, 2025

Choose a reason for hiding this comment

Uh oh!

kyouhei-horizumi Sep 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hsivonen Sep 22, 2025

Choose a reason for hiding this comment

Uh oh!

annevk Sep 22, 2025

Choose a reason for hiding this comment

Uh oh!

aphillips Sep 22, 2025

Choose a reason for hiding this comment

Uh oh!

aphillips Sep 22, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kyouhei-horizumi Sep 15, 2025 •

edited

Loading