Unicode support for Lean 4.
Unicode properties that are currently supported by UnicodeBasic include:
- Alphabetic
- Bidi_Class
- Bidi_Control
- Bidi_Mirrored
- Canonical_Combining_Class
- Case_Ignorable
- Cased
- Decomposition_Mapping
- Decomposition_Type
- General_Category
- Hex_Digit
- Math
- Name
- Numeric_Type
- Numeric_Value
- Simple_Lowercase_Mapping
- Simple_Uppercase_Mapping
- Simple_Titlecase_Mapping
- Lowercase
- Uppercase
- White_Space
To keep the UnicodeBasic library lightweight, only commonly used properties are supported. If you need a property that is not in the list above, please submit a feature request!
Add the following dependency to your project's lakefile.toml:
[[require]]
name = "UnicodeBasic"
git = "https://github.com/fgdorais/lean4-unicode-basic.git"
rev = "main"

Or your project's lakefile.lean:

require UnicodeBasic from git
  "https://github.com/fgdorais/lean4-unicode-basic" @ "main"

Then add import UnicodeBasic at the top of any Lean file where you plan to use this library.
Use the branch "docs" instead of "main" if you need to have local library documentation. (doc-gen4 cannot be used directly because of a circular dependency.)
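As a sketch of the branch switch described above, a lakefile.lean requirement tracking the "docs" branch could look like the following. It reuses the repository URL from the installation snippet and changes only the revision; nothing else is assumed.

```lean
-- lakefile.lean fragment: same requirement as before, pinned to "docs" instead of "main"
require UnicodeBasic from git
  "https://github.com/fgdorais/lean4-unicode-basic" @ "docs"
```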
Documentation for the library is located at dorais.org/lean4-unicode-basic/docs/.
The main entry point is the root file UnicodeBasic. This file contains a description of the main API as well as all primary library functions. The file UnicodeBasic.Types contains all the primary data types used in the library.
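A minimal usage sketch follows. The function names `Unicode.getName`, `Unicode.isAlphabetic`, and `Unicode.isWhiteSpace` are assumptions inferred from the supported property names, not confirmed signatures; check the root file UnicodeBasic or the library documentation for the exact API before relying on them.

```lean
import UnicodeBasic

-- Assumed function names; confirm them against the library documentation.
#eval Unicode.getName 'A'        -- looks up the Name property
#eval Unicode.isAlphabetic 'A'   -- tests the Alphabetic property
#eval Unicode.isWhiteSpace ' '   -- tests the White_Space property
```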
The remaining files are implementation details, though some may be of interest to developers. For example, UnicodeBasic.CharacterDatabase defines a stream type for parsing files from the Unicode Character Database.
- The Lean 4 / Unicode Basic library is copyright © 2023-2024 François G. Dorais. The library is released under the Apache 2.0 license. See the file LICENSE for additional details.
- The UnicodeData.txt and PropList.txt files are copyright © 1991-2024 Unicode, Inc. The files are distributed under the Unicode® Copyright and Terms of Use. See the file LICENSE-UNICODE for additional details.