Python正则表达式modu中的简单大小写折叠与完整大小写折叠

Python 3.6.7 (default, Oct 22 2018, 11:32:17) [GCC 8.2.0] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import regex >>> r = regex.compile("(?V0i)и") >>> r regex.Regex('(?V0i)и', flags=regex.I | regex.V0) >>> r.search("И") <regex.Match object; span=(0, 1), match='И'> >>> regex.search("(?V0i)é", "É") <regex.Match object; span=(0, 1), match='É'> >>> regex.search("(?V0i)é", "E") >>> regex.search("(?V1i)é", "E")

1条回答

网友

1楼 · 发布于 2024-09-29 23:25:23

它跟在Unicode case folding table后面。节选：

# The entries in this file are in the following machine-readable format:
#
# <code>; <status>; <mapping>; # <name>
#
# The status field is:
# C: common case folding, common mappings shared by both simple and full mappings.
# F: full case folding, mappings that cause strings to grow in length. Multiple characters are separated by spaces.
# S: simple case folding, mappings to single characters where different from F.

[...]

# Usage:
#  A. To do a simple case folding, use the mappings with status C + S.
#  B. To do a full case folding, use the mappings with status C + F.

只有少数特殊字符的折叠方式不同，例如小型大写拉丁字母s：

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章