Regular expressions don't like international characters

Published 2024-10-04 09:25:16


Possible Duplicate:
matching unicode characters in python regular expressions

Using

re.findall(r'\w+', ip)

on Fältskog returns F and ltskog. I tried with both a string and unicode, but the result is the same.


2 answers

User

Answer 1 · edited 2024-10-04 09:25:16

You need to set the appropriate flags (in this case, re.UNICODE tells re what \w is):

re.findall(r'\w+', ip, re.UNICODE)

# EDIT

Python 2.7.3 (default, Aug  1 2012, 05:16:07) 
[GCC 4.6.3] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import re
>>> re.findall(r"\w+", u"Fältskog", re.UNICODE)
[u'F\xe4ltskog']
>>>
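
For comparison, here is a minimal sketch assuming Python 3 rather than the Python 2.7 session above. In Python 3, str patterns are Unicode-aware by default, and re.ASCII brings back the ASCII-only \w seen in the question. The name ip comes from the question; its value here is an assumed sample.

```python
import re

ip = "F\u00e4ltskog"  # "Fältskog", assumed sample input from the question

# re.ASCII limits \w to [a-zA-Z0-9_], so the "ä" splits the word in two.
ascii_words = re.findall(r"\w+", ip, re.ASCII)

# Without re.ASCII, a Python 3 str pattern treats \w as Unicode-aware,
# which is the behaviour re.UNICODE switches on in Python 2.
unicode_words = re.findall(r"\w+", ip)

print(ascii_words)    # ['F', 'ltskog']
print(unicode_words)  # ['Fältskog']
```

On Python 2 the same effect needs a unicode string plus the re.UNICODE flag, as in the session above.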

User

Answer 2 · edited 2024-10-04 09:25:16

re.findall(r'[å2019;Ä201; \w]+', ip)

You can also do it this way if you want to be more explicit.
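
A rough sketch of that explicit-class idea, assuming Python 3. The set of extra letters (å ä ö Å Ä Ö) is a hypothetical choice for illustration and is not taken from the answer. re.ASCII is passed only to show that the match succeeds because the letters are listed in the class, not because \w is Unicode-aware.

```python
import re

ip = "F\u00e4ltskog"  # "Fältskog", assumed sample input from the question

# Hypothetical list of extra letters to accept: å ä ö Å Ä Ö
EXTRA_LETTERS = "\u00e5\u00e4\u00f6\u00c5\u00c4\u00d6"

# Character class = the listed letters plus ASCII word characters.
pattern = re.compile("[" + EXTRA_LETTERS + r"\w]+", re.ASCII)

words = pattern.findall(ip)
print(words)  # ['Fältskog']
```

The trade-off is that every accepted letter has to be listed by hand, so a name containing a character outside the class would still be split.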
