How do I write a general/flexible regular expression in Python?

1 answer

User

#1 · Posted 2024-09-29 19:34:08

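Use split(), which splits on any run of whitespace. The first word is the first name, the last word is the last name, and everything in between is the middle name(s):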

names = ["John M. Drell", "John Drell"]
for name in names:
    firstname, *middlenames, lastname = name.split()
    print(f'First name: {firstname}, Middle name(s): {" ".join(middlenames)}, Last name: {lastname}')

See the Python proof.
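The starred unpacking above raises a ValueError when a name has fewer than two words. A minimal sketch of a wrapper that handles that case is below. The function name `split_name` and the choice to treat a single word as a first name only are assumptions made for illustration, not part of the original answer.

```python
def split_name(name):
    """Split a full name into (first, middle, last) on whitespace."""
    parts = name.split()
    if not parts:
        raise ValueError("empty name")
    if len(parts) == 1:
        # Assumption: a single word is treated as a first name only.
        return parts[0], "", ""
    first, *middle, last = parts
    # Middle names are re-joined with single spaces.
    return first, " ".join(middle), last


for full_name in ["John M. Drell", "John Drell", "Madonna"]:
    print(split_name(full_name))
```

Whether a one-word name should count as a first name, a last name, or an error depends on the data, so adjust that branch as needed.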

With a regex, use an optional group, plus \S to match any non-whitespace character:

^(?P<firstname>\S+)(?:\s+(?P<middlename>\S+(?: +\S+)*))?\s+(?P<lastname>\S+)$

See the regex proof.
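A short sketch of how the pattern might be used from Python with the standard `re` module follows. The names `NAME_RE` and `parse_name` are hypothetical helpers introduced here for illustration.

```python
import re

# The pattern from the answer, compiled once for reuse.
NAME_RE = re.compile(
    r"^(?P<firstname>\S+)(?:\s+(?P<middlename>\S+(?: +\S+)*))?\s+(?P<lastname>\S+)$"
)


def parse_name(name):
    """Return a dict of the named groups, or None if the name does not match."""
    m = NAME_RE.match(name)
    if m is None:
        return None
    # groupdict() maps an unmatched optional group (middlename) to None.
    return m.groupdict()


for full_name in ["John M. Drell", "John Drell"]:
    print(parse_name(full_name))
```

Note that the pattern requires at least two words, so a one-word name yields None here rather than raising an error.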

Explanation

                                        
  ^                        the beginning of the string
  (?P<firstname>           group and capture to "firstname":
    \S+                      non-whitespace (1 or more times, matching
                             as many as possible)
  )                        end of "firstname"
  (?:                      group, but do not capture (optional,
                           matched if possible):
    \s+                      whitespace (space, \t, \n, \r, \f, \v)
                             (1 or more times, matching as many as
                             possible)
    (?P<middlename>          group and capture to "middlename":
      \S+                      non-whitespace (1 or more times,
                               matching as many as possible)
      (?:                      group, but do not capture (0 or more
                               times, matching as many as possible):
         +                       a literal space (1 or more times,
                                 matching as many as possible)
        \S+                      non-whitespace (1 or more times,
                                 matching as many as possible)
      )*                       end of grouping
    )                        end of "middlename"
  )?                       end of grouping
  \s+                      whitespace (1 or more times, matching as
                           many as possible)
  (?P<lastname>            group and capture to "lastname":
    \S+                      non-whitespace (1 or more times, matching
                             as many as possible)
  )                        end of "lastname"
  $                        the end of the string, or just before a
                           trailing \n
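The same breakdown can be kept next to the pattern itself by compiling it with `re.VERBOSE`, which ignores whitespace and `#` comments inside the pattern. This is a sketch under that assumption, and `NAME_VERBOSE` is a name introduced here. Because verbose mode ignores a bare space, the literal space in the middle-name group is written as `[ ]`.

```python
import re

NAME_VERBOSE = re.compile(
    r"""
    ^
    (?P<firstname>\S+)        # first word
    (?:                       # optional middle part
        \s+
        (?P<middlename>
            \S+
            (?:[ ]+\S+)*      # further middle names, separated by spaces
        )
    )?
    \s+
    (?P<lastname>\S+)         # last word
    $                         # end of string, or just before a final newline
    """,
    re.VERBOSE,
)

m = NAME_VERBOSE.match("John M. Drell")
print(m.group("firstname"), m.group("middlename"), m.group("lastname"))

# "$" also matches just before a trailing newline, so this still matches.
print(NAME_VERBOSE.match("John Drell\n") is not None)
```

If a trailing newline should be rejected, replacing `$` with `\Z` (or calling `fullmatch` without the anchors) is one option.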
