从字符串解析格式怪异的时间表达式

estimated = track_data['project']['estimate']['estimate'].split('PT')[1] estimated_hours = estimated.split('H')[0] estimated_minutes = estimated_hours.split('M')[0] estimated_seconds = estimated_minutes.split('S')[0]

3条回答

网友

1楼 · 编辑于 2024-09-28 05:27:17

可以使用正则表达式：

import re

PROJECT_TIME_REGEX = re.compile(r'PT(?:(\d+)H)?(?:(\d+)M)?(?:(\d+)S)?')

def get_project_time(s):
    m = PROJECT_TIME_REGEX.match(s)
    if not m:
        raise ValueError('invalid string')
    hour, min, sec = (int(g) if g is not None else 0 for g in m.groups())
    return hour, min, sec

print(get_project_time('PT5H12M3S'))
# (5, 12, 3)
print(get_project_time('PT12M3S'))
# (0, 12, 3)
print(get_project_time('PT0S'))
# (0, 0, 0)
print(get_project_time('PT5H'))
# (5, 0, 0)

网友

2楼 · 编辑于 2024-09-28 05:27:17

怎么样？你知道吗

import re

def parsept(ptstring):
    regex = re.compile(
            r'PT'
            r'(?:(?P<h>\d+)H)?'
            r'(?:(?P<m>\d+)M)?'
            r'(?:(?P<s>\d+)S)?')
    m = regex.match(ptstring)
    if m:
        return (int(m.group('h')) if m.group('h') else 0, 
            int(m.group('m') if m.group('m') else 0,
            int(m.group('s') if m.group('s') else 0)
    # else
    raise ValueError('{0} does not look like a valid PTxHyMzS string'.format(ptstring))

网友

3楼 · 编辑于 2024-09-28 05:27:17

您可以使用正则表达式和正则表达式中的组来捕获小时、分钟和秒—所有这些都是可选的。你知道吗

大致如下： /PT(\d*)H?(\d*)M?(\d*)S?/

括号表示组。因此，您的捕获组将包含小时、分钟和秒（所有这些都是可选的）。你知道吗

但是正则表达式不是那么可读。我强烈建议尝试像Parsec这样的解析器组合库。解析器组合器更具可读性和可维护性，编写起来也很有趣。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章