用Python获取部分文件名

网友

1楼 · 编辑于 2024-09-25 18:15:15

如果数字是可变长度的，则需要regex模块“re”

import re

# create and compile a regex pattern
pattern = re.compile(r"_([0-9]+)\.[^\.]+$")

pattern.search("abc_ID_8423.pdf").group(1)
Out[23]: '8423'

Regex通常用于匹配变量字符串。我刚写的regex说：

查找下划线（“\u”），后跟可变位数（“[0-9]+”），后跟字符串中的最后一个句点（“\.[^.]+$”）

网友

2楼 · 编辑于 2024-09-25 18:15:15

这里还有另一种选择，使用re.split()，这可能更接近于您正试图做的事情的精神（尽管使用re.match()和re.search()等解决方案同样有效、有用和有指导意义）：

>>> import re
>>> re.split("[_.]", "dddddd_ID_4421.pdf")[-2]
'4421'
>>>

网友

3楼 · 编辑于 2024-09-25 18:15:15

下面是一个使用re模块的简单解决方案，如其他答案中所述。

# Libraries
import re

# Example filenames. Use glob as described below to grab your pdf filenames
file_list = ['name_ID_123.pdf','name2_ID_456.pdf'] # glob.glob("*.pdf") 

for fname in file_list:
    res = re.findall("ID_(\d+).pdf", fname)
    if not res: continue
    print res[0] # You can append the result to a list

下面应该是你的输出。你应该能够适应其他模式。

# Output
123
456

祝你好运！

相关问题更多 >

编程相关推荐

热门问题

热门文章

用Python获取部分文件名

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >