<p>请试试这个:</p>
<pre><code>def main():
line = r'<para>06.00 na hrvatskom Zona gradnje: Izgradnja zelenog Pekinga</para>'
if re.search(r'^(<para>)\d{2}.\d{2}', line):
line_time = re.findall(r'\d{2}.\d{2}',line)[0]
line_title = line[line.find(re.findall(r'[A-Z]',line)[0]):line.find(':')]
line_sub = line[line.find(':')+1:line.find(r'</')].strip()
print '<start>'+line_time+'</start><title>'+line_title+'</title><sub>'+line_sub+'</sub>'
</code></pre>
<p>如果这是你需要的,请告诉我。你知道吗</p>
<p><strong>输出</strong></p>
<pre><code><start>06.00</start><title>Zona gradnje</title><sub>Izgradnja zelenog Pekinga</sub>
</code></pre>
<p>用于从文件中读取行:</p>
<pre><code> with open(r'D:\Trading\PythonScholar\input\input.tx','r') as file:
for line in file:
if re.search(r'^(<para>)\d{2}.\d{2}', line.strip()):
line_time = re.findall(r'\d{2}.\d{2}',line)[0]
line_title = line[line.find(re.findall(r'[A-Z]',line)[0]):line.find(':')]
line_sub = line[line.find(':')+1:line.find(r'</')].strip()
print '<start>'+line_time+'</start><title>'+line_title+'</title><sub>'+line_sub+'</sub>'
</code></pre>
<p>希望这有帮助。你知道吗</p>