<p>首先,你必须<a href="https://stackoverflow.com/questions/606191/convert-bytes-to-a-python-string">convert your ^{<cd1>}s to ^{<cd2>} objects</a>。你知道吗</p>
<p>通过您提供的示例:</p>
<pre><code>examples = [
b'2097 - MILANO CENTRALE|2097-S01700\n',
b'123 - ROMA TERMINI|123-S01358\n',
b'123 - TREVIGLIO|123-S01703\n'
]
</code></pre>
<p>假设格式为:</p>
<pre><code>[TRAIN_NAME]|[TRAIN_NAME_REPEATED]-[TRAIN_NUMBER]\n
</code></pre>
<p>我们不需要任何正则表达式,只需按分隔符拆分条目:</p>
<pre><code>for example_bytes in examples:
example = example_bytes.decode("utf-8").split("|")
# example = ['2097 - MILANO CENTRALE', '2097-S01700\n']
train_name = example[0]
# train_name = '2097 - MILANO CENTRALE'
train_number = example[1].split("-")[1]
# train_number = 'S01358'
A.append(train_name)
B.append(train_number.rstrip())
</code></pre>
<p>然后看看结果:</p>
<pre><code>print(A)
# ['2097 - MILANO CENTRALE', '123 - ROMA TERMINI', '123 - TREVIGLIO']
print(B)
# ['S01700', 'S01358', 'S01703']
</code></pre>
<p>如果您不想重复输入(如果可能的话),我建议您使用<a href="https://docs.python.org/2/library/sets.html" rel="nofollow noreferrer">sets</a>而不是列表。你知道吗</p>
<p>查看API文档,您将取决于它提供的条目的格式。你知道吗</p>