<p>您可以尝试:</p>
<pre class="lang-py prettyprint-override"><code>
data = """2.000 Company A 8.876 0,02
248 Enterprise B 26.028 0,07
193
dasdasdasd (asasas) sdasdasd
adsadsd asdasd asasa asassaas asas
asas asas 31. January 2018 (continue)
asdasd – 99,00% (31. March 2017 – 99,98%) (continue)
amasdasd asas
asasas asas
asas asssssssssss
DDD
asdasdads in %
asdasd adasd asddasad
(continue)
415 Company C Ltd. 21.412 0,06
668 Enterprise D AG 17.332 0,05
1.240 Company E GmbH 31.394 0,09
798 Enterprise OHG 52.586 0,14"""
reader = StringIO(data)
pattern = re.compile(r'([\d\.\,]+)\s+(\D*)([\d\.\,]+)\s([\d\.\,]+)$')
rows = []
for row in reader:
match = pattern.search(row)
if match:
rows.append([match.group(1), match.group(2), match.group(3), match.group(4)])
df = pd.DataFrame(rows, columns=["Column 1", "Column 2", "Column 3", "Column 4"])
</code></pre>
<hr/>
<p><strong>输出</strong></p>
<pre class="lang-py prettyprint-override"><code>Column 1 Column 2 Column 3 Column 4
0 2.000 Company A 8.876 0,02
1 248 Enterprise B 26.028 0,07
2 415 Company C Ltd. 21.412 0,06
3 668 Enterprise D AG 17.332 0,05
4 1.240 Company E GmbH 31.394 0,09
5 798 Enterprise OHG 52.586 0,14
</code></pre>