<p>The answer given by @rusu_ro1 is correct. However, I think <a href="https://pandas.pydata.org/" rel="nofollow noreferrer"><em>Pandas</em></a> is the right tool for the job.</p>
<p>You can use <a href="https://pandas.pydata.org/pandas-docs/version/0.23/generated/pandas.read_html.html" rel="nofollow noreferrer"><em>pandas.read_html</em></a> to get all the tables on the page, then use <em><a href="https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_excel.html" rel="nofollow noreferrer">pandas.DataFrame.to_excel</a></em> to write only the last 4 tables to an Excel workbook.</p>
<p>The script below scrapes the data and writes each table to a separate worksheet.</p>
<pre><code>import pandas as pd
all_tables = pd.read_html(
    "https://www.proff.no/regnskap/yara-international-asa/oslo/hovedkontortjenester/IGB6AV410NZ/"
)
with pd.ExcelWriter("output.xlsx") as writer:
    # The last 4 tables hold the 'konsernregnskap' data
    for idx, df in enumerate(all_tables[4:8]):
        # Remove the last column (empty)
        df = df.drop(df.columns[-1], axis=1)
        # Write each table to its own worksheet
        df.to_excel(writer, sheet_name="Table {}".format(idx))
</code></pre>
<p><strong>Notes:</strong></p>
<ul>
<li>You can also <em><a href="https://stackoverflow.com/questions/32957441/putting-many-python-pandas-dataframes-to-one-excel-worksheet">write all the DataFrames to a single sheet</a></em>.</li>
<li>Make sure the <em>lxml</em> library is installed: <code>pip install lxml</code></li>
</ul>
<blockquote>
<p>flavor : str or None, container of strings</p>
<p>The parsing engine to use. ‘bs4’ and ‘html5lib’ are synonymous with
each other, they are both there for backwards compatibility. The
default of None tries to use lxml to parse and if that fails it falls
back on bs4 + html5lib.</p>
</blockquote>
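<p>For the single-sheet option mentioned in the first note, one possible approach is to stack the tables vertically using the <code>startrow</code> argument of <code>to_excel</code>. This is only a sketch, not part of the original script: the helper names <code>stacked_start_rows</code> and <code>write_stacked</code>, the sheet name and the 2-row gap are all assumptions made for illustration.</p>

```python
import pandas as pd


def stacked_start_rows(frames, gap=2):
    """Return the first spreadsheet row for each frame when stacked vertically."""
    rows, next_row = [], 0
    for df in frames:
        rows.append(next_row)
        # Data rows + one header row per column level + blank rows as spacing
        next_row += len(df) + df.columns.nlevels + gap
    return rows


def write_stacked(frames, path, sheet_name="All tables", gap=2):
    """Write every frame to the same worksheet, one below the other."""
    with pd.ExcelWriter(path) as writer:
        for df, startrow in zip(frames, stacked_start_rows(frames, gap)):
            df.to_excel(writer, sheet_name=sheet_name, startrow=startrow)
```

<p>Something like <code>write_stacked(all_tables[4:8], "output.xlsx")</code> could then stand in for the <code>with</code> block in the script; dropping the empty last column would still need to be done beforehand.</p>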
<p>From <a href="https://pandas.pydata.org/pandas-docs/version/0.23/io.html#io-html-gotchas" rel="nofollow noreferrer"><em>HTML Table Parsing Gotchas</em></a>:</p>
<blockquote>
<p>html5lib generates valid HTML5 markup from invalid markup
automatically. This is extremely important for parsing HTML tables,
since it guarantees a valid document. However, that does NOT mean that
it is “correct”, since the process of fixing markup does not have a
single definition.</p>
</blockquote>
<p>In your particular case, the html5lib fallback drops the 5th table (only 7 are returned), perhaps because the 1st and 5th tables contain the same data.</p>
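<p>Because a missing table would make a hard-coded slice silently pick the wrong data, it may be worth forcing the parser and checking the table count before slicing. The following is a minimal sketch under assumptions: the function names are hypothetical, and the expected total of 8 is inferred from the slice <code>[4:8]</code> and the 7-table result described above.</p>

```python
import pandas as pd


def pick_last_tables(tables, count=4, expected_total=8):
    """Return the last `count` tables, refusing to guess if the total is off."""
    if len(tables) != expected_total:
        raise ValueError(
            "expected {} tables, got {}; a different parser may have "
            "dropped one".format(expected_total, len(tables))
        )
    return tables[-count:]


def read_konsern_tables(url):
    """Parse the page with lxml only (no bs4 + html5lib fallback)."""
    # With a single explicit flavor, read_html raises if lxml is missing or
    # fails, instead of quietly switching to another parser
    all_tables = pd.read_html(url, flavor="lxml")
    return pick_last_tables(all_tables)
```

<p>With this in place, a parser mismatch surfaces as an explicit error rather than as a workbook containing the wrong tables.</p>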