擅长:python、mysql、java
<p>看起来像是<a href="https://docs.python.org/3.8/library/re.html?#module-re" rel="nofollow noreferrer">regular expressions</a>的工作!您可以使用它来匹配字符串中的模式。在本例中,所有数据都发生在<code></span></code>标记之后,后跟换行和缩进。因此,如果我们这样匹配该模式:</p>
<pre class="lang-py prettyprint-override"><code>import re
your_data=[] # Initialize the list so we can access it outside scope of with
with open('your_file.html','r') as f:
your_code = f.read()
your_data = re.findall('</span>\n +(.+)',your_code)
print(your_data)
</code></pre>
<p>我们可以得到输出<code>['0004', 'March 2020', '$300,950', '2161 sq.ft.', '2', '3', '2.5', '2']</code></p>