擅长:python、mysql、java
<p>此脚本将创建包含<code>time</code>和<code>date</code>列的数据帧:</p>
<pre><code>import pandas as pd
from bs4 import BeautifulSoup
html_string = '''
<time class="published-date relative-date" data-published-date="2020-07-21T18:49:14Z" datetime="2020-07-21T18:49:14Z"></time>
'''
soup = BeautifulSoup(html_string, 'html.parser')
all_data = []
for t in soup.select('time.published-date.relative-date'):
all_data.append(t.get('data-published-date'))
df = pd.DataFrame(all_data)
df[0] = pd.to_datetime(df[0])
df['date'] = df[0].dt.date
df['time'] = df[0].dt.time
print(df)
</code></pre>
<p>印刷品:</p>
<pre><code> 0 date time
0 2020-07-21 18:49:14+00:00 2020-07-21 18:49:14
</code></pre>