<ul>
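<li>Iterate over the files</li>
<li>Clean the dataframe
<ul>
<li>None of the data cleaning steps were changed, so this answer assumes <code>data</code> is a complete dataframe once those steps have run</li>
</ul>
</li>
<li>Append <code>data</code> to a list of dataframes</li>
<li>Finally, use <a href="https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.concat.html" rel="nofollow noreferrer"><code>pandas.concat</code></a> to combine the list of dataframes into a single dataframe</li>
<li>Use <a href="https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_csv.html" rel="nofollow noreferrer"><code>pandas.DataFrame.to_csv</code></a> to write the dataframe to a file</li>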
</ul>
<pre class="lang-py prettyprint-override"><code>import glob
import os

import pandas as pd

path = 'path/.../*.txt'
files = glob.glob(path)

df_list = []  # one cleaned dataframe per input file
for fname in files:
    # Assumption: shortname is not defined in this snippet; here it is
    # taken to be the file name without its extension.
    shortname = os.path.splitext(os.path.basename(fname))[0]

    # Cleaning steps, unchanged from the question
    data = pd.read_csv(fname, header=None, sep=' ')
    data = data.drop([1, 2, 3, 4], axis=1)
    data = data.drop_duplicates()
    data = data.reset_index(drop=True)
    data = data.T
    data.insert(0, 'name', shortname, True)
    data.columns = data.iloc[0]
    print(data)

    df_list.append(data)

# Combine the per-file dataframes and write the result to a single file
df = pd.concat(df_list)
df.to_csv('C:/Users/yl0340/Desktop/clean/clean_files.txt', index=False)
</code></pre>
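<p>To try the append-then-concatenate pattern without any files on disk, here is a minimal sketch. The helper <code>combine_frames</code>, the column names, and the sample values are all made up for illustration. Passing <code>ignore_index=True</code> is an optional choice that renumbers the rows; the answer above calls <code>pd.concat</code> without it.</p>

```python
import pandas as pd


def combine_frames(frames):
    """Concatenate a list of dataframes into one, renumbering the rows."""
    # ignore_index=True is an illustrative choice, not part of the answer above
    return pd.concat(frames, ignore_index=True)


# Hypothetical stand-ins for the per-file dataframes built in the loop
frames = []
for name, values in [("a", [1, 2]), ("b", [3, 4])]:
    frame = pd.DataFrame({"name": name, "value": values})
    frames.append(frame)

combined = combine_frames(frames)
print(combined)
```

<p>Collecting the frames in a list and calling <code>pd.concat</code> once copies the data a single time, whereas concatenating inside the loop would copy the growing result on every iteration.</p>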