<p>I assume your text file is named <code>myfile.txt</code>.</p>
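<pre class="lang-py prettyprint-override"><code>import pandas as pd

# Keep the first four whitespace-separated fields and the last one from each line
text = []
with open("myfile.txt") as my_file:
    for line in my_file:
        fields = line.split()
        if fields:  # skip blank lines, which would otherwise raise IndexError
            text.append(fields[0:4] + [fields[-1]])

df = pd.DataFrame(text)
# next line is optional, just if you want named columns
df.columns = ['nameOfGene', 'sampleNumber', 'Allel1', 'Allel2', 'descr']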
print(df)
</code></pre>
<p>Output:</p>
<pre><code>nameOfGene sampleNumber Allel1 Allel2 descr
0 geneName1, sampleName1, Allel, allel2, tumor
1 geneName2, sampleName1, Allel, allel2, ntn
2 geneName3, sampleName1, Allel, allel2, ddd
3 geneName4, sampleName1, Allel, allel2, aga2
4 geneName1, sampleName2, Allel, allel2, tumorD
5 geneName2, sampleName2, Allel, allel2, ad2
6 geneName1, sampleName3, Allel, allel2, yyu
7 geneName2, sampleName3, Allel, allel2, hhf
8 geneName4, sampleName3, Allel, allel2, vgv
9 geneName5, sampleName3, Allel, allel2, aga5
</code></pre>
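<p>As the printed output shows, splitting on whitespace leaves the trailing commas attached to each value. If you do not want them, here is a minimal sketch of one way to strip them, assuming every field is comma-terminated as in the output above. The helper name <code>parse_lines</code> and the sample rows are hypothetical, used only so the snippet runs without <code>myfile.txt</code>.</p>
<pre class="lang-py prettyprint-override"><code>
```python
import io
import pandas as pd

COLUMNS = ['nameOfGene', 'sampleNumber', 'Allel1', 'Allel2', 'descr']

def parse_lines(lines):
    """Build a DataFrame from the first four fields and the last field of each line."""
    rows = []
    for line in lines:
        # Split on whitespace, then drop the comma attached to each field
        fields = [f.strip(',') for f in line.split()]
        if not fields:  # skip blank lines
            continue
        rows.append(fields[0:4] + [fields[-1]])
    return pd.DataFrame(rows, columns=COLUMNS)

# Hypothetical sample standing in for myfile.txt (assumed layout, not the real file)
sample = io.StringIO(
    "geneName1, sampleName1, Allel, allel2, x, y, tumor\n"
    "\n"
    "geneName2, sampleName1, Allel, allel2, x, y, ntn\n"
)
df = parse_lines(sample)
print(df)
```
</code></pre>
<p>With the real file you would pass the open file object instead, e.g. <code>with open("myfile.txt") as f: df = parse_lines(f)</code>.</p>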