<p>You can do this with Python and pandas:</p>
<pre><code>In [1]: import pandas as pd
In [2]: df = pd.read_table("file", header=None, sep="\t", names=["A", "B", "C", "D"], index_col=0)
In [3]: df
Out[3]:
          A                           B                              C  \
0  19801464        Animals Biodiversity  Computational Biology/methods
1  19696045  Environmental Microbiology  Computational Biology/methods

          D
0       DNA
1  Software
In [4]: b = pd.get_dummies(df.B)  # one indicator column per distinct value in B
In [5]: c = pd.get_dummies(df.C)
In [6]: d = pd.get_dummies(df.D)
In [7]: presence_absence = b.merge(c, right_index=True, left_index=True).merge(d, right_index=True, left_index=True)
In [8]: presence_absence
Out[8]:
          Animals Biodiversity  Environmental Microbiology  \
A
19801464                     1                           0
19696045                     0                           1

          Computational Biology/methods  DNA  Software
A
19801464                              1    1         0
19696045                              1    0         1
</code></pre>
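<p>If you would rather run this as a script, here is a minimal sketch of the same idea. It makes a few assumptions that are not in the session above: the input is replaced by a hypothetical in-memory string built from the two rows shown, the chained <code>merge</code> calls are swapped for a single <code>pd.concat</code> along the columns, and <code>dtype=int</code> is passed because newer pandas releases return booleans from <code>get_dummies</code> by default.</p>

```python
import io

import pandas as pd

# Hypothetical stand-in for the tab-separated input file,
# built from the two rows shown in the session above.
raw = (
    "19801464\tAnimals Biodiversity\tComputational Biology/methods\tDNA\n"
    "19696045\tEnvironmental Microbiology\tComputational Biology/methods\tSoftware\n"
)

# Same column names as above; column A becomes the index.
df = pd.read_csv(
    io.StringIO(raw),
    sep="\t",
    header=None,
    names=["A", "B", "C", "D"],
    index_col=0,
)

# Build one 0/1 indicator column per distinct value in each of B, C and D,
# then place the three indicator tables side by side. They share the index A,
# so concatenating along the columns keeps the rows aligned.
presence_absence = pd.concat(
    [pd.get_dummies(df[col], dtype=int) for col in df.columns],
    axis=1,
)

print(presence_absence)
```

<p>Note that if the same term can appear in more than one of the columns, this (like the merge version) will produce duplicate column names, which you would then need to combine yourself.</p>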
<p>Hope this helps.</p>