擅长:python、mysql、java
<p>在最上面一定要有</p>
<pre><code>import numpy as np
</code></pre>
<p>然后替换共享的代码</p>
<pre><code>reads_array = []
for x in Bio.SeqIO.parse("inp.fastq", "fastq"):
reads_array.append(x)
df = pd.DataFrame(np.zeros((len(reads_array), 2)), columns=["read_id", "score"])
for index, x in enumerate(reads_array):
alignments = pairwise2.align.globalms("ACTTGAT", str(x.seq), 2, -1, -.5, -.1)
sorted_alignments = sorted(alignments, key=operator.itemgetter(2), reverse=True)
read_id = x.name
score = sorted_alignments[0][2]
df.loc[index, 'read_id'] = read_id
df.loc[index, 'score'] = score
</code></pre>
<p>原始代码的主要问题有两个:</p>
<p>1)您的数据帧有0行</p>
<p>2)df['column\u name']指的是整个列,而不是单个单元格,因此当执行df['column\u name']=value时,该列中的所有单元格都被设置为该值</p>