PCA analysis with python-pandas on many columns

Posted 2024-06-01 06:22:38

Male | A programmer who enjoys writing Python code.

I have a .vcf file in which:

column1 = chrom
column2 = pos
column3 = ID
column4 = reference
column5 = Alt
column6 = qual
column7 = filter
column8 = info
column9 = format    
column 10 onward = 100 columns, each holding either zero or one

I read the file in with:

^{pr2}$

And then this, which I shouldn't be using:

for line in vcf:
    fields=line.strip().split()
    A01=fields[9]
    A02=fields[10]
    A03=fields[11]

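But this would take far too long, because I want to keep all the zeros and ones so that I can later run a PCA (principal component analysis) on them in Python. I'd like to use pandas, but I don't know how to do that with so many columns.

Tags: file pos id fields line alt vcf reference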

1 answer

User

#1 · Posted 2024-06-01 06:22:38

pandas does not implement a PCA algorithm.

Use scikit-learn instead.

from sklearn.decomposition import PCA
pca = PCA(n_components=5)
pca.fit(df)
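
The `df` passed to `pca.fit` is not defined in the snippet above. One way it might be built with pandas is sketched below. This is only an illustration under several assumptions: the file is tab-separated, it has a `#CHROM ...` header line after the `##` meta lines, and each sample column holds a bare 0 or 1. The helper name `load_genotypes` and the tiny inline data are made up for the example.

```python
import io

import pandas as pd

# Tiny made-up VCF-like text standing in for the real file (hypothetical data).
vcf_text = (
    "##fileformat=VCFv4.2\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tA01\tA02\tA03\n"
    "1\t100\trs1\tA\tG\t50\tPASS\t.\tGT\t0\t1\t0\n"
    "1\t200\trs2\tC\tT\t60\tPASS\t.\tGT\t1\t1\t0\n"
    "2\t300\trs3\tG\tA\t70\tPASS\t.\tGT\t0\t0\t1\n"
)


def load_genotypes(handle):
    """Return a samples x variants DataFrame of 0/1 values (hypothetical helper)."""
    # Drop the "##" meta lines but keep the "#CHROM ..." header line.
    lines = [ln for ln in handle if not ln.startswith("##")]
    table = pd.read_csv(io.StringIO("".join(lines)), sep="\t")
    # The first nine columns are the fixed VCF fields; the rest are samples.
    samples = table.iloc[:, 9:].astype("int8")
    samples.index = table["ID"]
    # Transpose so each row is a sample and each column is a variant.
    return samples.T


df = load_genotypes(io.StringIO(vcf_text))
print(df.shape)
```

With a real file, `open("your_file.vcf")` would take the place of the `io.StringIO` object. Whether samples or variants should be the rows depends on what the PCA is meant to compare; the transpose here assumes samples are the observations.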

Then access the components like this:

^{pr2}$
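
In scikit-learn, a fitted `PCA` object exposes its principal axes through the `components_` attribute and the share of variance per component through `explained_variance_ratio_`. A minimal sketch, assuming random 0/1 data as a stand-in for the real sample columns (the data, shape, and column names are invented for illustration):

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)

# Hypothetical stand-in data: 20 rows x 100 binary columns.
df = pd.DataFrame(
    rng.integers(0, 2, size=(20, 100)),
    columns=[f"v{i}" for i in range(100)],
)

pca = PCA(n_components=5)
scores = pca.fit_transform(df)  # coordinates of each row on the 5 components

components = pca.components_               # one row per component, one column per input column
explained = pca.explained_variance_ratio_  # fraction of variance captured by each component

print(components.shape, scores.shape)
```

`scores` is usually what gets plotted (for example, the first component against the second), while `components` shows how strongly each original column contributes to each axis.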
