根据每个时间序列的条件确定数据帧中的时间点

2条回答

网友

1楼 · 编辑于 2024-10-01 07:12:23

您可以首先创建一个掩码ma，并将最小值之前的所有行值设置为False。接下来，使用此掩码查找最小值后每行中的值，以达到最小值的4倍（由True表示）：

>>> ma = df.values.argmin(axis=1)[:,None] <= np.arange(df.shape[1])
>>> df.ge(4*df.min(axis=1), axis=0) & ma
         TP1    TP2    TP3   TP4   TP5   TP6
gene1  False  False  False  True  True  True
gene2  False  False   True  True  True  True

然后可以使用idxmax从这个布尔数据帧（我称之为df1）检索第一个True值的标签：

>>> df1.idxmax(axis=1)
gene1    TP4
gene2    TP3
dtype: object

网友
2楼 · 编辑于 2024-10-01 07:12:23

这里有一个方法：
df =pd.DataFrame({'TP1':[.4,.3],'TP2':[.2,.05],'TP3':[.1,.5],'TP4':[.5,.8],'TP5':[.8,1.0], 'TP6':[1.9,1.7]},index= ['gene1','gene2']) def f(x): #get min value and index min_ind = [ e for e in enumerate(x) if e[1] == x.min()] #return only the first value that is greater than the index of the min value and > min value *4 r =df.columns[[e[0] for e in enumerate(x) if e[1] if e[1] > min_ind[0][1]*4 and e[0]> min_ind[0][0]][0]] return r
退货：
df.apply(f, axis=1) gene1 TP4 gene2 TP3 dtype: object

相关问题更多 >

编程相关推荐

热门问题

热门文章