求d子集上下几行的有效方法

buyindex = list(data2[data2['buy'] == True].index) print buyindex [71, 102, 103, 179, 505, 506, 607] buyindex1 = map(lambda x: x + 1, buyindex) buyindex2 = map(lambda x: x - 1, buyindex) buyindex3 = map(lambda x: x - 2, buyindex) buyindex4 = map(lambda x: x + 2, buyindex) buyindex.extend(buyindex1) buyindex.extend(buyindex2) buyindex.extend(buyindex3) buyindex.extend(buyindex4) buyindex.sort() data2.iloc[buyindex]

def get_test_index(df, column, numbers): """ builds an test index based on a range of numbers above and below the a specific index you want. df = dataframe to build off of column = the column that is important to you. for instance, 'buy', or 'sell' numbers = how many above and below you want of the important index """ idx_l = list(df[df[column] == True].index) for i in range(numbers)[1:]: idxpos = data2[column].shift(i).fillna(False) idxpos = list(df[idxpos].index) idx_l.extend(idxpos) idxneg = data2[column].shift(-i).fillna(False) idxneg = list(df[idxneg].index) idx_l.extend(idxneg) #print idx_l return sorted(idx_l)

2条回答

网友

1楼 · 编辑于 2024-09-29 19:25:59

这将是一个非常有效的方法

In [39]: df = DataFrame(np.random.randn(10,2))

In [41]: start=3

In [42]: stop=4

In [43]: df.iloc[(max(df.index.get_loc(start)-2,0)):min(df.index.get_loc(stop)+2,len(df))]
Out[43]: 
          0         1
1  0.348326  1.413770
2  1.898784  0.053780
3  0.825941 -1.986920
4  0.075956 -0.324657
5 -2.736800 -0.075813

[5 rows x 2 columns]

如果你想要一个任意索引器的函数，只需创建一个列表您想要的并传递给.iloc

^{pr2}$

你可能想要独一无二的

f = lambda i: [ i-2, i-1, i, i+1, i+2 ]

In [21]: indexers = Index(list(chain(*[ f(i) for i in [71, 102, 103, 179, 505, 506, 607] ]))).unique()

In [22]: df.iloc[indexers]
Out[22]: 
            0         1
69   0.792996  0.264597
70   1.084315 -0.620006
71  -0.030432  1.219576
72  -0.767855  0.765041
73  -0.637771 -0.103378
100 -1.087505  1.698133
101  1.007143  2.594046
102 -0.307440  0.308360
103  0.944429 -0.411742
104  1.332445 -0.149350
105  0.165213  1.125668
177  0.409580 -0.375709
178 -1.757021 -0.266762
179  0.736809 -1.286848
180  1.856241  0.176931
181 -0.492590  0.083519
503 -0.651788  0.717922
504 -1.612517 -1.729867
505 -1.786807 -0.066421
506  1.423571  0.768161
507  0.186871  1.162447
508  1.233441 -0.028261
605 -0.060117 -1.459827
606 -0.541765 -0.350981
607 -1.166172 -0.026404
608 -0.045338  1.641864
609 -0.337748  0.955940

[27 rows x 2 columns]

网友

2楼 · 编辑于 2024-09-29 19:25:59

您可以使用shift和|运算符；例如，对于+/-2天，您可以这样做

idx = (data2['buy'] == True).fillna(False)
idx |= idx.shift(-1) | idx.shift(-2)   # one & two days after
idx |= idx.shift(1) | idx.shift(2)     # one & two days before
data2[ idx ] # this is what you need

相关问题更多 >

编程相关推荐

热门问题

热门文章