Finding a matching string in a Pandas DataFrame, starting from a specific index

index  col0  col3
500    data  " initial string1"
..     ..    ..
600    data  "xyz"
...    ...   ...
1343   data  "intial string1"
..     ..    ..
1443   data  "xyz"
...    ...   ...
2432   data  "intial string2"
..     ..    ..
2453   data  "xyz"
..     ..    ..
2467   data  "intial string2"
..     ..    ..
2487   data  "xyz"

3 answers

User

#1 · edited 2024-09-28 23:06:00

Why not just search for the xyz string directly?

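import pandas as pd

df = pd.DataFrame({"col1": ['data', 'data', 'data', 'data', 'data', 'data', 'data'],
                   'col3': ['initial string', 'something', 'xyz',
                            'initial string', 'xyz', 'nothing', 'xyz']})

# Index labels of the rows whose col3 value starts with 'xyz'
# (str.match anchors the pattern at the start of the string)
df[df.col3.str.match('xyz')].index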

If there are several different strings to look for, just use the .isin method.

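The original snippet for this step did not survive, so the following is only a minimal sketch of what an .isin lookup could look like. The `targets` list is a hypothetical set of strings chosen for illustration; note that `isin` tests for exact equality, unlike the pattern match used with `str.match`.

```python
import pandas as pd

df = pd.DataFrame({
    "col1": ["data"] * 7,
    "col3": ["initial string", "something", "xyz",
             "initial string", "xyz", "nothing", "xyz"],
})

# Hypothetical list of strings to look for; only exact matches count.
targets = ["xyz", "nothing"]

# isin builds a boolean mask that is True where col3 equals any target.
matches = df[df["col3"].isin(targets)].index.tolist()
print(matches)  # [2, 4, 5, 6]
```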

User

#2 · edited 2024-09-28 23:06:00

How about something like this:

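indices_initial = [500, 1343, 2432, 5433, 7533]
indices_xyz = []

# Walk consecutive pairs of start indices and record the first 'xyz' in each slice.
# Note: idxmax returns the first label of the slice if no 'xyz' is present.
for i, j in zip(indices_initial, indices_initial[1:]):
    indices_xyz.append(df.loc[i:j, 'col3'].eq('xyz').idxmax())

df.loc[indices_xyz]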

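If the starting indices are not known in advance, the same idea can probably be expressed without an explicit loop. The sketch below is not part of the answer above; it assumes that every block opens with a row whose col3 begins with "initial" (the sample data here is made up for illustration), numbers the blocks with a cumulative sum, and keeps the first xyz row in each block. Blocks that contain no xyz row are simply skipped.

```python
import pandas as pd

# Made-up sample data in the shape of the question's table.
df = pd.DataFrame(
    {"col0": ["data"] * 8,
     "col3": ["initial string1", "xyz", "initial string1", "xyz",
              "initial string2", "xyz", "xyz", "initial string2"]},
    index=[500, 600, 1343, 1443, 2432, 2453, 2467, 2487],
)

# Each "initial" row opens a new block; cumsum gives every row its block number.
is_start = df["col3"].str.startswith("initial")
block = is_start.cumsum()

# Keep only the xyz rows, then take the first one within each block.
is_xyz = df["col3"].eq("xyz")
first_xyz = df[is_xyz].groupby(block[is_xyz]).head(1)
print(first_xyz.index.tolist())  # [600, 1443, 2453]
```

The last block (starting at 2487) has no xyz row after it, so it contributes nothing to the result rather than producing a misleading label.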

User

#3 · edited 2024-09-28 23:06:00

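import numpy as np
import pandas as pd

# Setting up input data
df = pd.DataFrame(np.random.rand(12500, 2), columns=['col0', 'col1'])
df['col1'] = df['col1'].astype(object)  # let the column hold strings as well as floats
for i in [0, 500, 1343, 2432, 5433, 7533]:
    df.loc[i, 'col1'] = 'init string'
for i in range(1, 12000, 100):
    df.loc[i, 'col1'] = 'xyz'

# Hopefully a solution to your question:
# for each pair of consecutive 'init string' rows, keep the first 'xyz' row between them.
# DataFrame.append was removed in pandas 2.0, so collect the pieces and concatenate once.
init_idx = df[df.col1 == 'init string'].index
pieces = []
for init_index, next_init_index in zip(init_idx, init_idx[1:]):
    pieces.append(df.query('index > @init_index & index < @next_init_index'
                           ' & col1 == "xyz"').head(1))
search_results = pd.concat(pieces)
search_results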
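When the index is sorted, a binary search may be a cheaper way to find the first xyz row after each starting index than filtering the frame once per start. The following is only a sketch under that assumption, with made-up data and a hypothetical `start_labels` array. Unlike the loop above, it does not stop at the next 'init string' row; it just returns the first xyz label that follows each start.

```python
import numpy as np
import pandas as pd

# Made-up sample data; the index must be sorted for searchsorted to be valid.
df = pd.DataFrame(
    {"col3": ["init string", "xyz", "other", "init string", "other", "xyz", "xyz"]},
    index=[500, 600, 700, 1343, 1400, 1443, 1500],
)

start_labels = np.array([500, 1343])  # hypothetical starting index labels
xyz_labels = df.index[df["col3"].eq("xyz")].to_numpy()

# side="right" locates the first xyz label strictly greater than each start.
pos = np.searchsorted(xyz_labels, start_labels, side="right")
found = pos < len(xyz_labels)  # False where no xyz row follows a start

first_after = dict(zip(start_labels[found].tolist(),
                       xyz_labels[pos[found]].tolist()))
print(first_after)  # {500: 600, 1343: 1443}
```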
