从csv查找python列表中的索引和组合标签

2024-06-26 03:00:05 发布

您现在位置:Python中文网/ 问答频道 /正文

我的密码是-

df=pd.read_csv("file")
l1=[]
l2=[]
for i, row in df.iterrows():
    
    l1.append(' '.join((row['house'], row['district'])))
    l2.append(({"entities": [[(ele.start(), ele.end() - 1) for ele in re.finditer(r'\S+', df['house'][i])] ,df['label1'][i]],}))
    
TRAIN_DATA=list(zip(l1,l2))
print(TRAIN_DATA)

CSV文件-

house       label1          district    label2 
House 263   holding_number  dhaka       district 
House 31    holding_number  comilla     district 

我得到的输出是[('House 263 dhaka', {'entities': [[(0, 4), (6, 8)], 'holding_number']}),

我的预期产出-

[('House 263 dhaka', {'entities': [[(0, 8)], 'holding_number'], [[10,14], 'district']})

基本上,我如何在这里获得第3列的索引并将其附加到label2中?此外,我还想获得第1列(House 263)中整个字符串的索引


Tags: inl1numberdfforhouserowentities