我的密码是-
df=pd.read_csv("file")
l1=[]
l2=[]
for i, row in df.iterrows():
l1.append(' '.join((row['house'], row['district'])))
l2.append(({"entities": [[(ele.start(), ele.end() - 1) for ele in re.finditer(r'\S+', df['house'][i])] ,df['label1'][i]],}))
TRAIN_DATA=list(zip(l1,l2))
print(TRAIN_DATA)
CSV文件-
house label1 district label2
House 263 holding_number dhaka district
House 31 holding_number comilla district
我得到的输出是[('House 263 dhaka', {'entities': [[(0, 4), (6, 8)], 'holding_number']}),
我的预期产出-
[('House 263 dhaka', {'entities': [[(0, 8)], 'holding_number'], [[10,14], 'district']})
基本上,我如何在这里获得第3列的索引并将其附加到label2中?此外,我还想获得第1列(House 263)中整个字符串的索引
目前没有回答
相关问题 更多 >
编程相关推荐