编辑: @荣@shaik moeed 下面是生成部分数据帧的代码和我面临的问题:
temp = [[1, 'blblblblblb. The quaity of research was good. blblblblb'],
[2, 'blblblblblb. The quaity of research was average. blblblblb'],
[3, 'blblblblblb. The quaity of research was poor. blblblblb'],
[4, 'blblblblblb. The quaity of research was good. blblblblb']
]
Data = pd.DataFrame(temp,columns=['ID','Report'])
Data['Sentence']=Data['Report'].str.extract(r"([^.]*?The quaity of research was [^.]*\.)")
Quality_dic=dict([(1, 'excellent'), (2, 'good'), (3, 'average') , (4, 'poor'), (5, 'unassessable')])
Data['Quality']=[k for k,v in Quality_dic.items() if v in Data['Sentence'].str.split()]
不幸的是,建议的解决方案仍然不起作用。你知道吗
怎么解决这个问题有什么想法吗? 感谢大家的时间和投入
我已经创建了一个df作为您的数据,并完全按照您的要求实现。你知道吗
在
Quality_dic
中,Good
和Unassessable
具有相同的键。所以Good
将用Unassessable
重写。你知道吗现在试试
输出:
注意:
相关问题 更多 >
编程相关推荐