我有这样一个数据集:
df = pd.DataFrame({'scientist':["Wendelaar Bonga"," Sjoerd E.", "Grätzel"," Michael", "Willett", "Walter C.",
"Kessler", "Ronald C.", "Witten, Edward", "Wang, Zhong Lin"],
'SubjectField': ["Biomedical Engineering", "Inorganic & Nuclear Chemistry",
"Organic Chemistry", "Biomedical Engineering", "Developmental Biology",
"Mechanical Engineering & Transports", "Biomedical Engineering", "Microbiology",
"Cardiovascular System & Hematology", "Biomedical Engineering"]})
我想计算每个学科领域的科学家数量,并提取出拥有两名以上科学家的学科领域。这是我计算科学家人数的代码
number_of_scientists_in_fields=data.groupby(['SubjectField'])['scientist'].count()
如何提取包含两名以上科学家的主题字段
使用value_counts,如下所示:
输出
您只需要创建一个
Series
,然后使用> 2
对其进行过滤:使用^{} :
另一种可能不如Dani的好的方法是:
但是,此示例将包括2个或更多(不是>;2)
相关问题 更多 >
编程相关推荐