Filter a pandas DataFrame by cell type

  access                                           geometry           highway
0    NaN  LINESTRING (-10817.60510122531 6680340.0880667...           footway
1     no  LINESTRING (-11843.46986863073 6678698.1663396...           footway
2     no  LINESTRING (-11843.46986863073 6678698.1663396...  [footway, steps]
3     no  LINESTRING (-11843.46986863073 6678698.1663396...           footway
4    NaN  LINESTRING (-9727.497855683101 6679963.0804682...      unclassified

2 answers

User

Answer 1 · edited 2024-09-28 05:19:53

You can convert the column to str with astype and then call duplicated. Using the data from @chrisz:

df[~df.type.astype(str).duplicated(keep='first')]
Out[75]: 
              type
0  [highway, road]
1          highway
2    [road, other]
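The same approach can be sketched end to end as follows. The sample frame is hypothetical (it is not @chrisz's original data, which is not shown here) and the names `as_text` and `deduped` are only for illustration. The idea is that lists are unhashable, so the comparison is done on their string representation instead.

```python
import pandas as pd

# Hypothetical sample data: a column mixing plain strings and lists.
df = pd.DataFrame({
    "type": [
        ["highway", "road"],
        "highway",
        ["road", "other"],
        "highway",
        ["highway", "road"],
    ]
})

# Lists cannot be hashed, so compare the string form of each cell instead.
as_text = df["type"].astype(str)

# Keep only the first row for each distinct string representation.
deduped = df[~as_text.duplicated(keep="first")]
print(deduped)
```

Note that the original values are preserved in the result; the string conversion is used only to build the boolean mask.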

User

Answer 2 · edited 2024-09-28 05:19:53

You can use apply() to convert only the lists to tuples, leave everything else unchanged, and then call unique():

In [15]: df = pd.DataFrame({'highway': ['footway', 'footway', ['footway', 'steps'], 'footway', 'unclassified']})

In [16]: df['highway'].apply(lambda x: tuple(x) if isinstance(x, list) else x).unique()
Out[16]: array(['footway', ('footway', 'steps'), 'unclassified'], dtype=object)
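If the goal is to filter the DataFrame itself rather than list the unique values, the tuple conversion can also feed a boolean mask. The following is a minimal sketch under that assumption; the helper `to_hashable` and the variables `unique_rows` and `list_rows` are hypothetical names, not part of the answer above.

```python
import pandas as pd

df = pd.DataFrame({
    "highway": ["footway", "footway", ["footway", "steps"], "footway", "unclassified"]
})


def to_hashable(value):
    """Turn lists into tuples and leave every other value unchanged."""
    return tuple(value) if isinstance(value, list) else value


hashable = df["highway"].apply(to_hashable)

# Rows where a value appears for the first time.
unique_rows = df[~hashable.duplicated(keep="first")]

# Rows whose cell holds a list rather than a plain string.
list_rows = df[df["highway"].apply(lambda v: isinstance(v, list))]

print(unique_rows)
print(list_rows)
```

Because the mask is computed on a separate Series, the `highway` column in the filtered frames still contains the original lists.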

If you apply tuple() to every value in the column, each string is also converted, into a tuple of its individual characters.
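A short illustration of why the isinstance check matters: tuple() iterates over its argument, and iterating over a string yields one character at a time. The variable names here are only for illustration.

```python
# A string is iterable, so tuple() splits it into single characters.
from_string = tuple("footway")

# A list of strings becomes a tuple of those same strings.
from_list = tuple(["footway", "steps"])

print(from_string)  # ('f', 'o', 'o', 't', 'w', 'a', 'y')
print(from_list)    # ('footway', 'steps')
```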
