Applying a function under a group == condition

   position_latitude  position_longitude geohash
0          53.398940           10.069293      u1
1          53.408875           10.052669      u1
2          48.856350            9.171759      u0
3          48.856068            9.170798      u0
4          48.856350            9.171759      u0

groupHashed = geoSub.groupby('geohash')
geoSub['distance'] = np.nan
for name, group in groupHashed:
    G = osmnx.graph.graph_from_xml('geohash/'+name+'.osm', simplify=True, retain_all=False)
    geoSub['distance'] = geoSub.apply(lambda x: getDistanceToEdge(x.position_latitude,x.position_longitude, G) if x.geohash == name, axis=1)

1 answer

User

#1 · Posted on 2024-09-28 20:43:31

You can use transform.

I am stubbing out G and getDistanceToEdge (as x + y + geohash[-1]) so that I can show a working example.

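import pandas as pd
from io import StringIO

# Sample data from the question
data = StringIO("""
,position_latitude,position_longitude,geohash
0,53.398940,10.069293,u1
1,53.408875,10.052669,u1
2,48.856350,9.171759,u0
3,48.856068,9.170798,u0
4,48.856350,9.171759,u0
""")
df = pd.read_csv(data, index_col=0).fillna('')

# Stub: the real function would measure the distance to an edge of graph G
def getDistanceToEdge(x, y, G):
    return x + y + G

def fun(pos):
    # Stub for G: the last character of the group's geohash, as an int.
    # pos.values[0] is the first (lat, lon, geohash) tuple in the group.
    G = int(pos.values[0][-1][-1])
    return pos.apply(lambda x: getDistanceToEdge(x[0], x[1], G))

# Pack lat, lon and geohash into one column so transform sees all three
df['pos'] = list(zip(df['position_latitude'], df['position_longitude'], df['geohash']))
df['distance'] = df.groupby(['geohash'])['pos'].transform(fun)
df = df.drop(['pos'], axis=1)

print(df)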

Output:

   position_latitude  position_longitude geohash   distance
0          53.398940           10.069293      u1  64.468233
1          53.408875           10.052669      u1  64.461544
2          48.856350            9.171759      u0  58.028109
3          48.856068            9.170798      u0  58.026866
4          48.856350            9.171759      u0  58.028109

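As you can see, inside the function fun you can get the group's name with pos.values[0][-1]. This works because we deliberately built the pos column as (lat, lon, geohash) tuples, and after the groupby every row in a group has the same geohash. So for any group, the geohash is the last value of any row's tuple (pos). pos.values[0][-1] gives the last value of the first row's tuple.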
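If you would rather keep the explicit loop from the question, one possible alternative is to write each group's result back with df.loc, so only the rows of the current group are touched. This is a minimal sketch, not the asker's real code: load_graph_stub and get_distance_stub are hypothetical stand-ins for the osmnx graph loading and getDistanceToEdge, using the same x + y + geohash[-1] stub as above.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "position_latitude": [53.398940, 53.408875, 48.856350, 48.856068, 48.856350],
    "position_longitude": [10.069293, 10.052669, 9.171759, 9.170798, 9.171759],
    "geohash": ["u1", "u1", "u0", "u0", "u0"],
})

def load_graph_stub(name):
    # Hypothetical stand-in for loading the graph of one geohash:
    # returns the last character of the geohash as an int.
    return int(name[-1])

def get_distance_stub(lat, lon, G):
    # Hypothetical stand-in for getDistanceToEdge.
    return lat + lon + G

df["distance"] = np.nan
for name, group in df.groupby("geohash"):
    G = load_graph_stub(name)  # built once per group
    # Compute distances for this group's rows only, then assign them
    # back by index so the other groups' values are left untouched.
    df.loc[group.index, "distance"] = group.apply(
        lambda row: get_distance_stub(row.position_latitude, row.position_longitude, G),
        axis=1,
    )

print(df)
```

Because the assignment is restricted to group.index, each pass through the loop fills in one group and leaves the rest of the distance column alone, and the group's name is available directly as the loop variable.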
