Need to match two columns across two different pandas DataFrames; if they match, we need to append a new d

x  y  z  keywords                          stockcode
a  b  c  [apple,iphone,watch,newdevice]    aapl
e  w  q  NaN                               null
w  r  t  [pixel,google,]                   ggle
s  t  q  [india,computer]                  null
d  j  o  [google,apple]                    aapl,ggle

df1['stockcode'] = np.nan
#mapping data
for indexKW, valueKW in df1.keyword.iteritems():
    for innerVal in valueKW.split():
        for indexName, valueName in df2['Name'].iteritems():
            for outerVal in valueName.split():
                if outerVal.lower() == innerVal.lower():
                    df1['stockcode'].loc[indexKW] = df2.Identifier.loc[indexName]

x  y  z  keywords                          stockcode
a  b  c  [apple,iphone,watch,newdevice]    aapl
e  w  q  NaN                               null
w  r  t  [pixel,google,]                   ggle
s  t  q  [india,computer]                  null
d  j  o  [google,apple]                    ggle

x  y  z  keywords                          stockcode
a  b  c  [apple,iphone,watch,newdevice]    aapl
e  w  q  NaN                               null
w  r  t  [pixel,google,]                   ggle
s  t  q  [india,computer]                  null
d  j  o  [google,apple]                    aapl,ggle

2 Answers

User

#1 · Edited 2024-06-30 08:04:03

You can combine apply and map with join:

df2.set_index('name',inplace=True)
df1.apply(lambda x: pd.Series(x['keywords']).map(df2['stockcode']).dropna().values,1)

0          [appl]
1              []
2          [ggle]
3              []
4    [ggle, appl]
dtype: object
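To see what the lambda does for a single row, here is a minimal sketch. The two-row df2 and the keyword list are assumed sample data that reuse the names from this thread. When Series.map is given another Series, it looks each value up in that Series's index, and values without a match come back as NaN, which dropna then discards.

```python
import pandas as pd

# Assumed sample lookup table; column names follow the answer above.
df2 = pd.DataFrame({'name': ['apple', 'google'],
                    'stockcode': ['appl', 'ggle']}).set_index('name')

# One row's worth of keywords (hypothetical values).
keywords = ['google', 'iphone', 'apple']

# map() looks each keyword up in the index of df2['stockcode'];
# keywords with no matching name become NaN.
mapped = pd.Series(keywords).map(df2['stockcode'])

# dropna() removes the misses, leaving only the matched codes.
codes = mapped.dropna().tolist()
print(codes)
```

The order of the codes follows the order of the keywords in the row, not the order of the rows in df2.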

Or:

df1.apply(lambda x: ','.join(pd.Series(x['keywords']).map(df2['stockcode']).dropna()),1)

0         appl
1             
2         ggle
3             
4    ggle,appl
dtype: object

Or:

df1.apply(lambda x: ','.join(pd.Series(x['keywords']).map(df2['stockcode']).dropna()),1)\
                       .replace('','null')

0         appl
1         null
2         ggle
3         null
4    ggle,appl
dtype: object

df1['stockcode'] = df1.apply(lambda x: ','.join(pd.Series(x['keywords'])\
                                          .map(df2['stockcode']).dropna()),1)\
                             .replace('','null')
print(df1)

   x  y  z                           keywords  stockcode
0  a  b  c  [apple, iphone, watch, newdevice]       appl
1  e  w  q                                NaN       null
2  w  r  t                    [pixel, google]       ggle
3  s  t  q                  [india, computer]       null
4  d  j  o                    [google, apple]  ggle,appl
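As an alternative to the row-wise apply, the same result can probably be reached with explode and groupby. This is only a sketch under assumptions: the sample frames below are made up to mirror the thread's data, and `lookup`, `matched`, and `joined` are illustrative names, not part of the original answer.

```python
import numpy as np
import pandas as pd

# Assumed sample data mirroring the thread.
df1 = pd.DataFrame({
    'x': list('aewsd'),
    'keywords': [['apple', 'iphone', 'watch', 'newdevice'],
                 np.nan,
                 ['pixel', 'google'],
                 ['india', 'computer'],
                 ['google', 'apple']],
})
df2 = pd.DataFrame({'name': ['apple', 'google'],
                    'stockcode': ['appl', 'ggle']})

# Series keyed by name, used as the lookup table.
lookup = df2.set_index('name')['stockcode']

# explode() yields one row per keyword and keeps the original row index;
# map() turns keywords into codes, and dropna() removes the misses.
matched = df1['keywords'].explode().map(lookup).dropna()

# Collect the codes of each original row into one comma-separated string.
joined = matched.groupby(level=0).agg(','.join)

# Rows with no match at all are absent from `joined`; fill them with 'null'.
df1['stockcode'] = joined.reindex(df1.index, fill_value='null')
print(df1)
```

This avoids building a temporary Series per row, which may matter on larger frames.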

User

#2 · Edited 2024-06-30 08:04:03

You can convert df2 into a lookup dictionary and then map it onto df1 ;)

import numpy as np
import pandas as pd


data1 = {'x':'a,e,w'.split(','),
         'keywords':['apple,iphone,watch,newdevice'.split(','),
                    np.nan,
                    'pixel,google'.split(',')]}
data2 = {'name':'apple lg htc google'.split(),
        'stockcode':'appl weew rrr ggle'.split()}

df1 = pd.DataFrame(data1)
df2 = pd.DataFrame(data2)

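# Build a plain dict: name -> stockcode
mapper = df2.set_index('name').to_dict()['stockcode']

# Replace NaN with '' so every row is iterable, then collect the code of each matching keyword
df1['stockcode'] = df1['keywords'].fillna('').apply(lambda x: [mapper[i] for i in x if (i and i in mapper)])
# Keeps only the first match per row; use ','.join(x) instead of x[0] to keep all matches
df1['stockcode'] = df1['stockcode'].apply(lambda x: x[0] if x else np.nan)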
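The question asks for every matching code (for example `aapl,ggle`) and compares names case-insensitively, so a variant of the dictionary approach that keeps all matches might look like the sketch below. The sample data is assumed, and `codes_for` is a hypothetical helper introduced only for illustration.

```python
import numpy as np
import pandas as pd

# Assumed sample data; the mixed-case 'Apple' exercises the case-insensitive match.
df1 = pd.DataFrame({'x': ['a', 'e', 'w'],
                    'keywords': [['Apple', 'iphone', 'google'], np.nan, ['pixel']]})
df2 = pd.DataFrame({'name': 'apple lg htc google'.split(),
                    'stockcode': 'appl weew rrr ggle'.split()})

# Lookup dict keyed by lower-cased name.
mapper = dict(zip(df2['name'].str.lower(), df2['stockcode']))

def codes_for(keywords):
    """Hypothetical helper: return all matching codes joined by commas, or NaN."""
    if not isinstance(keywords, list):
        return np.nan  # NaN or any other non-list cell has nothing to match
    hits = [mapper[k.lower()] for k in keywords if k and k.lower() in mapper]
    return ','.join(hits) if hits else np.nan

df1['stockcode'] = df1['keywords'].apply(codes_for)
print(df1)
```

Returning NaN for rows without a match keeps the column compatible with `isna()` and `fillna('null')` if the literal string is wanted later.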
