Python对字符串进行样条处理，添加行，每列不同

Date Unit Length AM/PM unit_new 5 Monday\r13 January 12345H\rEngineering - Unit 1: Engineering Principles\r23456H\rHealth and Social Care - Unit 2: Working in Health\rand Social Care 2h 00m\r1h 30m morning 6 Tuesday\r14 January 34567H\rBusiness/Enterprise and Entrepreneurship -\rUnit 3: Personal and Business Finance\r12345L\rApplied Human Biology - Unit 1: Principles of\rHuman Biology\r23456K\rConstruction and the Built Environment -\rUnit 1: Construction Principles 2h 00m\r1h 30m\r1h 30m morning 7 Wednesday\r15 January 34567H/1C\rApplied Science/Forensic and Criminal Investigation\r- Unit 1: Principles and Applications of Science I -\rChemistry\r12345H\rSport and Exercise Science - Unit 1: Sport and Exercise\rPhysiology 0h 40m\r1h 30m morning

def code_count_func(): code_count = df.Unit.str.count('\d{5}\w').subtract(+1) # drop na's to stop error code_count.dropna(inplace = True) # converting to int code_count = code_count.iloc[0:].astype(int)

1条回答

网友

1楼 · 发布于 2024-10-02 16:23:32

这可能会起作用，如果使用更精细的正则表达式，效果会更好。我的列可能已从复制/粘贴过程中关闭，但逻辑应该正确

拿到单位

df['Unit'] = df['Unit'].str.split('(.+?(?=\d{5}))')

了解长度

lengths = df['AM/PM'].str.split(r'\\r').explode()

分解单元，从正则表达式中删除空条目，并将长度返回到数据帧

df = pd.concat([df.explode('Unit').query("Unit != ''"), lengths], axis=1)

            Date           ...                                               Unit   AM/PM
5     Monday\r13  January  ...  12345H\rEngineering - Unit 1: Engineering Prin...  2h 00m
5     Monday\r13  January  ...  23456H\rHealth and Social Care - Unit 2: Worki...  1h 30m
6    Tuesday\r14  January  ...  34567H\rBusiness/Enterprise and Entrepreneursh...  2h 00m
6    Tuesday\r14  January  ...  12345L\rApplied Human Biology - Unit 1: Princi...  1h 30m
6    Tuesday\r14  January  ...  23456K\rConstruction and the Built Environment...  1h 30m
7  Wednesday\r15  January  ...  34567H/1C\rApplied Science/Forensic and Crimin...  0h 40m
7  Wednesday\r15  January  ...  12345H\rSport and Exercise Science - Unit 1: S...  1h 30m

相关问题更多 >

编程相关推荐

热门问题

热门文章