我在玩著名的泰坦尼克号数据。我有逗号分隔的csv数据。数据如下所示:
passengerId,survived,pclass,name,sex,age,sibSp,parch,ticket,fare,cabin,embarked
1,0,3,"Braund, Mr. Owen Harris",male,22,1,0,A/5 21171,7.25,,S
2,1,1,"Cumings, Mrs. John Bradley (Florence Briggs Thayer)",female,38,1,0,PC 17599,71.2833,C85,C
我正在尝试使用pandas.csv_read
,但它不起作用
我的代码:
import pandas as pd
titanic = pd.read_csv('titanic.csv')
print(titanic.head(10))
我尝试了csv_read方法的两种组合:sep = ',', decimal = ',', delimiter = ','
,但仍然得到了相同的输出,即:
passengerId survived ... cabin embarked
0 1,0,3,"Braund, Mr. Owen Harris",male,22,1,0,A/... NaN ... NaN NaN
1 2,1,1,"Cumings, Mrs. John Bradley (Florence Br... NaN ... NaN NaN
2 3,1,3,"Heikkinen, Miss. Laina",female,26,0,0,S... NaN ... NaN NaN
我试图搜索其他问题,但找不到答案。谢谢你的帮助
问题似乎是列中有一些逗号
quotechar参数可能会帮助您,因为它会告诉pandas忽略指定字符(“)之间的逗号
相关问题 更多 >
编程相关推荐