How to read cs

892,3,"Kelly, Mr. James",male,34.5,0,0,330911,7.8292,,Q
893,3,"Wilkes, Mrs. James (Ellen Needs)",female,47,1,0,363272,7,,S
894,2,"Myles, Mr. Thomas Francis",male,62,0,0,240276,9.6875,,Q
895,3,"Wirz, Mr. Albert",male,27,0,0,315154,8.6625,,S
896,3,"Hirvonen, Mrs. Alexander (Helga E Lindqvist)",female,22,1,1,3101298,12.2875,,S
897,3,"Svensson, Mr. Johan Cervin",male,14,0,0,7538,9.225,,S

892 3 "Kelly, Mr. James" male 34.5 0 0 330911 7.8292 NaN Q
893 3 "Wilkes, Mrs. James (Ellen Needs)" female 47 1 0 363272 7 NaN S
894 2 "Myles, Mr. Thomas Francis" male 62 0 0 240276 9.6875 NaN Q
895 3 "Wirz, Mr. Albert" male 27 0 0 315154 8.6625 NaN S
896 3 "Hirvonen, Mrs. Alexander (Helga E Lindqvist)" female 22 1 1 3101298 12.2875 NaN S
897 3 "Svensson, Mr. Johan Cervin" male 14 0 0 7538 9.225 S

3 answers

User

#1 · edited 2024-10-06 11:27:00

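I'm not entirely sure what you mean, but I think this will be useful to you.

I implemented two extra functions that decide whether a string is a float or an integer.

If a cell is an empty string, I store None, but you can change that to whatever you like.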

import csv

def isfloat(x):
    """Return True if the string x can be parsed as a float."""
    try:
        float(x)
    except ValueError:
        return False
    return True

def isint(x):
    """Return True if the string x can be parsed as an integer."""
    try:
        int(x)
    except ValueError:
        return False
    return True


data = []
# Python 3: open in text mode with newline='' as the csv module recommends.
with open('trainData.csv', newline='') as csv_file:
    reader = csv.reader(csv_file)
    # If the file starts with a header row, skip it here: header = next(reader)
    for row in reader:
        for index, cell in enumerate(row):
            if not cell:  # cell == ''
                row[index] = None  # you can change the value to whatever you like.
            elif isint(cell):
                row[index] = int(cell)
            elif isfloat(cell):
                row[index] = float(cell)
        data.append(row)

print(data)

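To try the same idea without a file on disk, here is a minimal, self-contained sketch. It is a variation on the code above, not a replacement for it: `convert_cell`, `parse_rows`, and `SAMPLE` are names made up for this illustration, and the sample text is just the first two rows from the question.

```python
import csv
import io

# Two rows copied from the question, used as an in-memory stand-in for the file.
SAMPLE = '''892,3,"Kelly, Mr. James",male,34.5,0,0,330911,7.8292,,Q
893,3,"Wilkes, Mrs. James (Ellen Needs)",female,47,1,0,363272,7,,S
'''

def convert_cell(cell):
    """Convert one CSV cell to None, int, or float; otherwise keep the string."""
    if cell == '':
        return None  # empty cell; change this to whatever you like
    for cast in (int, float):  # try the stricter type first
        try:
            return cast(cell)
        except ValueError:
            pass
    return cell

def parse_rows(text):
    """Parse CSV text into a list of rows with converted cells."""
    reader = csv.reader(io.StringIO(text))
    return [[convert_cell(cell) for cell in row] for row in reader]

rows = parse_rows(SAMPLE)
print(rows[0])
# [892, 3, 'Kelly, Mr. James', 'male', 34.5, 0, 0, 330911, 7.8292, None, 'Q']
```

Trying `int` before `float` keeps values such as 47 as integers while 34.5 still becomes a float; quoted names containing commas are handled by `csv.reader` itself.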

User

#2 · edited 2024-10-06 11:27:00

You can do this more easily with the pandas library, as follows:

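import pandas as pd

# Read the file with an explicit type for every column.
df = pd.read_csv("trainData.csv", dtype={'col1': int, 'col2': int, 'col3': str, 'col4': str, 'col5': float, 'col6': int,
                                  'col7': int, 'col8': float, 'col9': float, 'col10': str, 'col11': str})
# Convert the DataFrame to a list of lists, one per row (Python 3).
rows = df.values.tolist()
print(rows)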


The CSV file should look like the following, because the first line holds the column names:

col1,col2,col3,col4,col5,col6,col7,col8,col9,col10,col11
892,3,"Kelly, Mr. James",male,34.5,0,0,330911,7.8292,,Q
893,3,"Wilkes, Mrs. James (Ellen Needs)",female,47,1,0,363272,7,,S
894,2,"Myles, Mr. Thomas Francis",male,62,0,0,240276,9.6875,,Q
895,3,"Wirz, Mr. Albert",male,27,0,0,315154,8.6625,,S
896,3,"Hirvonen, Mrs. Alexander (Helga E Lindqvist)",female,22,1,1,3101298,12.2875,,S
897,3,"Svensson, Mr. Johan Cervin",male,14,0,0,7538,9.225,,S

You can read more about pandas here: http://pandas.pydata.org/pandas-docs/stable/tutorials.html
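If adding a header line to the file is inconvenient, one possible alternative is to tell pandas that the file has no header and pass the column names yourself. The sketch below assumes the same placeholder names `col1` to `col11` and reads from an in-memory `SAMPLE` string (a made-up stand-in for the file, using two rows from the question) so that it runs on its own.

```python
import io
import pandas as pd

# In-memory stand-in for a header-less trainData.csv (two rows from the question).
SAMPLE = '''892,3,"Kelly, Mr. James",male,34.5,0,0,330911,7.8292,,Q
893,3,"Wilkes, Mrs. James (Ellen Needs)",female,47,1,0,363272,7,,S
'''

# Placeholder column names: col1 ... col11.
COLUMNS = ['col%d' % i for i in range(1, 12)]

df = pd.read_csv(
    io.StringIO(SAMPLE),
    header=None,    # the data has no header row
    names=COLUMNS,  # so the names are supplied here instead
    dtype={'col1': int, 'col2': int, 'col3': str, 'col4': str, 'col5': float},
)

rows = df.values.tolist()  # list of lists, one per CSV row
print(rows)
```

Empty cells come back as NaN rather than an empty string, which matches the NaN values in the layout shown in the question.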

User

#3 · edited 2024-10-06 11:27:00

I'm assuming you are using pandas, since the question is tagged pandas. Read the file as follows:

import pandas as pd

df = pd.read_csv('test.txt', skiprows=0, index_col=0,
            names='city_type name sex weight has_cat has_dog bank_balance body_fat_index car_mileage car_type'.split())

This gives you a DataFrame with the named columns.

I took the liberty of making up names for the columns.

Once you have read the data into a DataFrame, you can do all sorts of magic with it; have a look at the pandas tutorials (they are great).
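As one illustration of what becomes possible once the data is in a DataFrame, here is a minimal sketch that filters rows and computes a per-group average. It is only an assumption about what might be useful: the column names `col1` to `col11` are neutral placeholders rather than the made-up names above, and `SAMPLE` is an in-memory copy of the six rows from the question so the snippet runs without a file.

```python
import io
import pandas as pd

# The six rows from the question, held in memory instead of a file.
SAMPLE = '''892,3,"Kelly, Mr. James",male,34.5,0,0,330911,7.8292,,Q
893,3,"Wilkes, Mrs. James (Ellen Needs)",female,47,1,0,363272,7,,S
894,2,"Myles, Mr. Thomas Francis",male,62,0,0,240276,9.6875,,Q
895,3,"Wirz, Mr. Albert",male,27,0,0,315154,8.6625,,S
896,3,"Hirvonen, Mrs. Alexander (Helga E Lindqvist)",female,22,1,1,3101298,12.2875,,S
897,3,"Svensson, Mr. Johan Cervin",male,14,0,0,7538,9.225,,S
'''

# Placeholder column names: col1 ... col11.
COLUMNS = ['col%d' % i for i in range(1, 12)]

df = pd.read_csv(io.StringIO(SAMPLE), header=None, names=COLUMNS)

# Boolean indexing: keep only the rows whose fourth column is 'female'.
women = df[df['col4'] == 'female']

# Group by the fourth column and average the fifth column within each group.
mean_col5 = df.groupby('col4')['col5'].mean()

print(women[['col1', 'col3']])
print(mean_col5)
```

Boolean indexing and `groupby` are two of the most common starting points; the tutorials cover many more.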
