Pandas数据清理

2024-05-20 13:44:21 发布

您现在位置:Python中文网/ 问答频道 /正文

import pandas as pd 
import numpy as np 
import sys
auto = pd.read_csv(
    "https://archive.ics.uci.edu/ml/machine-learning-databases/auto-mpg/auto-mpg.data",
    names=['MPG', 'Cylinders', 'Displacement', 'Horse power',
           'Weight', 'Acceleration', 'Model Year', 'Origin', 'Car Name']
)

auto.head()

Out put image

我需要清理这些数据,但我一直在把这些数据公布出来,需要一些帮助。我是初学者,我搞不懂


Tags: csv数据httpsimportnumpypandasreadauto
2条回答

如果您查看文件,分隔符不是常量,而是空格的变体。sep='\s+'正在提供所需的输出。在

url = "https://archive.ics.uci.edu/ml/machine-learning-databases/auto-mpg/auto-mpg.data"
df = pd.read_csv(url, sep = '\s+',names = ['MPG','Cylinders','Displacement','Horse power','Weight','Acceleration','Model Year','Origin','Car Name'])
df.head() 


    MPG Cylinders   Displacement    Horse power Weight  Acceleration    Model Year  Origin  Car Name
0   18      8       307             130.0       3504    12.0            70      1   chevrolet chevelle malibu
1   15      8       350             165.0       3693    11.5            70      1   buick skylark 320
2   18      8       318             150.0       3436    11.0            70      1   plymouth satellite
3   16      8       304             150.0       3433    12.0            70      1   amc rebel sst
4   17      8       302             140.0       3449    10.5            70      1   ford torino

使用delim_whitespace参数:

url = 'https://archive.ics.uci.edu/ml/machine-learning-databases/auto-mpg/auto-mpg.data'
cols = ['MPG', 'Cylinders', 'Displacement', 'Horse power', 'Weight', 
        'Acceleration', 'Model Year', 'Origin', 'Car Name']

auto = pd.read_csv(url, names=cols, delim_whitespace=True)

auto.head()
Out: 
    MPG  Cylinders  Displacement Horse power  Weight  Acceleration  \
0  18.0          8         307.0       130.0  3504.0          12.0   
1  15.0          8         350.0       165.0  3693.0          11.5   
2  18.0          8         318.0       150.0  3436.0          11.0   
3  16.0          8         304.0       150.0  3433.0          12.0   
4  17.0          8         302.0       140.0  3449.0          10.5   

   Model Year  Origin                   Car Name  
0          70       1  chevrolet chevelle malibu  
1          70       1          buick skylark 320  
2          70       1         plymouth satellite  
3          70       1              amc rebel sst  
4          70       1                ford torino  

相关问题 更多 >