Detecting which columns in a DataFrame are categorical using Python

1 answer

User

#1 · Posted on 2024-09-29 23:20:18

You can try the following approach:

import pandas as pd

df = pd.DataFrame({"ID": [12324, 26342, 62438], "passengerClass": [1, 2, 2], "nationality": ["FR", "ES", "US"]})
# Convert every column to the pandas "category" dtype
df = df.astype('category')
print(df.dtypes)

Output:

ID                category
passengerClass    category
nationality       category
dtype: object

Note:

In the example above, every column is converted to "category", including the numeric ID column. You can instead specify the dtype explicitly for individual columns.
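As a minimal sketch of the per-column variant mentioned in the note, the snippet below passes a dict to `astype` so that only selected columns become "category", then uses `select_dtypes` to list which columns currently have that dtype. Which columns to treat as categorical is an assumption made here for illustration; the original answer does not say.

```python
import pandas as pd

df = pd.DataFrame({
    "ID": [12324, 26342, 62438],
    "passengerClass": [1, 2, 2],
    "nationality": ["FR", "ES", "US"],
})

# Assumption for illustration: only these two columns are categorical,
# so "ID" keeps its integer dtype.
df = df.astype({"passengerClass": "category", "nationality": "category"})

# List the columns whose dtype is "category".
categorical_cols = df.select_dtypes(include="category").columns.tolist()
print(categorical_cols)  # ['passengerClass', 'nationality']
```

Note that `select_dtypes` only reports columns that already carry the "category" dtype; it does not infer whether a plain integer or string column is categorical in meaning.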

-Optional-

You can create a config file that explicitly maps each column name to its dtype:

Config file (config.json):

[
  {
    "columnName": "ID",
    "columnDtype": "category"
  },
  {
    "columnName": "passengerClass",
    "columnDtype": "category"
  },
  {
    "columnName": "nationality",
    "columnDtype": "category"
  }
]

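Code:

import json
import pandas as pd

df = pd.DataFrame({"ID": [12324, 26342, 62438], "passengerClass": [1, 2, 2], "nationality": ["FR", "ES", "US"]})

# Load the list of {"columnName", "columnDtype"} entries from the config file
with open('./config.json') as cf:
    configList = json.load(cf)

# Cast each listed column to its configured dtype
for col in configList:
    colName = col['columnName']
    colType = col['columnDtype']
    df[colName] = df[colName].astype(colType)

print(df.dtypes)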
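The per-column loop can also be collapsed into a single `astype` call by first turning the config entries into a dict. The sketch below is one possible variant, not part of the original answer: it reads the config from an inline string (a stand-in for `config.json`, so the snippet runs on its own) and skips entries naming columns the DataFrame does not have. The names `config_text` and `dtype_map` are hypothetical.

```python
import json
import pandas as pd

# Inline stand-in for the contents of config.json (same structure as above).
config_text = """
[
  {"columnName": "ID", "columnDtype": "category"},
  {"columnName": "passengerClass", "columnDtype": "category"},
  {"columnName": "nationality", "columnDtype": "category"}
]
"""
config_list = json.loads(config_text)

df = pd.DataFrame({
    "ID": [12324, 26342, 62438],
    "passengerClass": [1, 2, 2],
    "nationality": ["FR", "ES", "US"],
})

# Build {column name: dtype}, skipping entries for columns the frame lacks.
dtype_map = {
    entry["columnName"]: entry["columnDtype"]
    for entry in config_list
    if entry["columnName"] in df.columns
}

# Cast all configured columns in one call.
df = df.astype(dtype_map)
print(df.dtypes)
```

Filtering on `df.columns` avoids a `KeyError` when the config mentions a column that is absent from the data; drop the filter if a missing column should be treated as an error.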
