Pandas:ValueError:无法将float NaN转换为integ

# x contained NaN df = df[~df['x'].isnull()] # Y contained some other garbage, so null check was not enough df = df[df['y'].str.isnumeric()] # final conversion now worked df[['x']] = df[['x']].astype(int) df[['y']] = df[['y']].astype(int)

3条回答

网友
1楼 · 编辑于 2024-10-01 17:29:22

要识别NaN值，请使用^{}：
print(df[df['x'].isnull()])
然后，对于删除所有非数值，请将^{}与参数errors='coerce'一起使用-它将非数值替换为NaN：
df['x'] = pd.to_numeric(df['x'], errors='coerce')
若要删除列x中具有NaNs的所有行，请使用^{}：
df = df.dropna(subset=['x'])
上次将值转换为ints：
df['x'] = df['x'].astype(int)

网友
2楼 · 编辑于 2024-10-01 17:29:22

我知道这已经得到了回答，但我想为将来的任何人提供另一种解决方案：
您可以使用.loc仅通过notnull()的值对数据帧进行子集，然后仅对'x'列进行子集。取同一个载体，然后apply(int)到它上面。
如果x列是浮动的：
df.loc[df['x'].notnull(), 'x'] = df.loc[df['x'].notnull(), 'x'].apply(int)

网友
3楼 · 编辑于 2024-10-01 17:29:22

ValueError: cannot convert float NaN to integer

从v0.24开始，你实际上可以。Pandas引入了Nullable Integer Data Types，允许整数与NaNs共存。

给定一系列缺少数据的全浮点数

s = pd.Series([1.0, 2.0, np.nan, 4.0])
s

0    1.0
1    2.0
2    NaN
3    4.0
dtype: float64

s.dtype
# dtype('float64')

您可以使用以下命令将其转换为可为空的int类型（从Int16、Int32或Int64中选择一个）

s2 = s.astype('Int32') # note the 'I' is uppercase
s2

0      1
1      2
2    NaN
3      4
dtype: Int32

s2.dtype
# Int32Dtype()

你的专栏需要有完整的数字，才能进行演员阵容。任何其他操作都会引发类型错误：

s = pd.Series([1.1, 2.0, np.nan, 4.0])

s.astype('Int32')
# TypeError: cannot safely cast non-equivalent float64 to int32

ValueError: cannot convert float NaN to integer

相关问题更多 >

编程相关推荐

热门问题

热门文章