How do I build a custom sklearn cross-validation iterator from indices?

>>> train_indices = [[1,3,5,7,9],[2,4,6,8]]
>>> test_indices = [[2,4,6,8],[1,3,5,7,9]]
                    1st fold^   2nd fold^
>>> custom_cv = sklearn.cross_validation.customcv(train_indices,test_indices)
>>> clf = GridSearchCV(X,y,params,cv=custom_cv)

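2 answers

User

Answer 1 · edited 2024-06-01 06:23:04

Cross-validation iterators are exactly that: iterators. On each iteration they yield one (train, test) pair of index sets, one pair per fold. This should work for you: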

custom_cv = zip(train_indices, test_indices)
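To show how the zipped folds plug into a grid search, here is a minimal sketch. It assumes a recent scikit-learn where `GridSearchCV` lives in `sklearn.model_selection`; the toy data, the `LogisticRegression` estimator, and the `C` grid are made up for illustration and are not part of the question.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

# Toy data (assumption): 10 samples, 2 random features, and labels arranged
# so that every fold below contains both classes.
rng = np.random.RandomState(0)
X = rng.rand(10, 2)
y = (np.arange(10) // 2) % 2  # [0, 0, 1, 1, 0, 0, 1, 1, 0, 0]

train_indices = [[1, 3, 5, 7, 9], [2, 4, 6, 8]]
test_indices = [[2, 4, 6, 8], [1, 3, 5, 7, 9]]

# Materialize the zip into a list so the folds can be inspected and reused.
custom_cv = [
    (np.array(train), np.array(test))
    for train, test in zip(train_indices, test_indices)
]

# The estimator comes first, then the parameter grid; X and y go to fit().
search = GridSearchCV(
    LogisticRegression(),
    param_grid={"C": [0.1, 1.0]},
    cv=custom_cv,
)
search.fit(X, y)
print(search.n_splits_)
```

Note that `GridSearchCV` takes the estimator and parameter grid in its constructor, while `X` and `y` are passed to `fit`.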

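Alternatively, for the specific case you describe, you can use LeaveOneLabelOut:

# Legacy API: the sklearn.cross_validation module was removed in scikit-learn 0.20.
import numpy as np
from sklearn.cross_validation import LeaveOneLabelOut
labels = np.arange(0, 10) % 2
cv = LeaveOneLabelOut(labels)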

Inspecting list(cv) gives

[(array([1, 3, 5, 7, 9]), array([0, 2, 4, 6, 8])),
 (array([0, 2, 4, 6, 8]), array([1, 3, 5, 7, 9]))]
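On current scikit-learn releases the same split can probably be reproduced with `LeaveOneGroupOut` from `sklearn.model_selection`. The sketch below assumes that class as the replacement; the placeholder `X` is only there because `split` needs to know the number of samples.

```python
import numpy as np
from sklearn.model_selection import LeaveOneGroupOut

groups = np.arange(0, 10) % 2  # same alternating labels as above
X = np.zeros((10, 1))          # placeholder features; only the row count matters

cv = LeaveOneGroupOut()

# Unlike the legacy class, the groups are passed to split(), not the constructor.
folds = [
    (train.tolist(), test.tolist())
    for train, test in cv.split(X, groups=groups)
]
print(folds)
```

The splitter object itself can be passed as `cv=` to `GridSearchCV`, with `groups=groups` supplied to `fit`.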

User

Answer 2 · edited 2024-06-01 06:23:04

Actually, the solution above returns each row as one fold. What we really need is:

    [(train_indices, test_indices)] # for one fold

    [(train_indices, test_indices), # 1st fold
    (train_indices, test_indices)] # 2nd fold etc
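For the index lists in the question, a quick way to check the structure is to materialize the pairs and look at them. This is a plain-Python sketch with no sklearn involved; the disjointness check is an extra sanity step added for illustration.

```python
train_indices = [[1, 3, 5, 7, 9], [2, 4, 6, 8]]
test_indices = [[2, 4, 6, 8], [1, 3, 5, 7, 9]]

# Pair the i-th training list with the i-th test list: one tuple per fold.
folds = list(zip(train_indices, test_indices))

for fold_number, (train, test) in enumerate(folds, start=1):
    # A sample should never be in both the training and test set of a fold.
    assert not set(train) & set(test), f"fold {fold_number} overlaps"
    print(fold_number, train, test)
```

Each element of `folds` is one `(train_indices, test_indices)` tuple, which is the list-of-tuples shape shown above.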
