擅长:python、mysql、java
<p>我已经通过修改transformer类来解决这个问题,并返回一个<strong>单个<strong>列表(包含多个数据帧):</p>
<pre><code>class DataSubsetGenerator(BaseEstimator, TransformerMixin):
def __init__(self, sub_percentage, random_state = 42):
self.sub_percentage = sub_percentage
self.random_state = random_state
def fit(self):
return self
def transform(self, dataframes):
X_train, X_test, y_train, y_test = dataframes
# Do data processing stuff here, removed to simplify example here...
return [X_train_sub, X_test_sub, y_train_sub, y_test_sub]
</code></pre>
<p>如果有更好的解决方案或普遍接受的模式,请随时告诉我</p>