Numpy medianofmeans跨非均等阵列计算

import numpy as np m = 10 n = 10000 # A random data matrix X = np.random.uniform(low=0.0, high=1.0, size=(m,n)).astype(np.float64) # Number of buckets to split rows into b = 5 # Partition the rows of X into b buckets row_indices = np.arange(X.shape[0]) buckets = np.array(np.array_split(row_indices, b)) X_bucketed = X[buckets, :] # Compute the mean within each bucket bucket_means = np.mean(X_bucketed, axis=1) # Compute the median-of-means median = np.median(bucket_means, axis=0) # Edit - Method 2 (based on answer) np.random.shuffle(row_indices) X = X[row_indices, :] buckets2 = np.array_split(X, b, axis=0) bucket_means2 = [np.mean(x, axis=0) for x in buckets2] median2 = np.median(np.array(bucket_means2), axis=0)

1条回答

网友

1楼 · 发布于 2024-09-26 22:55:37

<>你可以考虑分别计算每个桶的平均值，然后叠加并计算中值。您也可以直接使用array_split到X，不需要使用切片索引数组对其进行索引（可能这是您的主要问题？）

m = 11
n = 10000

# A random data matrix
X = np.random.uniform(low=0.0, high=1.0, size=(m,n)).astype(np.float64)

# Number of buckets to split rows into
b = 5

# Partition the rows of X into b buckets
buckets = np.array_split(X, 2, axis = 0)

# Compute the mean within each bucket
b_means = [np.mean(x, axis=0) for x in buckets]

# Compute the median-of-means
median = np.median(np.array(b_means), axis=0)

print(median) #(10000,) shaped array

相关问题更多 >

编程相关推荐

热门问题

热门文章