Python中加权对数正态分布拟合的正确方法

2条回答

网友

1楼 · 编辑于 2024-09-24 22:18:07

您可以使用numpy.repeat来提高解决方案的效率：

import numpy as np

dataToLearn = np.array([1,2,3,4,5])
weights = np.array([1,2,1,1,3])

print(np.repeat(dataToLearn, weights))
# Output: array([1, 2, 2, 3, 4, 5, 5, 5])

对numpy.repeat性能的非常基本的性能测试：

^{pr2}$

因此，对于您当前的方法，我得到了大约3.38，而对于numpy.repeat，我得到了0.75

网友

2楼 · 编辑于 2024-09-24 22:18:07

SciPy分布不实现加权拟合。然而，对于对数正态分布，有（未加权）maximum likelihood estimation的显式公式，这些公式很容易推广到加权数据。显式公式都是（实际上）平均值，对加权数据情况的概括是在公式中使用加权平均值。在

下面是一个脚本，它使用一个具有整数权重的小数据集演示计算，因此我们知道拟合参数的确切值应该是多少。在

import numpy as np
from scipy.stats import lognorm


# Sample data and weights.  To enable an exact comparison with
# the method of generating an array with the values repeated
# according to their weight, I use an array of weights that is
# all integers.
x = np.array([2.5, 8.4, 9.3, 10.8, 6.8, 1.9, 2.0])
w = np.array([  1,   1,   2,    1,   3,   3,   1])


#                                      -
# Fit the log-normal distribution by creating an array containing the values
# repeated according to their weight.
xx = np.repeat(x, w)

# Use the explicit formulas for the MLE of the log-normal distribution.
lnxx = np.log(xx)
muhat = np.mean(lnxx)
varhat = np.var(lnxx)

shape = np.sqrt(varhat)
scale = np.exp(muhat)

print("MLE using repeated array: shape=%7.5f   scale=%7.5f" % (shape, scale))


#                                      -
# Use the explicit formulas for the weighted MLE of the log-normal
# distribution.

lnx = np.log(x)
muhat = np.average(lnx, weights=w)
# varhat is the weighted variance of ln(x).  There isn't a function in
# numpy for the weighted variance, so we compute it using np.average.
varhat = np.average((lnx - muhat)**2, weights=w)

shape = np.sqrt(varhat)
scale = np.exp(muhat)

print("MLE using weights:        shape=%7.5f   scale=%7.5f" % (shape, scale))


#                                      -
# Might as well check that we get the same result from lognorm.fit() using the
# repeated array

shape, loc, scale = lognorm.fit(xx, floc=0)

print("MLE using lognorm.fit:    shape=%7.5f   scale=%7.5f" % (shape, scale))

输出是

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章

Python中加权对数正态分布拟合的正确方法

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >