在python中如何利用已知的似然函数求取最大似然估计的参数误差?

2024-10-05 10:03:17 发布

您现在位置:Python中文网/ 问答频道 /正文

我有一些数据,想拟合给定的心理测量函数p。 psychometric function 我对拟合参数和误差也很感兴趣。用scipy软件包中的曲线拟合函数“经典”方法,很容易得到p的参数和误差。但是我想用最大似然估计(MLE)来做同样的事情。从输出和图中可以看出,这两种方法提供的参数略有不同。实现MLE不是问题,但我不知道如何使用这种方法来获得错误。有没有简单的方法可以得到它们?我的似然函数L是: likelihood function 我无法修改这里描述的代码http://rlhick.people.wm.edu/posts/estimating-custom-mle.html,但这可能是一个解决方案。我如何实现这一点?或者这个还有别的办法吗?在

这里使用scipy stats模型拟合了一个类似的函数:https://stats.stackexchange.com/questions/66199/maximum-likelihood-curve-model-fitting-in-python。但参数误差也不计算。在

负对数似然函数是正确的,因为它提供了正确的参数,但是我想知道这个函数是否依赖于y数据?负对数似然函数l显然是l=-ln(l)。 这是我的代码:

#!/usr/bin/env python
# -*- coding: utf-8 -*- 

## libary
import numpy as np
import matplotlib.pyplot as plt
from scipy.optimize import curve_fit
from scipy.optimize import minimize


def p(x,x50,s50):
    """return y value of psychometric function p"""
    return 1./(1+np.exp(4.*s50*(x50-x)))

def initialparams(x,y):
    """return initial fit parameters for function p with given dataset"""
    midpoint = np.mean(x)
    slope = (np.max(y)-np.min(y))/(np.max(x)-np.min(x))
    return [midpoint, slope]

def cfit_error(pcov):
    """return errors of fir from covariance matrix"""
    return np.sqrt(np.diag(pcov))

def neg_loglike(params):
    """analytical negative log likelihood function. This function is dependend on the dataset (x and y) and the two parameters x50 and s50."""
    x50 = params[0]
    s50 = params[1]
    i = len(xdata)
    prod = 1.
    for i in range(i):
        #print prod
        prod *= p(xdata[i],x50,s50)**(ydata[i]*5) * (1-p(xdata[i],x50,s50))**((1.-ydata[i])*5)
    return -np.log(prod)


xdata = [0.,-7.5,-9.,-13.500001,-12.436171,-16.208617,-13.533123,-12.998025,-13.377527,-12.570075,-13.320075,-13.070075,-11.820075,-12.070075,-12.820075,-13.070075,-12.320075,-12.570075,-11.320075,-12.070075]
ydata = [1.,0.6,0.8,0.4,1.,0.,0.4,0.6,0.2,0.8,0.4,0.,0.6,0.8,0.6,0.2,0.6,0.,0.8,0.6]

intparams = initialparams(xdata, ydata)## guess some initial parameters


## normal curve fit using least squares algorithm
popt, pcov = curve_fit(p, xdata, ydata, p0=intparams)
print('scipy.optimize.curve_fit:')
print('x50 = {:f} +- {:f}'.format(popt[0], cfit_error(pcov)[0]))
print('s50 = {:f} +- {:f}\n'.format(popt[1], cfit_error(pcov)[1]))



## fitting using maximum likelihood estimation
results = minimize(neg_loglike, initialparams(xdata,ydata), method='Nelder-Mead')
print('MLE with self defined likelihood-function:')
print('x50 = {:f}'.format(results.x[0]))
print('s50 = {:f}'.format(results.x[1]))
#print results


## ploting the data and results
xfit = np.arange(-20,1,0.1)

fig = plt.figure()
ax = fig.add_subplot(1,1,1)
ax.plot(xdata, ydata, 'xb', label='measured data')
ax.plot(xfit, p(xfit, *popt), '-r', label='curve fit')
ax.plot(xfit, p(xfit, *results.x), '-g', label='MLE')
plt.legend()
plt.show()

输出为:

^{pr2}$

拟合和测量数据都可以在这里看到: results plot 我的Python版本是debianstretch上的2.7版本。谢谢你的帮助。在


Tags: 函数参数returnnpfunctionscipyresultsfit
1条回答
网友
1楼 · 发布于 2024-10-05 10:03:17

最后,由robhicks(http://rlhick.people.wm.edu/posts/estimating-custom-mle.html)描述的方法成功了。安装了numdifftools之后,我可以从hessian矩阵中计算出估计参数的误差。在

在具有su权限的Linux上安装numdifftools:

apt-get install python-pip
pip install numdifftools

下面是我的程序的完整代码示例:

^{pr2}$

生成以下输出:

scipy.optimize.curve_fit:
x50 = -18.702375 +- 1.246728
s50 = 0.063620 +- 0.041207

MLE with self defined likelihood-function:
x50 = -18.572181 +- 0.779847
s50 = 0.078935 +- 0.028783

但是,在使用numdifftools计算hessian矩阵时会出现一些运行时错误。有点被零除。这可能是因为我自己定义的负逻辑函数。最后给出了误差分析结果。使用“扩展Statsmodels”的方法可能更优雅,但我无法理解。在

相关问题 更多 >

    热门问题