<p>Here is a diagram, taken from a <a href="https://stats.stackexchange.com/a/149178">stats.stackexchange answer</a>, that illustrates the components of the box. Note that k=1.5 if you don't supply the <code>whis</code> keyword in Pandas.</p>
<p><a href="https://i.stack.imgur.com/ty5wN.png" rel="noreferrer"><img src="https://i.stack.imgur.com/ty5wN.png" alt="enter image description here"/></a></p>
<p>The boxplot function in Pandas is a wrapper for <code>matplotlib.pyplot.boxplot</code>. The <a href="https://matplotlib.org/api/_as_gen/matplotlib.pyplot.boxplot.html" rel="noreferrer">matplotlib docs</a> explain the components of the box in detail:</p>
<p><strong>Question A:</strong></p>
<blockquote>
<p>The box extends from the lower to upper quartile values of the data, with a line at the median.</p>
</blockquote>
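<p>That is, a quarter of the input data values lie below the box and a quarter lie above it, so the box itself spans the middle half of the data.</p>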
<p><strong>Question B:</strong></p>
<blockquote>
<p>whis : float, sequence, or string (default = 1.5)</p>
<p>As a float, determines the reach of the whiskers to the beyond the
first and third quartiles. In other words, where IQR is the
interquartile range (Q3-Q1), the upper whisker will extend to last
datum less than Q3 + whis*IQR). Similarly, the lower whisker will
extend to the first datum greater than Q1 - whis*IQR. Beyond the
whiskers, data are considered outliers and are plotted as individual
points.</p>
</blockquote>
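<p>To make the rule above concrete, here is a minimal pure-Python sketch of the calculation. The helper names <code>percentile</code> and <code>box_stats</code> are made up for this illustration and are not part of Matplotlib or Pandas; the quartiles use linear interpolation (NumPy's default percentile method), and data lying exactly on a fence is treated as inside it, which is an assumption of this sketch.</p>

```python
def percentile(sorted_vals, q):
    """Linear-interpolation percentile (q in 0..100) of already sorted data."""
    pos = (len(sorted_vals) - 1) * q / 100
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (pos - lo)


def box_stats(data, whis=1.5):
    """Hypothetical helper: box, whisker and outlier values for one data set."""
    vals = sorted(data)
    q1, med, q3 = (percentile(vals, q) for q in (25, 50, 75))
    iqr = q3 - q1
    # Fences: whiskers may not reach beyond these limits.
    lo_fence, hi_fence = q1 - whis * iqr, q3 + whis * iqr
    inside = [v for v in vals if lo_fence <= v <= hi_fence]
    return {
        "q1": q1,
        "median": med,
        "q3": q3,
        "whislo": inside[0],    # first datum at or above the lower fence
        "whishi": inside[-1],   # last datum at or below the upper fence
        "fliers": [v for v in vals if v < lo_fence or v > hi_fence],
    }


stats = box_stats([1, 2, 3, 4, 5, 6, 7, 8, 9, 30])
print(stats)  # whiskers end at 1 and 9; 30 is plotted as an outlier
```

<p>Note that the whiskers stop at actual data points (1 and 9 here), not at the fence values themselves (-3.5 and 14.5).</p>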
<p>Matplotlib (and Pandas) also offer a number of options to change the default definition of the whiskers:</p>
<blockquote>
<p>Set this to an unreasonably high value to force the whiskers to show
the min and max values. Alternatively, set this to an ascending
sequence of percentile (e.g., [5, 95]) to set the whiskers at specific
percentiles of the data. Finally, whis can be the string 'range' to
force the whiskers to the min and max of the data.</p>
</blockquote>