W3cubDocs

statsmodels.graphics.boxplots.beanplot

statsmodels.graphics.boxplots.beanplot(data, ax=None, labels=None, positions=None, side='both', jitter=False, plot_opts={}) [source]

Make a bean plot of each dataset in the data sequence.

A bean plot is a combination of a violinplot (kernel density estimate of the probability density function per point) with a line-scatter plot of all individual data points.

Parameters:	data (sequence of ndarrays) – Data arrays, one array per value in `positions`. ax (Matplotlib AxesSubplot instance, optional) – If given, this subplot is used to plot in instead of a new figure being created. labels (list of str, optional) – Tick labels for the horizontal axis. If not given, integers `1..len(data)` are used. positions (array_like, optional) – Position array, used as the horizontal axis of the plot. If not given, spacing of the violins will be equidistant. side ({'both', 'left', 'right'}, optional) – How to plot the violin. Default is ‘both’. The ‘left’, ‘right’ options can be used to create asymmetric violin plots. jitter (bool, optional) – If True, jitter markers within violin instead of plotting regular lines around the center. This can be useful if the data is very dense. plot_opts (dict, optional) – A dictionary with plotting options. All the options for `violinplot` can be specified, they will simply be passed to `violinplot`. Options specific to `beanplot` are: `’violin_width’ : float. Relative width of violins. Max available` space is 1, default is 0.8. ’bean_color’, MPL color. Color of bean plot lines. Default is ‘k’. Also used for jitter marker edge color if `jitter` is True. ’bean_size’, scalar. Line length as a fraction of maximum length. Default is 0.5. ’bean_lw’, scalar. Linewidth, default is 0.5. ’bean_show_mean’, bool. If True (default), show mean as a line. ’bean_show_median’, bool. If True (default), show median as a marker. ’bean_mean_color’, MPL color. Color of mean line. Default is ‘b’. ’bean_mean_lw’, scalar. Linewidth of mean line, default is 2. ’bean_mean_size’, scalar. Line length as a fraction of maximum length. Default is 0.5. ’bean_median_color’, MPL color. Color of median marker. Default is ‘r’. ’bean_median_marker’, MPL marker. Marker type, default is ‘+’. `’jitter_marker’, MPL marker. Marker type for jitter=True.` Default is ‘o’. ’jitter_marker_size’, int. Marker size. Default is 4. ’jitter_fc’, MPL color. Jitter marker face color. Default is None. ’bean_legend_text’, str. If given, add a legend with given text.
Returns:	fig – If `ax` is None, the created figure. Otherwise the figure to which `ax` is connected.
Return type:	Matplotlib figure instance

References

P. Kampstra, “Beanplot: A Boxplot Alternative for Visual Comparison of Distributions”, J. Stat. Soft., Vol. 28, pp. 1-9, 2008.

Examples

We use the American National Election Survey 1996 dataset, which has Party Identification of respondents as independent variable and (among other data) age as dependent variable.

>>> data = sm.datasets.anes96.load_pandas()
>>> party_ID = np.arange(7)
>>> labels = ["Strong Democrat", "Weak Democrat", "Independent-Democrat",
...           "Independent-Indpendent", "Independent-Republican",
...           "Weak Republican", "Strong Republican"]

Group age by party ID, and create a violin plot with it:

>>> plt.rcParams['figure.subplot.bottom'] = 0.23  # keep labels visible
>>> age = [data.exog['age'][data.endog == id] for id in party_ID]
>>> fig = plt.figure()
>>> ax = fig.add_subplot(111)
>>> sm.graphics.beanplot(age, ax=ax, labels=labels,
...                      plot_opts={'cutoff_val':5, 'cutoff_type':'abs',
...                                 'label_fontsize':'small',
...                                 'label_rotation':30})
>>> ax.set_xlabel("Party identification of respondent.")
>>> ax.set_ylabel("Age")
>>> plt.show()

(Source code, png, hires.png, pdf)

../_images/graphics_boxplot_beanplot.png

© 2009–2012 Statsmodels Developers
© 2006–2008 Scipy Developers
© 2006 Jonathan E. Taylor
Licensed under the 3-clause BSD License.
http://www.statsmodels.org/stable/generated/statsmodels.graphics.boxplots.beanplot.html