mpl-probscale

Probability plot missing data when using 'prob' option

Open jkmccaughey opened this issue 4 years ago • 2 comments

  • Python version: 3.7.6
  • numpy version: 1.19.2
  • matplotlib version: 3.3.2
  • mpl-probscale version: 0.2.5
  • Operating System: Windows 10, running in a Jupyter notebook

Description

I generated 100 data points with norm.rvs and plotted them using probscale. I have two plots: one uses 'prob' for the y-axis and the other uses 'qq'. The 'qq' plot shows all of the data, while the 'prob' plot does not.

What I Did

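from matplotlib import pyplot
from scipy.stats import norm
import probscale

fig, (ax1, ax2) = pyplot.subplots(figsize=(12,6), ncols=2, sharex=False)
# Options shared by both plots: probability on the y-axis, linear data axis
common_opts = dict(
    probax='y',
    datascale='linear',
    datalabel='',
    scatter_kws=dict(marker='o', linestyle='none')
)

# 100 draws from a standard normal distribution
df2 = norm.rvs(0,1,size=100)
# Left: probability scale (this is the plot that drops points)
fig = probscale.probplot(df2, ax=ax1, plottype='prob', bestfit=False, problabel='Probability', color='xkcd:ocean green', **common_opts)
# Right: quantile scale (shows all 100 points)
fig = probscale.probplot(df2, ax=ax2, plottype='qq', bestfit=False, problabel='Standard Normal Quantiles', color='xkcd:blue gray', **common_opts)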

What else?

If I set the y limits for the 'prob' case to something wider than the defaults, I can see the data points that were being left out.
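A minimal sketch of that workaround, for anyone who wants limits that are guaranteed to contain every point. It assumes the probability axis is in percent (0 to 100) and that the plotting positions follow the Cunnane formula with alpha = beta = 0.4; neither is confirmed in this thread. The helper names `cunnane_positions` and `padded_prob_limits` are hypothetical and are not part of probscale.

```python
def cunnane_positions(n, alpha=0.4, beta=0.4):
    """Plotting positions in percent for ranks 1..n.

    Assumption: Cunnane-style positions, (i - alpha) / (n + 1 - alpha - beta).
    """
    return [100.0 * (i - alpha) / (n + 1.0 - alpha - beta) for i in range(1, n + 1)]


def padded_prob_limits(n, pad=0.5):
    """Hypothetical helper: symmetric limits (percent) that contain every position.

    The lower limit is the smallest plotting position scaled by `pad` (< 1),
    and the upper limit mirrors it around 50%.
    """
    lower = cunnane_positions(n)[0] * pad
    return lower, 100.0 - lower


lo, hi = padded_prob_limits(100)
print(f"lower = {lo:.4f}%, upper = {hi:.4f}%")

# With the axes from the snippet above, after the 'prob' probplot call:
# ax1.set_ylim(lo, hi)
```

Because `probax='y'` in the original call, the limits go on the y-axis; with `probax='x'` the equivalent call would be `ax1.set_xlim(lo, hi)`.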

Thanks,
Joe

[Image: NPP_prob_qq]

jkmccaughey, Jan 07 '22 22:01

Here's a more useful plot.
Joe

[Image: NPP_v2]

jkmccaughey, Jan 07 '22 23:01

Thanks for the report. N = 100 sits right on the fence of where the automatic limits are set.

Should be pretty easy to fix.
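To illustrate how a sample size can land on that fence, here is a rough sketch of the arithmetic. It assumes Cunnane-style plotting positions (alpha = beta = 0.4) and axis limits of 1% and 99%; both values are assumptions chosen for illustration and are not stated anywhere in this thread. `count_clipped` is a hypothetical helper, not a probscale function.

```python
def plotting_positions(n, alpha=0.4, beta=0.4):
    """Plotting positions in percent for ranks 1..n (assumed Cunnane-style)."""
    return [100.0 * (i - alpha) / (n + 1.0 - alpha - beta) for i in range(1, n + 1)]


def count_clipped(n, lower=1.0, upper=99.0):
    """Count positions that fall outside the assumed axis limits (percent)."""
    return sum(1 for p in plotting_positions(n) if p < lower or p > upper)


for n in (50, 100):
    positions = plotting_positions(n)
    print(
        f"N = {n}: lowest = {positions[0]:.3f}%, "
        f"highest = {positions[-1]:.3f}%, clipped = {count_clipped(n)}"
    )
```

Under these assumptions, the smallest and largest of 100 points sit just outside the 1% to 99% window, while a sample of 50 fits entirely inside it.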

phobson avatar Feb 08 '22 21:02 phobson