Fix violinplot crash on empty datasets (#31700) by rahulrathnavel · Pull Request #31707 · matplotlib/matplotlib

rahulrathnavel · 2026-05-19T12:37:09Z

PR summary

This PR fixes a bug where passing an empty dataset to violinplot causes it to crash (ValueError: zero-size array to reduction operation minimum), whereas boxplot handles the exact same scenario gracefully by simply drawing nothing.

Reasoning for this implementation:
I updated cbook.violin_stats to check if the input dataset is empty. If it is, it bypasses the min/max/KDE math operations and returns an empty stats dictionary for that specific dataset. I also added a safeguard in axes.violin to prevent width-scaling calculations on empty density arrays.

This allows violinplot to safely skip rendering violins for empty datasets, perfectly mirroring the resilient behavior of boxplot. I have also included a regression test to ensure this remains fixed.
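
A minimal sketch of the guard described above, written with the standard library only. `summarize` is a hypothetical stand-in for the per-dataset loop, not Matplotlib's `cbook.violin_stats`; it omits the KDE step entirely and only illustrates how bailing out before the min/max reductions avoids an error on empty input.

```python
import math
import statistics


def summarize(datasets):
    """Hypothetical stand-in for the per-dataset loop; not Matplotlib's code."""
    results = []
    for data in datasets:
        values = list(data)
        if len(values) == 0:
            # min()/max() would raise ValueError on an empty sequence,
            # so emit NaN placeholders instead of computing anything.
            results.append({"mean": math.nan, "median": math.nan,
                            "min": math.nan, "max": math.nan})
        else:
            results.append({"mean": statistics.mean(values),
                            "median": statistics.median(values),
                            "min": min(values), "max": max(values)})
    return results


stats = summarize([[1.0, 2.0, 3.0], []])
print(stats[0]["median"], math.isnan(stats[1]["min"]))
```

Each input dataset still yields one entry in the result, so the positions of the non-empty datasets are preserved.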

AI Disclosure

I used an AI assistant strictly to help navigate the codebase, locate the specific statistics functions in cbook.py and _axes.py, and draft the boilerplate for the pytest. The core logic was manually reviewed, applied, and tested locally to ensure complete compliance with Matplotlib's standards.

PR checklist

"closes [Bug]: violinplot crashes on empty datasets #31700" is in the body of the PR description to link the related issue
new and changed code is tested
[N/A] Plotting related features are demonstrated in an example
[N/A] New Features and API Changes are noted with a directive and release note
[N/A] Documentation complies with general and docstring guidelines

rahulrathnavel · 2026-05-20T04:44:06Z

Hi @story645! The GitHub UI threw an error when I tried to accept the commit suggestion directly (I don't know why the "Apply suggestion" button did not work for me), so I pulled the branch and applied both of your changes manually.

_axes.py now uses the stricter > 0 constraint, and cbook.py has been updated to use the exact same 'append up here and mutate below' pattern as boxplot_stats. Let me know if everything looks good to go now!
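
A rough illustration of the kind of length guard mentioned for the width scaling, using plain Python lists. `scale_half_widths` is a hypothetical helper name, not a function in `_axes.py`; the sketch assumes the peak density is positive whenever the list is non-empty.

```python
def scale_half_widths(density, width):
    """Hypothetical helper: scale density values to violin half-widths."""
    density = list(density)
    if len(density) > 0:
        # Assumes peak > 0; a zero peak would need separate handling.
        peak = max(density)
        return [0.5 * width * v / peak for v in density]
    # Empty density: nothing to draw, so skip the division by the peak.
    return []


print(scale_half_widths([1.0, 2.0, 4.0], 0.5))
print(scale_half_widths([], 0.5))
```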

rahulrathnavel · 2026-05-20T07:04:55Z

Hi @timhoffm, great one with the [np.nan, np.nan] edge case!

I've updated cbook.py so the NaN and inf stripping logic happens before the len(x) == 0 bailout check. The empty dataset dictionary now safely populates and avoids the crash even if the array initially contained only NaNs.

Also, just a heads-up: it looks like the AppVeyor Windows check failed right at the end of its run due to a Windows temp file PermissionError during teardown, but the actual pytest suite passed. Let me know if everything else looks good to go, and what to do next!
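
The ordering described in this comment (strip non-finite values first, then test for emptiness) can be sketched with the standard library. `finite_only` and `is_effectively_empty` are hypothetical names used only for illustration; the PR itself operates on NumPy arrays.

```python
import math


def finite_only(values):
    """Drop NaN and +/-inf values, keeping the original order."""
    return [v for v in values if math.isfinite(v)]


def is_effectively_empty(values):
    # Filter first, then test the length. Checking the length first would
    # miss inputs such as [nan, nan], which are non-empty before filtering.
    return len(finite_only(values)) == 0


print(is_effectively_empty([math.nan, math.nan]))
print(is_effectively_empty([math.inf, 1.0]))
```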

timhoffm · 2026-05-20T09:41:46Z

+        # note tricksiness, append up here and then mutate below
+        vpstats.append(stats)
+


This is unconventional and makes the code harder to reason about. Instead, put the calculation into an else block:

    if len(x) == 0:
        # empty stats
    else:
        # calculate stats
    vpstats.append(stats)

The way boxplot does it is to put a continue at the end of the empty case. Might make sense here too?

Hi @story645! I actually just refactored this loop into the strict if/else block that @timhoffm suggested above, since it lets us avoid the early append and the continue statement entirely.
The code is pushed and the linters are perfectly green! Let me know if you are both happy with this if/else structure, or if you'd prefer I switch it to the continue pattern!

I think the continue pattern is better b/c then you don't have a giant indent block for the else that you need to keep track of. That's presumably why it's used in boxplots

I see this differently: The "early return" block is almost as long as the regular block, because the majority of the work is identical: configuring stats values and appending to vpstats. IMHO it's beneficial for readability to reflect this parallelism in an if / else block with equal indentation for both cases.

On a more general note, the code is a bit fragmented and cluttered with extra variables. Directly appending a dict literal would be much cleaner:

    for (x, quantile) in zip(X, quantiles):
        x = np.asarray(x)
        x = x[~(np.isnan(x) | np.isinf(x))]
        if len(x) == 0:
            vpstats.append({
                'vals': np.array([]),
                'coords': np.array([]),
                'mean': np.nan,
                'median': np.nan,
                'min': np.nan,
                'max': np.nan,
                'quantiles': np.array([]),
            })
        else:
            min_val = np.min(x)
            max_val = np.max(x)
            coords = np.linspace(min_val, max_val, points)
            vpstats.append({
                'vals': method(x, coords),
                'coords': coords,
                'mean': np.mean(x),
                'median': np.median(x),
                'min': min_val,
                'max': max_val,
                'quantiles': np.atleast_1d(np.percentile(x, 100 * quantile))
            })

But I'm not going to fight over this.
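
For readers weighing the two loop shapes debated in this thread, here is a small side-by-side sketch. Both functions are hypothetical toy versions (they record only a count and a maximum, not the real violin statistics) and are meant to show that the `continue` form and the if/else form produce the same result, so the choice is about readability.

```python
def collect_with_continue(datasets):
    out = []
    for data in datasets:
        if len(data) == 0:
            out.append({"n": 0, "max": None})
            continue  # early exit keeps the main path unindented
        out.append({"n": len(data), "max": max(data)})
    return out


def collect_with_else(datasets):
    out = []
    for data in datasets:
        if len(data) == 0:
            out.append({"n": 0, "max": None})
        else:
            # Same work as above, indented to mirror the empty branch.
            out.append({"n": len(data), "max": max(data)})
    return out


sample = [[3, 1], [], [7]]
print(collect_with_continue(sample) == collect_with_else(sample))
```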

timhoffm · 2026-05-20T10:12:45Z

-        max_val = np.max(x)
-        quantile_val = np.percentile(x, 100 * q)
+        x = np.asarray(x)
+        x = x[~(np.isnan(x) | np.isinf(x))]


This should be documented.

@rahulrathnavel sorry for not being precise. I meant documenting in the docstring (Parameter X) not a code comment.

Hi @timhoffm, that makes total sense! No worries at all; I misunderstood what you meant earlier, and that was a weak interpretation on my side.

I have removed the inline code comment and moved the explanation up into the public docstring for parameter X in violin_stats so users know that NaN and infinite values are automatically stripped.

The code is pushed and the CI checks are running now. Let me know if the wording looks good to you! Eagerly waiting to hear from you!

rahulrathnavel · 2026-05-21T01:37:47Z

Hi @timhoffm and @story645, thank you both so much for talking through the design and for the phenomenal mentorship!

I have implemented the dict literal snippet exactly as requested, added the documentation comment for the NaN-stripping logic, and the CI checks are now 100% green! 😁

Since this is one of my very first open-source contributions, I really appreciate your patience and guidance in helping me get the code structure and standards just right. I am super excited and looking forward to seeing this merged! Let me know if you need absolutely anything else from me.

rahulrathnavel · 2026-05-22T19:18:36Z

Thanks so much for the thorough review and approval of one of my first PRs here, @timhoffm!
Also, just a quick heads-up: I separated the commits on the clabel fix over in #31706, as you suggested earlier. It is completely isolated now and ready whenever you have a chance to take a look. Absolutely no rush at all; I'm just a bit excited 😅!

rahulrathnavel · 2026-05-28T02:16:12Z

@scottshambaugh thanks. I have pushed a commit that updates cbook.py to use delete_masked_points, adds the docstring for Axes.violinplot, and includes the API release note as you suggested. Let me know whether this is all right or needs any improvements or corrections!
@timhoffm all tests passed successfully; waiting for the code to be approved and merged! 😄

scottshambaugh

Looks good, thanks for working this! I'll merge once CI finishes.

Marking for 3.12 due to the API change but I'd be okay with 3.11.1 if someone feels otherwise.

rahulrathnavel · 2026-05-28T17:08:52Z

Looks good, thanks for working this! I'll merge once CI finishes.

😄 To be honest, that's the best thing I have heard so far today.
I'm lucky to end my day by seeing the code get merged; super excited.
Really, thanks for your patience and guidance (though I made many mistakes).
I hope I can learn more from you all. I'm very happy to work like this and look forward to hearing more from you, @scottshambaugh.
@timhoffm, special thanks to you too.

scottshambaugh · 2026-05-28T17:21:24Z

Congrats on your first contribution to matplotlib @rahulrathnavel! We hope to see you again.

QuLogic · 2026-05-28T19:01:49Z

Is this both a bug fix and new feature? The linked issue seems to suggest the former, so wondering if this can go into 3.11?

story645 · 2026-05-28T19:14:13Z

Is this both a bug fix and new feature?

I think more bugfix than new feature since boxplot already works on empty datasets.

QuLogic · 2026-05-29T04:47:27Z

@meeseeksdev backport to v3.11.x

Fix violinplot crash on empty datasets (matplotlib#31700)

0644d90

story645 reviewed May 19, 2026

Comment thread lib/matplotlib/cbook.py Outdated

story645 reviewed May 19, 2026

Comment thread lib/matplotlib/axes/_axes.py Outdated

Apply reviewer feedback: stricter len check and align vpstats append logic

ca14be6

timhoffm reviewed May 20, 2026

Comment thread lib/matplotlib/tests/test_axes.py Outdated

rahulrathnavel and others added 2 commits May 20, 2026 10:55

Update lib/matplotlib/tests/test_axes.py

da0dcf1

Co-authored-by: Tim Hoffmann <2836374+timhoffm@users.noreply.github.com>

Fix violin_stats bailout to handle all-NaN datasets

3a2fb8e

timhoffm reviewed May 20, 2026

rahulrathnavel added 2 commits May 20, 2026 15:20

Refactor violin_stats empty dataset check into if/else block

d091ad5

Fix Ruff W293: Remove trailing whitespace on blank lines in cbook.py

978716a

melissawm added the topic: plotting methods label May 20, 2026

Refactor violin_stats to use early exit continue pattern

b4e85d5

story645 reviewed May 20, 2026

Comment thread lib/matplotlib/cbook.py Outdated

timhoffm reviewed May 20, 2026

Refactor violin_stats to use dict literals and document NaN stripping

3170944

Move NaN-stripping documentation to X parameter docstring

c41ac1b

timhoffm reviewed May 22, 2026

Comment thread lib/matplotlib/cbook.py Outdated

Update lib/matplotlib/cbook.py

968d82d

Co-authored-by: Tim Hoffmann <2836374+timhoffm@users.noreply.github.com>

timhoffm approved these changes May 22, 2026

scottshambaugh reviewed May 27, 2026

Comment thread lib/matplotlib/cbook.py

Comment thread lib/matplotlib/cbook.py Outdated

MAINT: Use delete_masked_points and document NaN handling for violinplot

8636991

timhoffm reviewed May 28, 2026

Comment thread lib/matplotlib/axes/_axes.py Outdated

Comment thread doc/api/next_api_changes/behavior/violinplot_empty.rst Outdated

Comment thread lib/matplotlib/cbook.py Outdated

rahulrathnavel and others added 3 commits May 28, 2026 18:04

Update lib/matplotlib/axes/_axes.py

4d526c0

Co-authored-by: Tim Hoffmann <2836374+timhoffm@users.noreply.github.com>

Update doc/api/next_api_changes/behavior/violinplot_empty.rst

4c34554

Co-authored-by: Tim Hoffmann <2836374+timhoffm@users.noreply.github.com>

Update lib/matplotlib/cbook.py

786cfea

Co-authored-by: Tim Hoffmann <2836374+timhoffm@users.noreply.github.com>

timhoffm approved these changes May 28, 2026

scottshambaugh reviewed May 28, 2026

Comment thread doc/api/next_api_changes/behavior/violinplot_empty.rst Outdated

Update doc/api/next_api_changes/behavior/violinplot_empty.rst

a42716f

Co-authored-by: Scott Shambaugh <14363975+scottshambaugh@users.noreply.github.com>

scottshambaugh approved these changes May 28, 2026

scottshambaugh added this to the v3.12.0 milestone May 28, 2026

scottshambaugh merged commit 5c55704 into matplotlib:main May 28, 2026
34 of 37 checks passed

QuLogic modified the milestones: v3.12.0, v3.11.0 May 29, 2026

meeseeksmachine mentioned this pull request May 29, 2026

Backport PR #31707 on branch v3.11.x (Fix violinplot crash on empty datasets (#31700)) #31774

Merged

timhoffm pushed a commit that referenced this pull request May 29, 2026

Backport PR #31707: Fix violinplot crash on empty datasets (#31700)

22d1187

timhoffm added a commit that referenced this pull request May 29, 2026

Merge pull request #31774 from meeseeksmachine/auto-backport-of-pr-31707-on-v3.11.x

2021faa

Backport PR #31707 on branch v3.11.x (Fix violinplot crash on empty datasets (#31700))
