Why do the values change when I filter the dataframe based on the size of the set? #216

Closed
gabriel-thiaguinho opened this issue Mar 30, 2023 · 1 comment

gabriel-thiaguinho commented Mar 30, 2023

Hi. I am using the library and have run into a problem. When I filter a dataframe based on the size of each set, the intersection sizes shown in the plot for two or more sets change a lot, and I can't figure out why. The images below show this behavior: the first uses all the sets, and the second uses only the sets with more than 230k elements. The filtering happens before calling `from_contents`.
Is this a problem with the library, or a problem between the chair and the computer (haha)?
[Image: overlap_audiencias]
[Image: overlap_audiencias_filtrada]

@jnothman
Owner

It sounds like you should be filtering inside UpSetPlot rather than before calling it, but it's hard to tell without seeing your dataframe structure and code.
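To illustrate why filtering beforehand can shift the numbers, here is a minimal sketch in plain Python. It assumes the plot counts each element under the exact combination of sets it belongs to, which is how UpSet-style plots are usually read. The set names, contents, and the `exclusive_counts` helper are hypothetical and are not part of the library; the original dataframe and code were not posted, so this may not match the actual case.

```python
from collections import Counter


def exclusive_counts(contents):
    """Count elements by the exact combination of sets they belong to."""
    membership = {}
    for name, elements in contents.items():
        for element in elements:
            membership.setdefault(element, set()).add(name)
    return Counter(frozenset(names) for names in membership.values())


# Hypothetical toy data standing in for the real sets.
contents = {
    "A": {1, 2, 3, 4},
    "B": {3, 4, 5},
    "C": {4, 6},
}

# Counts with every set present.
full = exclusive_counts(contents)

# Counts after dropping small sets *before* computing memberships.
large_only = {name: elems for name, elems in contents.items() if len(elems) >= 3}
filtered = exclusive_counts(large_only)

print(full[frozenset({"A", "B"})])      # elements in exactly A and B, all sets kept
print(filtered[frozenset({"A", "B"})])  # same bar after dropping C
```

With all three sets, element 4 is counted under A, B and C together. Once C is dropped beforehand, element 4 is counted under A and B only, so that bar grows, and element 6 disappears from the data entirely. upsetplot's `UpSet` class has options such as `min_subset_size` for hiding small intersections at plot time without changing memberships; whether that corresponds to the filtering intended here is an assumption.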
