Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ACM dataset preprocess #57

Open
JianwuZheng413 opened this issue Jun 28, 2024 · 1 comment
Open

ACM dataset preprocess #57

JianwuZheng413 opened this issue Jun 28, 2024 · 1 comment

Comments

@JianwuZheng413
Copy link

          Hi 

there are 4025 papers from KDD, SIGMOD,SIGCOMM, MobiCOMM, and VLDB, but for having balanced classes they randomly select 994 papers from database class (SIGCOMM and MOBICOMM) from 1994 papers.

Originally posted by @shirinmous in #10 (comment)

@JianwuZheng413
Copy link
Author

hi,thanks for your answers. But the data in the code I downloaded from you is different from what you said. SIGCOMM and MOBICOMM do not have 1993 papers. But SIGMOD and VLDB have 1,994 papers. Are you referring to the random selection of 994 papers from SIGMOD and VLDB, in proportion to the random selection?
Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant