-
Notifications
You must be signed in to change notification settings - Fork 220
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ACM dataset preprocess #57
Comments
hi,thanks for your answers. But the data in the code I downloaded from you is different from what you said. SIGCOMM and MOBICOMM do not have 1993 papers. But SIGMOD and VLDB have 1,994 papers. Are you referring to the random selection of 994 papers from SIGMOD and VLDB, in proportion to the random selection? |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
there are 4025 papers from KDD, SIGMOD,SIGCOMM, MobiCOMM, and VLDB, but for having balanced classes they randomly select 994 papers from database class (SIGCOMM and MOBICOMM) from 1994 papers.
Originally posted by @shirinmous in #10 (comment)
The text was updated successfully, but these errors were encountered: