Preproc/aggregate #17

Merged
merged 15 commits into from
Dec 3, 2021

atrostan
Owner

No description provided.

add command example to build sbt;
add partitioning, preprocessing scripts that execute Compressor and Partitioner JARs programmatically;
add README explaining how to build Compressor and Partitioner JARs using intellij;
use Scala Either to differentiate compression, partitioning of weighted vs. unweighted edge lists;
add compression tests; Partitioning tests remain to be done;
add test resources for CompressorTest;
parse paths of worker/partitions nodes from yml;
add rmat, weighted email graph;
add types for rdd row for mains, mirrors;
add aggregation driver that unions partitioned edgelists,
assigns mains, mirrors to partitions,
and saves to partition paths;
add rdd caching to compression script;
remove hardcoded spark config;
pass spark config in preprocess script;
add tagged neighbours for mains, mirrors rdds;
modify entity managers to initialize using partition specific mains, mirrors;
modify VertexEntity neighbors to include edge weights;
tag edges' src, dests with type:
main -> main, main -> mrr, mrr -> main, or mrr -> mrr;
reformat, autoindent partition.util;
add symmetric rmat graph for testing colouring;
shutdown actorsystems in cluster sharding app;
clean up entitymanager;
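
For reviewers, the edge tagging mentioned in the commits above (main -> main, main -> mrr, mrr -> main, mrr -> mrr) can be pictured roughly as follows. This is only a sketch: `VertexKind`, `TaggedEdge`, `tagEdges`, and the rule "any endpoint that is not a main of this partition is a mirror" are assumptions made for illustration, not the names or logic used in this PR.

```scala
// Illustrative sketch only; all names here are hypothetical.
object EdgeTagSketch {
  sealed trait VertexKind
  case object Main   extends VertexKind
  case object Mirror extends VertexKind

  // An edge whose endpoints each carry a main/mirror tag.
  final case class TaggedEdge(src: Long, srcKind: VertexKind, dst: Long, dstKind: VertexKind)

  // Assumption: `mainsInPartition` holds the vertices whose main copy lives in
  // this partition; every other endpoint is treated as a mirror.
  def tagEdges(edges: Seq[(Long, Long)], mainsInPartition: Set[Long]): Seq[TaggedEdge] = {
    def kind(v: Long): VertexKind = if (mainsInPartition(v)) Main else Mirror
    edges.map { case (s, d) => TaggedEdge(s, kind(s), d, kind(d)) }
  }
}
```

With mains `Set(1L, 2L)`, the edge `(2L, 3L)` would come out tagged as main -> mrr under these assumptions.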
@atrostan
Owner Author

atrostan commented Nov 29, 2021

PR #14 can be ignored: this PR fixes some jar-building issues that are present in PR #14. Once we merge into main, I'll delete PR #14.

  • add 1d, 2d, hybrid partitioners; driver
  • add partitioning, preprocessing scripts that execute Compressor and Partitioner JARs programmatically;
  • add README explaining how to build Compressor and Partitioner JARs using assembly;
  • use Scala Either to differentiate compression, partitioning of weighted vs. unweighted edge lists;
  • add compression tests; Partitioning tests remain to be done;
  • add test resources for CompressorTest;
  • add spark SQL, dataframes;
  • parse paths of worker/partitions nodes from yml;
  • add symmetric rmat, weighted email graph;
  • add types for rdd row for mains, mirrors;
  • add aggregation driver that unions partitioned edgelists, assigns mains, mirrors to partitions, and saves to partition paths;
  • add rdd caching to compression script;
  • revise assembly merge strategy for uber-jar assembly;
  • remove hardcoded spark config;
  • pass spark config in preprocess script;
  • add tagged neighbours for mains, mirrors rdds;
  • modify entity managers to initialize using partition specific mains, mirrors;
  • modify VertexEntity neighbors to include edge weights;
  • tag edges' src, dests with type:
    • main -> main, main -> mrr, mrr -> main, or mrr -> mrr;
  • reformat, autoindent partition.util;
  • shutdown actorsystems in cluster sharding app;
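
To make the `Either` item in the list above more concrete, here is a minimal sketch of how one value can carry either an unweighted or a weighted edge list through the same pipeline. The type aliases, the `parse` and `compress` helpers, and the reading of "compression" as remapping vertex ids to a dense 0..n-1 range are all assumptions for illustration; the Compressor in this PR may work differently.

```scala
// Illustrative sketch only; names and the meaning of "compress" are assumptions.
object EdgeListSketch {
  type UnweightedEdge = (Long, Long)
  type WeightedEdge   = (Long, Long, Double)

  // Left = unweighted edge list, Right = weighted edge list.
  type EdgeList = Either[Seq[UnweightedEdge], Seq[WeightedEdge]]

  // Parse whitespace-separated lines: "src dst" or "src dst weight".
  // Blank lines and lines starting with '#' are skipped.
  def parse(lines: Seq[String], weighted: Boolean): EdgeList = {
    val rows = lines
      .map(_.trim)
      .filter(l => l.nonEmpty && !l.startsWith("#"))
      .map(_.split("\\s+"))
    if (weighted) Right(rows.map(r => (r(0).toLong, r(1).toLong, r(2).toDouble)))
    else Left(rows.map(r => (r(0).toLong, r(1).toLong)))
  }

  // Remap vertex ids to 0..n-1 in ascending id order, keeping weights when present.
  def compress(edges: EdgeList): EdgeList = {
    val ids: Seq[Long] = edges match {
      case Left(es)  => es.flatMap { case (s, d) => Seq(s, d) }
      case Right(es) => es.flatMap { case (s, d, _) => Seq(s, d) }
    }
    val remap: Map[Long, Long] =
      ids.distinct.sorted.zipWithIndex.map { case (id, i) => id -> i.toLong }.toMap
    edges match {
      case Left(es)  => Left(es.map { case (s, d) => (remap(s), remap(d)) })
      case Right(es) => Right(es.map { case (s, d, w) => (remap(s), remap(d), w) })
    }
  }
}
```

The point of the design is that callers pattern match once on `Left`/`Right` instead of duplicating the whole pipeline for the weighted and unweighted cases.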

Apologies for the large PR; there's a lot to merge.

add nnodes nnedges to preprocess.sh;
add testColor graph;
add coloring correctness fn to cluster sharding app;
remove worker yaml read from agg. driver;
stop export yaml with n, m after compression;
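
As a rough picture of what a coloring correctness check involves: a coloring is proper when every vertex that appears in an edge has a colour and no edge joins two vertices of the same colour. The sketch below assumes a plain in-memory edge list and colour map; `isProperColoring` and its signature are hypothetical and not the function added to the cluster sharding app.

```scala
// Illustrative sketch only; the function name and inputs are hypothetical.
object ColoringCheckSketch {
  // Returns true when both endpoints of every edge are coloured and differ
  // in colour. Self-loops are skipped (an assumption), since no colouring
  // could ever satisfy them.
  def isProperColoring(edges: Seq[(Long, Long)], colour: Map[Long, Int]): Boolean =
    edges.forall { case (u, v) =>
      u == v || ((colour.get(u), colour.get(v)) match {
        case (Some(cu), Some(cv)) => cu != cv
        case _                    => false // an uncoloured endpoint fails the check
      })
    }
}
```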
@atrostan atrostan merged commit b46a2a0 into main Dec 3, 2021