[BEAM-649] Analyse DAG to determine if RDD/DStream has to be cached or not #1739
Conversation
R: @amitsela
Refer to this link for build results (access rights to CI server needed):
I've added some comments and I will re-iterate after you push the changes, thanks!
}
}
// update cache candidates with node outputs
for (TaggedPValue output : node.getOutputs()) {
You shouldn't iterate over outputs, since a PCollection (RDD/DStream) needs to be cached only if it is used as an input to more than one transformation, so that it won't be evaluated again all the way through its lineage.
The cache-candidate PCollection should be looked for as the output of a transformation in the Evaluator, since we want to cache after it is first evaluated (so the evaluator that creates this RDD/DStream will know it should cache at the end of the evaluation).
// if the input or output of the node (aka transform) is already known in the cache
// candidates map, and it appears more than one time, then we enable caching
// considering node input for caching
for (TaggedPValue input : node.getInputs()) {
Here this should be removed since you're looking for the output.
@@ -104,6 +105,9 @@ public void evaluate(Flatten.FlattenPCollectionList<T> transform, EvaluationCont
}
unionRDD = context.getSparkContext().union(rdds);
}
if (cacheHint) { |
This comment is for all following:
if (cacheHint) {
rdd.cache();
}
and
if (cacheHint) {
dstream.cache();
}
I'd go for using the runner's Dataset so that we use the user-defined StorageLevel (batch). This will require slight changes, and will also get rid of the multiReads optimization in EvaluationContext, since this is a better cache optimization.
I'd also try something more fluent, like:
cacheHint ? rdd.cache() : rdd;
Though you'll probably hide it in Dataset anyway, so I'm not sure it will make much of a difference.
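The suggestion of hiding the cache decision inside the runner's Dataset can be sketched with a plain-Java stand-in (the class below is hypothetical, not the actual Beam Dataset): call sites pass the hint fluently, and the wrapper owns both the decision and the user-defined storage level.

```java
// Hypothetical stand-in for the runner's Dataset, showing the idea of
// moving the "if (cacheHint)" blocks out of every evaluator call site.
class DatasetSketch {

    private boolean cached = false;
    private final String storageLevel; // user-defined, e.g. "MEMORY_ONLY"

    DatasetSketch(String storageLevel) {
        this.storageLevel = storageLevel;
    }

    // Fluent: callers chain instead of branching. In the real runner this
    // would delegate to rdd.persist(...) with the configured StorageLevel.
    DatasetSketch cache(boolean cacheHint) {
        if (cacheHint) {
            cached = true;
        }
        return this;
    }

    boolean isCached() {
        return cached;
    }

    String getStorageLevel() {
        return storageLevel;
    }
}
```

With this shape, the per-evaluator `if (cacheHint) { rdd.cache(); }` blocks collapse into a single call such as `dataset.cache(cacheHint)`.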
Good point. I will update that way.
Are you going to leave the ifs? Inline them? Or move them into Dataset?
@@ -74,7 +77,8 @@ public JavaStreamingContext create() {
JavaSparkContext jsc = SparkContextFactory.getSparkContext(options);
JavaStreamingContext jssc = new JavaStreamingContext(jsc, batchDuration);
ctxt = new EvaluationContext(jsc, pipeline, jssc);
pipeline.traverseTopologically(new SparkRunner.Evaluator(translator, ctxt));
pipeline.traverseTopologically(new SparkRunner.Evaluator(translator, ctxt,
    new HashMap<PCollection, Long>()));
This should be the map you populated in the pre-visit, no?
/**
 * Test BEAM-1206 (consequence of BEAM-649).
 */
public class WritingSinkTest {
As discussed, this is a good test specifically for BEAM-1206, but it doesn't directly test BEAM-649. I will try to come up with an idea for how to test this properly.
Thinking of it now, this should have been caught by ROS (RunnableOnService) tests for the Spark runner... Since Write is tested and probably has tests, I assume (and from a quick look I might be right) that we're missing an HDFSWriter ROS test.
You can open a separate ticket for this, I guess.
As for this test, we should probably keep it as long as there's no ROS test for it, but you should assert the result by reading the output file, right?
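The assertion style suggested above, checking what the sink actually wrote rather than only that the pipeline ran, can be sketched with plain java.nio (the class and file names here are illustrative, not from the PR):

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

// Sketch of asserting a sink's result by reading the output file back.
class OutputFileAssertSketch {

    static List<String> readOutput(Path outputFile) throws IOException {
        return Files.readAllLines(outputFile);
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for the sink: write what the pipeline would have written.
        Path out = Files.createTempFile("sink-output", ".txt");
        Files.write(out, List.of("foo", "bar"));

        // The test then compares file contents against the expected elements.
        List<String> lines = readOutput(out);
        if (!lines.equals(List.of("foo", "bar"))) {
            throw new AssertionError("unexpected sink output: " + lines);
        }
        System.out.println("output verified: " + lines);
    }
}
```

In the real test this read-back would replace an assertion that merely checks the pipeline completed.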
I updated the PR by populating cache candidates with …
Refer to this link for build results (access rights to CI server needed):
Rebased to integrate #1747 and add a specific test.
Refer to this link for build results (access rights to CI server needed): Build result: FAILURE. 1 Checkstyle violation in beam-runners-spark (org.apache.maven.plugin.MojoFailureException); after correcting, resume the build with mvn -rf :beam-runners-spark. Setting status of 822f675 to FAILURE.
See comments, thanks!
if (cacheCandidates.get(value) != null) {
  count = cacheCandidates.get(value) + 1;
}
if (value.getName().equals("Write/WriteBundles.out")) {
Probably can remove the hack now...
/**
 * Tests for translation of side inputs in the Spark Runner.
 */
public class SideInputTest {
Side inputs are tested by ROS, so why do we need this? (I might have been supportive of this at some point and don't remember why, so excuse me if so 😉)
Fixed checkstyle and removed the …
Refer to this link for build results (access rights to CI server needed):
@amitsela Let's chat about it tomorrow.
Refer to this link for build results (access rights to CI server needed):
I will do the changes in two steps: …
Sounds good.
Rebased and updated with the cacheHint dealt with in the …
Refer to this link for build results (access rights to CI server needed):
Rebasing to deal with conflicts.
Rebased and resolved conflicts. I just have to complete the …
Run Spark RunnableOnService
Refer to this link for build results (access rights to CI server needed): Failed Tests: 1 (beam_PostCommit_Java_RunnableOnService_Spark / org.apache.beam:beam-runners-spark: 1)
Run Spark RunnableOnService
Refer to this link for build results (access rights to CI server needed):
Refer to this link for build results (access rights to CI server needed): Failed Tests: 1 (beam_PostCommit_Java_RunnableOnService_Spark / org.apache.beam:beam-runners-spark: 1)
@amitsela I don't think …
Run Spark RunnableOnService
Refer to this link for build results (access rights to CI server needed): Failed Tests: 1 (beam_PostCommit_Java_RunnableOnService_Spark / org.apache.beam:beam-runners-spark: 1)
@jbonofre you're right, it has flaked a lot these past few days - Jenkins is terribly unstable anyway...
Squashed. Ready to do cosmetic improvements ;)
Run Spark RunnableOnService
I've added a bunch of nits about naming, codestyle, etc.
Feel free to merge the PR after addressing them.
Thanks!
/**
 * Options used in this pipeline runner.
 */
private final SparkPipelineOptions mOptions;

private SparkPipelineTranslator translator;
This can be final, and a local variable in run(), just before declaring final ExecutorService executorService = Executors.newSingleThreadExecutor();
translator = new StreamingTransformTranslator.Translator(
    new TransformTranslator.Translator());
updateCacheCandidates(pipeline, translator,
    contextFactory.getEvaluationContext());
no real need for this to be in a new line.
final JavaSparkContext jsc = SparkContextFactory.getSparkContext(mOptions);
final EvaluationContext evaluationContext = new EvaluationContext(jsc, pipeline);
final EvaluationContext evaluationContext =
    new EvaluationContext(jsc, pipeline);
no need for new line here as well
@@ -254,6 +268,17 @@ private void detectTranslationMode(Pipeline pipeline) {
}

/**
 * Evaluator that update/populate the cache candidates.
 */
private void updateCacheCandidates(Pipeline pipeline,
Constructor parameters on a new line with 4-space indent - see for example here.
SparkPipelineTranslator translator,
EvaluationContext evaluationContext) {
CacheVisitor updater =
    new CacheVisitor(translator, evaluationContext);
No need for the newline. updater should be renamed.
JavaRDD<WindowedValue<T>> rdd =
    getSparkContext().parallelize(CoderHelpers.toByteArrays(elems, windowCoder))
        .map(CoderHelpers.fromByteFunction(windowCoder));
// create a BoundedDataset that would create a RDD on demand
This comment belongs in the else clause.
@@ -231,4 +252,8 @@ private String storageLevel() {
  return runtime.getPipelineOptions().as(SparkPipelineOptions.class).getStorageLevel();
}

public Map<PCollection, Long> getCacheCandidates() {
Pull this up to where all the public methods are, and add Javadoc.
Also, could you do me a favour and remove:
/**
* Retrieves an iterable of results associated with the PCollection passed in.
*
* @param pcollection Collection we wish to translate.
* @param <T> Type of elements contained in collection.
* @return Natively types result associated with collection.
*/
<T> Iterable<T> get(PCollection<T> pcollection) {
Iterable<WindowedValue<T>> windowedValues = getWindowedValues(pcollection);
return Iterables.transform(windowedValues, WindowingHelpers.<T>unwindowValueFunction());
}
it's unused and I always forget to remove it.
@Test
public void cacheCandidatesUpdaterTest() throws Exception {
  Pipeline pipeline = pipelineRule.createPipeline();
  PCollection pCollection = pipeline.apply(Create.of("foo", "bar"));
Please type the PCollection to avoid unnecessary warnings.
PCollection pCollection = pipeline.apply(Create.of("foo", "bar"));
// first read
pCollection.apply(Count.globally());
// second read
Comment: explain that the second apply would have to re-evaluate pCollection, or cache it to begin with.
JavaSparkContext jsc = SparkContextFactory.getSparkContext(pipelineRule.getOptions());
EvaluationContext ctxt = new EvaluationContext(jsc, pipeline);
SparkRunner.CacheVisitor updater =
Rename updater.
Refer to this link for build results (access rights to CI server needed): Failed Tests: 1 (beam_PostCommit_Java_RunnableOnService_Spark / org.apache.beam:beam-runners-spark: 1)
Refer to this link for build results (access rights to CI server needed): Build result: FAILURE. Coveralls report submission failed (MojoFailureException in coveralls-maven-plugin). Setting status of 843c44e to FAILURE.
Refer to this link for build results (access rights to CI server needed): Build result: FAILURE. Coveralls report submission failed (MojoFailureException in coveralls-maven-plugin). Setting status of af812e2 to FAILURE.
retest this please
Refer to this link for build results (access rights to CI server needed): Failed Tests: 1 (beam_PreCommit_Java_MavenInstall / org.apache.beam:beam-runners-spark: 1)
retest this please
Refer to this link for build results (access rights to CI server needed):
Be sure to do all of the following to help us incorporate your contribution quickly and easily:
- Make sure the pull request title is of the form [BEAM-<Jira issue #>] Description of pull request.
- Run mvn clean verify. (Even better, enable Travis-CI on your fork and ensure the whole test matrix passes.)
- Replace <Jira issue #> in the title with the actual Jira issue number, if there is one.
- If this is your first contribution, sign the Individual Contributor License Agreement.