Large-scale learning from data streams with apache samoa

N Kourtellis, G De Francisci Morales, A Bifet - Learning from Data Streams …, 2019 - Springer
… It features a pluggable architecture that allows it to run on several distributed stream
processing engines such as Apache Flink, Apache Storm, and Apache Samza. Apache SAMOA is …

Big data stream learning with SAMOA

A Bifet, GDF Morales - 2014 IEEE International Conference on …, 2014 - ieeexplore.ieee.org
… stream processing engines such as Storm, S4, and Samza. SAMOA is written in Java and
is available at https://samoa-project.net under the Apache Software License version 2.0. …

[PDF][PDF] SAMOA: scalable advanced massive online analysis.

GDF Morales, A Bifet - J. Mach. Learn. Res., 2015 - jmlr.org
… engines such as Storm, S4, and Samza. samoa is written in Java, is open source, and is
available at https://samoa-project.net under the Apache Software License version 2.0. Keywords: …

[PDF][PDF] Survey of distributed stream processing for large stream sources

S Kamburugamuve, G Fox, D Leake, J Qiu - Grids Ucs Indiana Edu, 2013 - infomall.org
… The upstream backup recovery method works only for task level failures in Samza. If a
broker node fails, Samza loses messages persisted in the file system and these cannot be …

Cloud computing platform based real-time processing for stream reasoning

HS Jung, CS Yoon, YW Lee, JW Park… - … Conference on Future …, 2017 - ieeexplore.ieee.org
… We use Apache Kafka, a message processing system, and Apache Storm, a real-time
distributed processing system, to overcome the constraints associated with real-time processing. …

[PDF][PDF] Data Centric Systems and Networking

M Schaarschmidt - cl.cam.ac.uk
Apache Samza [23] provides very similar functionality to Storm. Samza has a topological model
… An interesting feature of Samza is its approach to state management: Samza tasks come …

A taxonomy and survey of stream processing systems

X Zhao, S Garg, C Queiroz, R Buyya - … Architecture for Big Data and the …, 2017 - Elsevier
… For example, Apache Samza has provided a pluggable API to enable the users to integrate
… , S4, Spark Streaming, and Samza provide scalable systems that allow the enlargement of …

A survey of data stream processing tools

M Gorawski, A Gorawska, K Pasterak - Information Sciences and Systems …, 2014 - Springer
… Another stream-based application from Apache [1] aims at performing computation over
data streams by combining Apache Kafka’s messaging and Apache Hadoop NextGen …

[PDF][PDF] Liquid: Unifying Nearline and Offline Big Data Integration.

RC Fernandez, PR Pietzuch, J Kreps, N Narkhede… - CIDR, 2015 - lsds.doc.ic.ac.uk
… The processing layer is implemented using Apache Samza [30], a distributed stream
processing framework that follows a stateful processing paradigm [3, 4]. It executes ETL-like jobs …

[PDF][PDF] Hailstorm: Distributed Stream Processing with Exactly Once Semantics

T Dimson, M Ganjoo - scs.stanford.edu
… Unlike Storm and Samza, Hailstorm mandates that all streaming computations must be both
… Like Samza, it requires that all events must be initially stored as messages in Apache Kafka […