Co-authors: Yixing Zhang, Bingfeng Xia, Ke Wu, and Xinyu Liu
Since the Beam Samza runner was developed at LinkedIn in 2018, we have grown to 100+ Samza Beam jobs running in production. As our usage grew, we wanted to better understand how the Samza runner performs compared to other runners and identify areas of improvement. In general, for stream processing platforms,...
Apache Samza Articles
-
- Topics:
- Stream Processing
- Apache Samza
- Performance
- Benchmark
-
Over a decade ago, test strategies invested heavily in UI-driven tests. Backend and mid-tier services were tested using automated UI-based tests. While UI-based tests have certain merits, such as testing user flows, they are also time-consuming and fragile. The strong coupling of tests with UI caused several problems: Tests needed frequent modification due to...
- Topics:
- Apache Samza
- Stream Processing
- Testing
- Kafka
-
This post is the second in a series discussing asynchronous processing and multithreading in Apache Samza. In the previous post, we explored the design and architecture of the new AsyncStreamTask API and the asynchronous event loop. In this post, we study the performance of this feature using benchmark Samza jobs. Some of the interesting...
- Topics:
- Apache Samza
- Stream Processing
- Big Data
- Kafka
-
As part of the Apache Samza 0.11 release, we rebuilt Samza’s underlying event processing engine to use an asynchronous and parallel...
- Topics:
- Apache Samza
- Stream Processing
- Big Data
- Kafka
-
This post is the second in a series that discusses some of the hard problems in stream processing. In the previous post, we...
- Topics:
- Apache Samza
- Stream Processing
- Big Data
- Kafka
- Samza
-
We live in an age where we want to know relevant things happening around the world as soon as they happen; an age where digital...
- Topics:
- Apache Samza
- Stream Processing
- Big Data
- realtime
- Kafka
- Samza