Co-authors: Hunter Lee and Dru Pollini
LinkedIn was built to help professionals achieve more in their careers, and every day millions of people use our products to make connections, discover new opportunities, and get better at what they do. An important part of our mission is helping people find other professionals who are interested in the same things they...
Apache Helix Articles
- Topics: Apache Helix, Distributed Systems, ESPRESSO, Data
-
Co-authors: Junkai Xue and Hunter Lee
Stateless tasks are widely used in large-scale data processing systems. Many systems that rely on Apache Helix have requested that a stateless task management feature be added to it. Recently, our team decided to explore new ways to manage stateless tasks, in addition to our ongoing work to...
- Topics: Apache Helix, Distributed Systems, Data, Open Source
-
Editor's note: This blog has been updated.
Typical distributed data systems are clusters composed of a set of machines. If the dataset does not fit on a single machine, we usually shard the data into partitions, and each partition can have multiple replicas for fault tolerance. Partition management needs to ensure that replicas are distributed among machines as...
- Topics: Apache Helix, Big Data, Distributed Systems, Open Source
-
This post originally appeared as a contributed piece on The New Stack.
Distributed data systems are used in a variety of settings like...