At LinkedIn, we have many different monitoring systems, each with its own role and granularity, ranging from quarterly reports about the business as a whole to the lowest levels of system-specific latency and availability. However, these systems don't operate in a vacuum: sometimes, issues or changes that are flagged by one system will go on to cause problems in...

Posts by Alexander Pucher
- Topics:
- Pinot
- real time monitoring
- metrics
- Data
Auto-Scaling with Apache Helix and Apache YARN
Alexander Pucher September 24, 2013
At LinkedIn, many individual services work together to deliver a reliable and consistent end-user experience. Although each service handles a specialized set of responsibilities, they all share a common set of required features such as load balancing, dynamic reconfiguration, health monitoring, and fault detection. Last year we introduced Apache Helix, an...
- Topics:
- Cluster Management
- Distributed Systems
- Open Source