SRE Articles

  • Introducing and Open Sourcing shiv

    May 10, 2018

    At LinkedIn, we ship hundreds of command-line utilities to every machine in our data centers and to all of our employees’ workstations. The vast majority of these utilities are written in Python. In addition to developing these command-line utilities, we have hundreds of supporting libraries that are constantly being iterated on, with new versions published...

  • Evolution of Couchbase at LinkedIn

    May 1, 2018

    Author's note: My colleague, Michael Kehoe, wrote a blog post on the Couchbase Ecosystem at LinkedIn. I encourage you to read it if you haven’t already! The following aims to provide an updated perspective on Couchbase as it evolves into a standard caching platform at LinkedIn, written by a Couchbase SRE who has been working on Couchbase at LinkedIn since 2013...

  • The Makeup of Successful Geographically-Distributed SRE Teams: Part 2

    March 27, 2018

    In part one of this series, we discussed some of the key principles to consider when developing geographically distributed (GD) SRE teams. Similar to the first article, we’re leveraging the journey of LinkedIn’s SRE team as the point of reference for the topics discussed here in part two. Within this post, we’ll discuss growth planning, the challenges associated...

  • The Makeup of Successful Geographically-Distributed SRE...

    March 15, 2018

    Why geographically-distributed SRE teams? In today’s hyper-connected technological world, there is a need for geographically...

  • Project STAR*: Streamlining Our On-Call Process

    January 10, 2018

    Co-authors: Bef Ayenew and Adam Hobson

    Consider the following conversation that used to be typical at LinkedIn: "Folks, we may have an...

  • Automating Your Oncall: Open Sourcing Fossor and Ascii Etch

    December 14, 2017

    One of our sayings in Site Reliability Engineering (SRE) is that the goal of your job is to “automate yourself out of the job.” While...