Guaranteeing that our servers are continually upgraded to secure and vetted operating systems is one major step that we take to ensure our members and customers can access LinkedIn to look for new roles, access new learning programs, or exchange knowledge with other professionals. LinkedIn has quite a large fleet of on-premises servers that depend on internal...
SRE Articles
-
- Topics:
- infrastructure,
- operations,
- SRE
-
Saira joined our Bangalore site reliability engineering (SRE) team to tackle large-scale site engineering challenges and grow. She highlights for us the impactful work she found here, from ushering in LinkedIn’s next-generation server query system that runs over a fleet of 350,000 servers to mentoring the next generation of female engineers: In my...
-
Co-authors: Hengyang Hu, Dinesh Dhakal, Kalyanasundaram Somasundaram. Introduction: Completing recurring operating system (OS) upgrades on time and without impacting users can be challenging. For LinkedIn, completing these upgrades at a massive scale has its own complexities, because we often face multiple upgrades. To secure our platform and protect our members’...
- Topics:
- LinkedIn Engineering,
- SRE
-
Introduction: LinkedIn’s stack consists of thousands of different microservices and the complex dependencies among them....
- Topics:
- SRE
-
While site outages are inevitable, it’s our job to minimize both the duration of outages and the likelihood that an outage occurs....
- Topics:
- Performance,
- infrastructure,
- SRE
-
Co-authors: Akbar KM and Kalyanasundaram Somasundaram. Site up and secure is a fundamental element of how we operate, and site...
- Topics:
- Open Source,
- SRE