Apple Cloud Services infrastructure operates at planetary scale. Data Platform Site Reliability Engineering manages infrastructure and applications on bare-metal and cloud computing platforms to deliver data processing, governance, and storage for many of Apple’s global products and organizations. Our platform teams work with exabytes of data, terabytes of memory, and hundreds of thousands of jobs to enable predictable and performant data analytics that power features in Apple Music, TV, Maps, News, and other world-class products. Ensuring that all of these technologies, spread across geographically distributed data centers and platforms, work together in harmony presents unique challenges. As an SRE at Apple, you’ll solve the problems that arise using empirical data, teamwork, and your own unique expertise.
Data Platform SRE works directly with our partner engineering teams, operating in unison with developers to deliver seamless experiences for our customers. We run a mix of open-source, vendor-licensed, and internally developed tools, which you will use and have opportunities to improve. The cross-functional team collaborates to apply a consistent incident management process across all data platform services and to provide high-availability architecture, deployment automation, and user-journey-based SLOs derived from exhaustive observability metrics. We think critically and, for each engineering challenge we face, strive to balance the best solution with the need to get things done. Good ideas are heard and results are rewarded.