Summary
The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, Fitness+ and Apple Books. And they do it on a massive scale, meeting Apple's high expectations with dedication to deliver a huge variety of entertainment in over 35 languages to more than 150 countries. These engineers build secure, end-to-end solutions. They develop the custom software used to process all the creative work, the tools that providers use to deliver that media, all the server-side systems, and the APIs for many Apple services. Thanks to Apple's unique integration of hardware, software, and services, engineers here partner to get behind a single unified vision. That vision always includes a deep dedication to strengthening Apple's privacy policy, one of Apple's core values. Although services are a bigger part of Apple's business than ever before, these teams remain small, nimble, and cross-functional, offering greater exposure to the array of opportunities here.
Description
As an SRE at Apple, you will work on world-renowned internet services that serve hundreds of millions of users worldwide. You’ll collaborate closely with development teams and other stakeholders in the entire lifecycle of services from inception through deployment and continuous refinement. Your responsibilities will include monitoring performance and availability, capacity and disaster recovery planning, defining Service Level Objectives (SLO) and ensuring the reliability and resiliency of systems. Join us if you thrive on solving large-scale problems through your expertise and collaborative team work.
- Support and lead cross functional projects across multiple services - Design, analyze and troubleshoot complex mission-critical distributed systems on a global scale.
- Provide OnCall support to 1st level production support teams.
- Participate in incident management and blameless postmortems.
- Detect and resolve performance bottlenecks, enhance service efficiency.
- Write clear technical documentation, production run books.
Apple is an Equal Opportunity Employer that is committed to inclusion and diversity. We also take affirmative action to offer employment and advancement opportunities to all applicants, including minorities, Women, protected veterans, and individuals with disabilities. Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation or that of other applicants. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Minimum Qualifications
- BS degree in computer science or equivalent field with 5+ years or MS degree with 3+ years experience, or equivalent.
- At least 5 years in a Site Reliability Engineering, DevOps or infrastructure focused role
- Excellent written and verbal communication skills
- Experience in supporting internet-facing production services and distributed systems
- Experience in collaborating effectively across organizations, building strong relationships
- Proficient coding experience with Python or similar scripting languages
- Experience with containers and container orchestration platforms such as Docker and Kubernetes
- Experience with monitoring tools such as Splunk, Grafana, and Prometheus
- Demonstrated ability to deliver results on time with high quality
- Experience in managing and scaling distributed systems in a public, private, or hybrid cloud environment
- Experience in debugging Java code and optimizing JVM performance is plus.
- Understanding of the Linux Operating System, standard networking protocols, and components
Preferred Qualifications
- Passion for crafting and building reliable systems
- Strong sense of ownership and integrity proven through clear communication and collaboration
- Automation advocate - you truly believe in removing operation load with software
- Excellent troubleshooting and problem solving skills