Database Systems SRE, ASE Cassandra SRE
$171,600 - $302,200/year
Role Details
The ASE Cassandra SRE team develops applications and tooling that are safe, reliable, scalable, and fast. This work requires an innovative spirit and an extraordinary degree of care and rigor in engineering. Team members contribute to all major components of Cassandra deployment infrastructure, including maintenance automation, backup service application, monitoring and alerting tooling/dashboards, deployment architecture, as well as contributing back to the upstream patches to the database focused on stability, performance, and scaling. This role also requires excellent communication, ability to partner with our Core Storage and Analytics teams, and a high degree of customer focus when engaging with internal platform customers. As a distributed team, ability to work effectively with colleagues based in other locations is also essential; experience in this area is a plus. Prior experience with development or maintenance of distributed databases / storage systems is recommended. Understanding of core SRE concepts - Monitoring, Alerting, Incident management. Understanding of database concepts (consistency models, isolation levels, crash and recovery semantics). Performance engineering (design concepts, profile-guided optimization). Service management across a bare metal, virtualized (EC2), and containerized (K8s) style platforms. Fundamentals of system-level hardware and networking components (storage devices and controllers, network interfaces, CPU and memory layout in server-class systems). Operating systems concepts (process scheduling, disk and network I/O, performance). Datacenter architecture (networking topologies, host placement strategies, and failure modes); design of multi-datacenter systems; failure domains; and wide-area networking. BS or MS in Computer Science / related fields or equivalent work experience Support of internet-facing production services and distributed systems via deployments, On Call and Incident Management. Experience running large scale infrastructure with a heavy reliance on automation tooling Excellent troubleshooting and performance deep dive analysis Real operational experience managing services at scale on Kubernetes Proficient in one or more of the following programming languages: Java, Go (golang), Python Operational experience deploying in and running on Datacenter and Cloud architectures (networking topologies, host placement strategies, and failure modes); design of multi-datacenter systems; failure domains; and wide-area networking. Self motivated, inquisitive with an aptitude to learn new technologies quickly and effectively. Support of internet-facing production services and distributed systems via deployments, On Call and Incident Management. Experience running large scale infrastructure with a heavy reliance on automation tooling Excellent troubleshooting and performance deep dive analysis Real operational experience managing services at scale on Kubernetes Proficient in one or more of the following programming languages: Java, Go (golang), Python Operational experience deploying in and running on Datacenter and Cloud architectures (networking topologies, host placement strategies, and failure modes); design of multi-datacenter systems; failure domains; and wide-area networking. Self motivated, inquisitive with an aptitude to learn new technologies quickly and effectively.
For more details click Job Post.
About Apple Inc
Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store. Industry: Consumer Electronics & Software