FoundationDB SRE Manager
$228,100 - $342,800/year
Role Details
The FoundationDB SRE team develops applications and tooling that are safe, reliable, scalable, and fast. This work requires an innovative spirit and an extraordinary degree of care and rigor in engineering. Team members contribute to all major components of the FoundationDB deployment infrastructure, including maintenance automation, the backup service application, monitoring and alerting tooling and dashboards, and deployment architecture, and they contribute upstream patches to the database focused on stability, performance, and scaling. As a leader in this organization, you will manage, develop, and grow a team responsible for FoundationDB's scalability and performance across Apple. You will also help grow the iCloud service organization responsible for CloudKit and Content. We are seeking a hands-on manager with domain experience who is comfortable working in the details.

Lead and grow an SRE team responsible for FoundationDB availability, reliability, data durability, and performance.
Design and implement scalable, highly available backup and restore architectures.
Drive development of tiered storage systems and CDC pipelines that support both recovery and online use.
Ensure strong data integrity guarantees across transactional and replicated systems.
Improve performance, reliability, and cost efficiency of large-scale backup storage systems.
Set technical direction while maintaining high engineering quality and operational rigor.

Experience managing or developing critical internet services or platform infrastructure.
Understanding of distributed systems and database concepts (consistency models, isolation levels, crash and recovery semantics).
Fundamentals of system-level hardware and networking components (storage devices and controllers, network interfaces, CPU and memory layout in server-class systems).
Operating systems concepts (process scheduling, disk and network I/O, performance).
Datacenter architecture (networking topologies, host placement strategies, and failure modes); design of multi-datacenter systems; failure domains; and wide-area networking.
Proficiency in modern languages, ideally Java and Golang, optionally Python.
Excellent communication skills and the ability to partner with infrastructure teams.
A high degree of customer focus when engaging with internal platform customers.
Ability to work effectively with colleagues based in other locations.
Understanding of core SRE concepts: monitoring, alerting, incident management, and SLOs.
BS, MS, or PhD in Computer Science or a related field, or equivalent work experience.
Service management across bare-metal, virtualized (EC2), and containerized (Kubernetes) platforms.
Performance engineering (design concepts, profile-guided optimization).
Advanced understanding of data structures and algorithms in storage and indexing.
About Apple Inc
Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store.
Industry: Consumer Electronics & Software