Senior Perception Algorithms Engineer - Special Project

Apple Inc

Cupertino, California, USA Posted 42 days ago

$181,100 - $318,400/year

Role Details

As a member of the team, you’ll have the opportunity to work with a team of highly skilled engineers and scientists to bring new experiences to Apple products. We are looking for a self-motivated algorithm engineer who is passionate about blending modern deep learning with robust classical techniques. This position requires strong technical and interpersonal skills to handle responsibilities including: Designing and implementing a robust, real-time multi-object tracking system to solve real-world computer vision problems. Leveraging multimodal estimates (vision, audio, etc.) to ensure robust, high-fidelity estimation across complex and challenging environments. Developing rigorous evaluation frameworks, curating datasets, and defining metrics to benchmark model performance, analyze edge cases, and continuously improve perception pipelines. Integrating perception systems into a larger software stack with real-world performance constraints. PhD in Computer Science, Robotics, or a related field with 3 years industry experience or MS with 5+ years industry experience. Proficiency in systems programming (C++/Swift) and writing performant, production-quality code. Fluency in Python and modern ML frameworks (e.g., PyTorch, JAX) with a solid foundation in machine learning and traditional perception and state-estimation pipelines. Ability to break down complex problems into testable solutions, prioritizing challenging edge cases and accessible experiences for all users. Curiosity about new technologies, flexibility, and an openness to ambiguity. Experience designing scalable evaluation pipelines for learning based and classical perception pipelines. Experience in building and/or deploying on-device computer vision models or multi-object tracking systems. An area of particular domain expertise, such as one of the following: Experience with machine learning approaches and architectures (e.g., VLMs, VLAs, foundation models, self-supervision, distillation, or data augmentation techniques). Experience with classical and modern computer vision approaches, reconstruction pipelines, image processing/camera systems and computational photography pipelines Experience with multimodal data fusion across a variety of inputs and sensors, including audio processing (e.g., DSP, echo cancellation, audio-visual diarization, speech recognition) Knowledge of the broader robotics software stack (e.g., kinematics, planning, controls) alongside state estimation methods (e.g., SLAM, factor graphs, filtering, sensor fusion) and reinforcement learning methods. Strong applied math background (e.g., numerical optimization, geometry, graphics). Familiarity with Swift and Apple developer tools.

For more details click Job Post.

About Apple Inc

Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store. Industry: Consumer Electronics & Software