Staff Software Engineer, ML Data Platform

(Additional locations: San Francisco CA, Sunnyvale CA)

As an engineer on this team, you will be responsible for building and supporting a petabyte-scale data platform in the cloud and providing powerful foundations for Cruise’s ML Data Platform tools, frameworks, and services. Responsibilities include ensuring scalable, transparent, and reliable data ingestion and management; development of fast, robust, and spike-resistant data consumption, data mining, and processing tools for the entire company; and development of orchestration for large-scale post-processing, and computational pipelines.

What you’ll be doing:

Lead us in the development, optimization and productionization of the next generation data processing platform using Beam and Spark in the cloud.
Build self-serve capabilities to help customers to adopt the next generation data processing platform
Use the latest cloud technologies to own, design, implement, and test scalable distributed data systems in the cloud. Champion engineering excellence by continuously improving systems and processes
Own technical projects from start to finish, contribute to the team’s product roadmap, and be responsible for major technical decisions and tradeoffs. Effectively participate in team’s planning, code reviews and design discussions
Consider the effects of projects across multiple teams and proactively manage conflicts. Work together with partner teams and orgs to achieve cross-organizational goals and satisfy broad requirements
Conduct technical interviews with well-calibrated standards and play an essential role in recruiting activities. Effectively onboard and mentor junior engineers and/or interns

What you must have:

Experience building a data processing system using Beam / Spark and its ecosystems from the ground up.
Experience optimizing those data processing clusters for cost efficiency and performance
Experience building serving systems capable of delivering data at high-throughput, low-latency and high QPS in a cost-efficient and spike-resilient manner.
Experience building scalable infrastructure on the cloud with Python or Java/Scala (or similar)
10+ years working with big data
BS, MS or Ph.D. in Computer Science, Electrical Engineering, Mathematics, Physics, or another relevant field; or equivalent real-world experience
Passionate about self-driving technology and its potential impact on the world
Attention to detail and a passion for seeking truth
A track record of efficiently solving complex problems
Startup mentality - openness to dealing with unknown unknowns and wearing many hats

Bonus points!

Demonstrable expertise in a building end-to-end data ingestion, processing and serving systems at petabyte scale from the ground up
Proficiency in writing SQL queries for analytic purposes
Relevant publications

The salary range for this position is $183,600 - $270,000. Compensation will vary depending on location, job-related knowledge, skills, and experience. You may also be offered a bonus, and benefits. These ranges are subject to change.