As an engineer on one of our Core platform teams, you'll be building the tools to help engineers deploy and operate the services that make Cloudflare work. Our mission is to provide a reliable, yet flexible, platform to help product teams release new software efficiently and safely. You’ll be building the private cloud that Cloudflare developers leverage to build Cloudflare itself. Core platforms we operate at Cloudflare include:
- Kubernetes
- Kafka and Zookeeper
- Developer tools, CI, and CD systems
- Vault, Consul, Terraform
- Consul
- Etcd
- Temporal, Airflow
What You'll Do
- Build software that automates the operation of large, highly-available distributed systems.
- Ensure platform security, and guide security best practices
- Document your work and guide fellow developers towards optimal solutions
- Contribute back to the open source community
- Leave code better than we found it
What You'll Need
- Recent career experience with Go, Python or Rust and at least 3 years experience in the role of full-time software engineer (any language).
- Experience with configuration management tooling (we use Salt)
- Experience with deploying and managing services using Docker on Linux
- A firm grasp of IP networking, load balancing and DNS
- Excellent debugging skills in a distributed systems environment
- Source control experience including branching, merging and rebasing (we use git)
- The ability to break down complex problems and drive towards a solution
- Be passionate about improving User Experience
Bonus Points
- Experience with Deployment, StatefulSets, Persistent Volumes Claims, Ingresses, CRDs on Kubernetes
- Operational experience deploying, tuning and managing large systems on bare metal
- Experience as a Site Reliability Engineer (SRE) across a large scale infrastructure.
- You have practical knowledge of web and systems performance, and extensively used tracing tools like ebpf and strace.