Available Locations: Remote Portugal, Netherlands, France, UK, and Germany
About the Department
Cloudflare’s Production Engineering Team builds and runs the systems and software behind our products, which handle trillions of requests per month. Production Engineering ensures that every feature Cloudflare offers, new or existing, can be managed at scale and meets the needs of our rapidly growing customer base.
The Platform Configuration Team within the Production Engineering organization is responsible for ensuring that we can provision and manage our infrastructure in a way that supports both rapid innovation and safe change at scale, delivering the speed and reliability that Cloudflare is known for.
The Platform Configuration Team has a unique role at Cloudflare: we get visibility into many different aspects of how Cloudflare’s systems work. This is an incredible opportunity to work on Cloudflare’s next generation of configuration management systems.
What you'll do
In this role, you'll help us:
- Re-imagine and redesign Configuration Management tooling for a rapidly evolving, global infrastructure capable of handling over 20% of all Internet traffic
- Improve and extend our existing deployment tools, frameworks and processes to provide key enhancements to extensibility and testability
- Design, implement and teach best practices that give our internal users the most flexibility with the gentlest learning curve
- Own operations, reliability, availability and security of our platform configuration systems
- Participate in code reviews to ensure code quality and distribute knowledge
You may be a good fit for our team if you have:
- A degree in Computer Science, Engineering, Mathematics, Statistics or a related field, or equivalent relevant experience
- A deep understanding of performance, security, and user interactions in complex, distributed systems
- Experience developing a configuration management framework at scale (e.g. SaltStack, Chef, Puppet, or Ansible)
- Ability to manage a project, work to deadlines, and prioritize between competing demands
- Proficiency in Python or a similar language
- Experience in designing and implementing secure and highly available distributed systems
- Experience with (and love for) debugging to ensure the system works in all cases
- Experience with a continuous integration workflow and using source control (we use git)
- Strong oral and written communication skills in English