Our Engineering Team lies at the core of the value we offer to our customers. We solve complex problems by working not only within our squads but also collaboratively with other teams across the organization. If you are excited by solving complex technical challenges, this is the right place for you!
YOUR ROLE
As a member of Shift Technology's SRE and Developer Experience team within our Cloud Platform department, your role will be to:
- Build the infrastructure platforms that enable the deployment and hosting of our services (CI/CD, cloud platform, observability)
- Own the development of our internal developer platform, which enables our internal users to self-serve (creation of a new service in Kubernetes, day 2 operations, start of new products…)
- Keep our service reliable, available and fast
- Debug, troubleshoot and optimize application performance, and solve scaling bottlenecks in critical services, whether they lie deep in the OS kernel or in the application code
- Define internal operational needs, and develop and own the appropriate tools
- Provide expert support to our level-2 / application support team to troubleshoot priority incidents and conduct post-mortems
- Build a DevOps and SRE culture and enable the transition to modern infrastructure management and deployment practices
YOUR TOOLKIT
We work with modern technologies and always encourage our team to explore what's new in the market.
Our main tools are:
- GitHub, Terraform, Python, C#, Golang, Ansible, ArgoCD
- Linux, Containers and Kubernetes
- Grafana, OpenTelemetry, Tempo and Loki
- Microsoft Azure, AWS, and OVH Data Centers
- Palo Alto Firewalls, F5, Cloudflare and Cilium for our Kubernetes clusters
WHAT WE ARE LOOKING FOR
- Technical Abilities:
- 5+ years of experience with data structures and algorithms, handling big data with NoSQL databases, queuing systems such as Kafka, RabbitMQ, Redis... and other systems such as Elasticsearch, MongoDB and ClickHouse at scale
- 3+ years of experience running Kubernetes in production and using at least one major cloud provider at scale (multi-region, geo-routing, low latency, sharding, high availability and scalability)
- Ability to write and scale Infrastructure as Code
- You have architected, built, and operated distributed systems to solve problems at high scale
- Understanding of the security, logging, monitoring and performance aspects of cloud-native platform and application architectures
- Solid understanding of automation principles and programming experience in languages such as Python, C# and/or Golang
- Soft Skills:
- You have 5 years of experience in SRE/DevOps and software engineering;
- You are proactive, take pride in and ownership of your work, and are able to distribute the workload across the SRE team;
- You are dynamic, curious and eager to learn, always looking to expand your fields of expertise;
- You can work under pressure and still deliver excellent service to our customers;
- You are able to maintain a high level of confidentiality, professionalism and a courteous demeanor when working with clients and internal teams;
- You can easily adapt your work to changing priorities, as needed.
RECRUITMENT PROCESS
- HR interview
- 1 technical interview with our Head of SRE
- 1 take-home case study, followed by an oral technical debrief with the team
- Interview with the Head of Infrastructure