Quantinuum is seeking to hire a System Reliability Engineer for our Cambridge-based cloud platform, Quantinuum Nexus. Our team aims to support quantum researchers at every stage of an experiment, making working on quantum computers as easy as sending an email.
 
 The successful candidate will be an expert at working with managed Kubernetes instances such as Amazon EKS and the distributed systems that can be built on top of them. Managing the architecture, performance, security and cost of this instance will be at the core of what you do. Experience with tools like Helm, Karpenter and k9s will be essential to meeting the goals of this role.
 
 The ideal candidate will have experience collecting logs, traces and metrics via OpenTelemetry and making them available through AWS products like X-Ray and CloudWatch. These readings should then be used to ensure that Nexus meets our high standards for performance and reliability or, if it is falling short, to direct the team on how best to improve things.
 
 During incidents and outages, you’ll take an active part in reporting, monitoring and diagnosing the cause. You should have the programming experience required to read and understand production code and match it against the readings collected by monitoring tools. You will work closely with the development team to make sure everyone has the information they need to identify and resolve the issue as soon as possible.
 
  The ideal candidate will have:
  
- Experience with Kubernetes and Docker.
- Experience collecting logs, traces and metrics for distributed systems.
- Experience using tools such as AWS CloudWatch to locate bugs and performance issues.
- Experience improving declarative Infrastructure as Code tools such as Terraform.
- Experience working on cloud-based systems where uptime and reliability are crucial.
- Professional experience working with Python.
   
 
 
It would be desirable to have:
- Experience with PostgreSQL.
- Experience working in a continuous deployment environment.
- Experience with triaging and debugging issues in code.
- Familiarity with the OpenTelemetry standard and SDKs.
   
 
 
What is in it for you?
 You will work alongside a highly talented team that includes leading names in the quantum computing industry. We offer a highly competitive package, equity, 28 days of paid holiday (in addition to public holidays), a workplace pension, a positive approach to flexible working and enhanced parental and adoption benefits.
 
  About Us:
  
 Quantinuum is the world’s largest integrated quantum company, driving breakthroughs in materials discovery, cybersecurity, and next-generation quantum AI. With a team of more than 600 employees, including more than 420 scientists and engineers, we are leading the worldwide quantum computing revolution.
 
 By uniting best-in-class software with high-fidelity hardware, our integrated full-stack approach is accelerating the path to practical quantum computing and scaling its impact across multiple industries.
 
 As we celebrate the International Year of Quantum, there has never been a more exciting time to be part of this rapidly evolving field. By joining Quantinuum, you’ll be at the forefront of this transformative revolution, shaping the future of quantum computing, pushing the limits of technology, and making the impossible possible.
 
 Visit our news pages to learn more about Quantinuum and our scientific breakthroughs and achievements: https://www.quantinuum.com/news
 
 Quantinuum Intro Video: The Future of Quantum Computing
 
 Please note that employment with us is subject to successfully passing our pre-employment screening checks. We are an inclusive equal opportunity employer. You will be considered without regard to age, race, creed, color, national origin, ancestry, marital status, affectional or sexual orientation, gender identity or expression, disability, nationality, sex, or veteran status.
 
 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.