• Full Time

Apply for job

Drop your file here, or Browse. Max. file size: 1 MB.

Omnis Partners are supporting a fast-growing AI business hiring a Principal SRE / Platform Engineer to support a major UK banking client.

This is a senior platform engineering and site reliability role focused on helping regulated organisations move AI systems from prototype into safe, scalable production.

The work sits at the intersection of AI infrastructure, banking technology, Kubernetes, cloud-native platforms, reliability engineering and production security.

The role

You will help design and operate the production guardrails around AI workloads in a highly regulated financial services environment.

This is not a research or machine learning engineering role. It is about the infrastructure, controls and reliability layer needed to run AI products safely in production.

You will be working across areas such as Kubernetes, observability, monitoring, incident response, security, access control, compliance, auditability, safe rollout and operational resilience.

The role would suit a senior hands-on SRE, Platform Engineer, Principal Platform Engineer or Infrastructure Engineer who has worked in complex, production-critical environments.

What you’ll be working on

You will help shape how AI workloads are deployed, monitored, secured and operated in a regulated banking environment.

Key areas will include:

  • Building and improving Kubernetes-based platform infrastructure
  • Designing reliable, scalable and secure production systems
  • Improving observability, monitoring, alerting and incident response
  • Supporting safe rollout, rollback and production change processes
  • Implementing security, access control and compliance guardrails
  • Working with senior engineering and client stakeholders
  • Helping define operational standards for AI systems in production
  • Supporting platform maturity across a fast-moving AI environment

What we’re looking for

We are looking for someone with strong platform engineering and SRE fundamentals.

You do not need to come from a pure AI background. AI/ML exposure would be useful, but the priority is experience building, securing and operating production-grade infrastructure.

Relevant experience might include:

  • Senior SRE, Staff SRE, Principal SRE or Lead SRE experience
  • Senior Platform Engineer, Principal Platform Engineer or Kubernetes Platform Engineer experience
  • Strong Kubernetes experience in production environments
  • Terraform / Infrastructure as Code
  • Cloud-native platform engineering
  • Observability, monitoring, logging, tracing and alerting
  • Incident management, reliability engineering and production support
  • Security, identity, access control and compliance
  • Experience in banking, fintech, financial services or another regulated environment
  • Strong communication skills and comfort working with senior technical stakeholders

Why this role?

This is an opportunity to work on one of the most important problems in AI: how to make agentic AI safe, reliable and production-ready for regulated enterprises.

The company is working with serious enterprise clients, including in banking, where reliability, security and compliance are not optional.

You will be joining at a point where AI infrastructure is moving beyond experimentation and into real-world production systems.

For the right person, this is a chance to have meaningful influence over how modern AI systems are deployed and operated at scale.

SIMILAR JOBS

CONTACT US

Please contact for any additional information or for updates.

SUBMIT A VACANCY

Send us the details of your job opening and one of our consultants will be in touch to discuss suitable candidates.

UPLOAD YOUR CV

Send us your details and one of our consultants will be in touch to discuss suitable roles.