Site Reliability Engineer (SRE) – Data & Insights

Site Reliability Engineer (SRE) – Data & Insights

Site Reliability Engineer (SRE) – Data & Insights

Lloyds Banking Group

Gov UK

Bristol, Bristol, BS1 5LF

1 day ago

No application

About

JOB TITLE: Site Reliability Engineer
SALARY: £70,929 - £78,810 per annum
LOCATION: Bristol
HOURS: Full-time
WORKING PATTERN: Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at one of our office sites

Come join an innovative team as a Data Site Reliability Engineer and gain exposure to one of the biggest data sets in the UK that includes millions of records each day.

About this opportunity
We're looking for a Data Site Reliability Engineer (SRE) to join our Personalised Experiences & Communications (PEC) Customer Intelligence Lab Team within our PEC Platform. You will ensure the reliability, scalability, security and operability of PEC's real time and micro batch data products on GCP. The SRE will partner with Data Engineering, Architecture and Platform teams to design SLOs/SLIs, automate ops, reduce toil, harden controls, and lead incident/problem management across ingress and egress/decisioning systems, ingestion into ODPs (Origin Data Products) and FDPs (Foundation Data Products).


Personalised Experiences and Communications (PEC) is a business platform that sits within Consumer Relationships and plays a critical role in supporting the achievement of the Group strategic view by working closely & collaboratively across the Platform leadership team to grow by protecting and deepening customer relationships. PEC enables & delivers personalised customer communications & experiences across all channels, media and business areas supporting the Customer Relationship Growth strategy to deepen relationships with our customers across both the retail & commercial bank. This includes the data, analytics, and technology to unlock the value of differentiating our branded channel experiences, proposition, price and communication; as well as paper-free sustainable ambitions.

You'll be at the heart of enabling scalable, automated messaging across multiple cloud environments, working closely with Design, Data, and Communications teams at Lloyds Banking Group.

Day to day, you will
* Handle real-time and batch-based data pipelines
* Design and maintain SLOs, SLIs, and error budgets across our data systems, driving continuous improvements in reliability.
* Partner with Data Engineers, full-stack Software Engineers, and Platform teams to automate deployments and manage infrastructure using IaC (Terraform, CloudFormation, etc.).
* Lead root cause analysis and post-incident reviews, embedding reliability learnings into our platform and processes.
* Owning the data products pipeline that the team builds in the lab from go-live through to delivery
* Ensure the reliability of the lab while proposing and recommending paths to improve and keep our data robust and under control
* Develop and collaborate with other teams within the PEC Platform to build GCP-based products


Why Lloyds Banking Group
We're on an exciting journey to transform our Group and the way we're shaping finance for good. We're focusing on the future, investing in our technologies, workplaces, and colleagues to make our Group a great place for everyone. Including you.

What you'll need:
* SRE core: dashboarding, SLO/SLI design, error budgets, production change/incident/problem management, strong runbook craft.
* Strong background in Site Reliability Engineering, DevOps, or Data Engineering (4+ years).
* Experience with CI/CD pipelines and tools such as Jenkins and Harness.
* Proven experience with cloud platforms (GCP, AWS, or Azure) and containerisation (Kubernetes, Docker).
* Deep understanding of data infrastructure - streaming, batch, warehouse, and orchestration tools (e.g. Kafka, Airflow, Spark, dbt, Snowflake).
* Hands-on experience with Infrastructure as Code (Terraform, CloudFormation, etc.).
* Familiarity with monitoring and observability tools.
* Proficiency in automation and scripting languages such as Python, Go, or Bash.
* Practical knowledge on how data pipelines work and how to enable Machine Learning.
* GCP data stack: Dataflow/Apache Beam (streaming & micro batch), Pub/Sub, BigQuery (including streaming inserts/Storage Write API), Cloud Composer/Airflow; Cloud Logging/Monitoring/Trace.
* Streaming: Kafka fundamentals (partitions, consumer groups, compaction, schema governance), Connectors and DLQs.
* Security & compliance on GCP.

And any experience of these would be really useful:
* Experience with Azure or AWS Cloud Public Cloud platforms
* Experience with real-time streaming and event-driven architectures.
* Exposure to machine learning infrastructure, feature stores, or model serving.
* Familiarity with data lineage, cataloguing, or metadata management tools.
* Experience contributing to open-source or internal reliability initiatives.


About working for us
Our focus is to ensure we're inclusive every day, building an organisation that reflects modern society and celebrates diversity in all its forms. We want our people to feel that they belong and can be their best, regardless of background, identity or culture. We were one of the first major organisations to set goals on diversity in senior roles, create a menopause health package, and a dedicated Working with Cancer initiative. And it's why we especially welcome applications from under-represented groups. We're disability confident. So if you'd like reasonable adjustments to be made to our recruitment processes, just let us know

We also offer a wide-ranging benefits package, which includes:
* A generous pension contribution of up to 15%
* An annual performance-related bonus
* Share schemes including free shares
* Benefits you can adapt to your lifestyle, such as discounted shopping
* 30 days' holiday, with bank holidays on top
* A range of wellbeing initiatives and generous parental leave policies

Ready for a career where you can have a positive impact as you learn, grow and thrive? Apply today and find out more.

Proud member of the Disability Confident employer scheme

Disability Confident
A Disability Confident employer will generally offer an interview to any applicant that declares they have a disability and meets the minimum criteria for the job as defined by the employer. It is important to note that in certain recruitment situations such as high-volume, seasonal and high-peak times, the employer may wish to limit the overall numbers of interviews offered to both disabled people and non-disabled people. For more details please go to Disability Confident.