Jobs
Big Data / Data Platform Site Reliability Engineer
Big Data / Data Platform Site Reliability Engineer
We're is partnering with a fast-growing, data-intensive technology organisation to hire a Site Reliability Engineer focused on large-scale data platforms. This role sits at the heart of a mission-critical data environment, with responsibility for reliability, scalability and operational excellence across complex distributed systems.
This is a senior, hands-on role for an engineer who enjoys owning infrastructure, improving system behaviour over time and operating close to production in high-throughput environments.
The role
You’ll be responsible for deploying, configuring, monitoring and maintaining multiple large-scale data stores across distributed environments. Reliability, performance and availability are core to the role, with a strong focus on lifecycle management of critical data infrastructure.
You’ll manage and evolve large Linux-based systems, ensuring predictable performance and high uptime. This includes defining and documenting configuration standards, operational procedures and best practices that support long-term stability.
A key part of the role involves performance and reliability testing, reviewing system configuration, software choices and hardware decisions to identify improvement opportunities. You’ll also play an active role in incident response, root cause analysis and driving lasting reliability improvements across the platform.
There is scope to influence the direction of the technology stack, contributing ideas that improve resilience, observability and operational efficiency.
What we’re looking for
This role suits someone with strong hands-on experience operating large-scale Linux infrastructure in production environments. You should be comfortable owning complex systems and debugging issues across storage, compute and networking layers.
Deep, practical experience with Hadoop-based data platforms is important, including HDFS architecture, security models and operational lifecycle management such as upgrades, scaling and recovery. Experience running Kafka clusters in production environments is also key.
You should have experience designing or improving automation and deployment workflows, with proficiency in scripting or automation using tools such as Python or shell scripting. A solid understanding of networking fundamentals is expected, including TCP/IP, DNS, load balancing and basic network security concepts.
The role requires someone who is comfortable taking technical ownership, contributing to on-call and incident processes, and driving continuous reliability improvement.
The position operates on East Coast US working hours and is suitable for engineers working remotely.
Additional experience
Experience with large-scale analytical query engines, distributed storage systems or high-availability databases is beneficial. Familiarity with observability platforms, configuration management tools, containerisation and Kubernetes in production environments is also valuable.
Engineers who enjoy mentoring others and helping establish operational standards will find opportunities to do so in this role.
Location
Type
Full-time
Apply
Other Jobs
Senior Data Engineer - Enterprise data
Full-time
Senior Data Engineer
Remote
Full-time
Data Engineer
Full-time
Senior / Staff AI Engineer
Remote
Full-time
Full Stack Engineer
Remote – Germany
Full-time
Senior Platform Engineer
Remote – Europe
Full-time
Senior Cloud Security Architect
Remote – Europe
Full-time
Senior Big Data Architect
Remote – Europe
Full-time
Senior Cloud Data Architect
Remote – Europe
Full-time
Senior Full-Stack Engineer
Remote – Full Time
Full-time
Senior Security Engineer
Düsseldorf, Germany, Hybrid, Full-time
Full-time
Azure Cloud Engineer
Berlin, Germany, Full-time, Hybrid
Full-time
Head of Infrastructure
Berlin, Germany Hybrid
Full-time
Senior Backend Engineer
Frankfurt, Germany – Hybrid
Full-time
Senior Frontend Engineer
Frankfurt, Germany – Hybrid
Full-time
Lead UI Engineer
Remote / Europe
Full-time
Product Engineer
Remote / Europe
Full-time
Cloud Infrastructure Engineer
Remote / Europe
Full-time
Senior Backend Engineer (Remote – Europe)
Remote / Europe
Full-time
Testimonials
What people have to say

Dan
Solution Engineer
Tides were incredibly diligent on my behalf during my search. They represented me brilliantly by providing transparency and creating urgency in the process. Without them, I am not convinced I would get the job.

Philipp
Senior Engineer
Scott is an exceptionally good recruiter, the best one I’ve ever worked with. He always made sure to keep me up to date, asked how interviews went and followed up with companies immediately. I can strongly recommend Tides.

Sammi
Global Talent Manager
Tides is an excellent recruitment expert. They actively communicate during our collaboration, quickly understands the business needs and find us suitable candidates. They have found many outstanding employees for us in past years.

Leonard
Co Founder
Courtney is a pleasure to work with for tech recruiting - highly effective and great a getting strong candidates into and through the funnel and really understanding the kind of profiles that would fit to our team. Tides are my top pick!
Blog
Recent blog post
FAQs
Frequently asked questions
Why should we choose Tides Digital for our talent needs?
How does Tides Digital identify top talent in the technology domain?
Can Tides Digital streamline our hiring process?
How do you ensure candidate quality and suitability?
How can we start a partnership with Tides Digital?



