JobsSite Reliability Engineer 5, Ads SRE
N

Site Reliability Engineer 5, Ads SRE

NetflixUSA
RemoteFull-timeVerified

About the role

Netflix is hiring a Site Reliability Engineer 5 for its Ads Reliability Engineering team to help ensure the stability, scalability, and resilience of systems powering the Netflix Ads Suite. The role focuses on maintaining highly available distributed systems that directly impact advertising performance, revenue, and user experience across Netflix’s rapidly growing advertising infrastructure. You will design and maintain scalable cloud infrastructure, improve observability and automation, support incident response processes, and proactively identify reliability risks across large-scale distributed systems. The role also involves collaborating closely with engineering and product teams to embed reliability, security, and operational best practices throughout the software development lifecycle. Engineers on this team contribute to incident management, capacity planning for large-scale live streaming ad insertion systems, and the development of tooling and frameworks that improve operational efficiency and engineering velocity.

What we're looking for

Candidates should have at least 5 years of experience working as a Site Reliability Engineer, Production Engineer, or in a similar role supporting high-scale, business-critical systems. Strong programming skills in languages such as Python, Go, or Java are required, along with a strong automation mindset and experience building operational tooling. Applicants should have hands-on experience with cloud platforms such as AWS, Azure, or GCP, infrastructure-as-code technologies like Terraform, and container orchestration systems such as Kubernetes. A strong understanding of distributed systems, system reliability, scalability challenges, and production troubleshooting is essential. Strong communication and collaboration skills are required, as the role involves working across multiple engineering teams and influencing reliability practices organization-wide. Experience with ad-tech systems, real-time bidding platforms, dynamic ad insertion, high-scale traffic systems, large-scale data pipelines, or open-source reliability tooling is considered a strong advantage.
Site Reliability EngineeringSRENetflixCloud InfrastructureAWSKubernetesTerraformPythonGoJavaDistributed SystemsDevOpsProduction EngineeringAd TechDynamic Ad InsertionObservabilityIncident ResponseReliability Engineering

About Netflix

N

Netflix