Site Reliability Engineer – Level 5

Remote, USA Full-time
Job Description:
• Design, implement, and maintain scalable and reliable infrastructure to support Netflix Ads Suite
• Collaborate with engineering and product teams to integrate observability, reliability, and security considerations into the entire software development lifecycle
• Coordinate capacity planning as we scale up Dynamic Ad Insertion for global-scale Netflix Live streaming
• Develop and implement automation tools for monitoring, deployment, and incident response
• Participate in on-call rotations to ensure the 24/7 health of the Netflix Ad Suite and contribute to incident response, diagnosis, and resolution
• Implement and maintain a robust incident response framework, including blame-aware incident reviews
• Proactively identify sources of instability in distributed systems and analyze how complex systems fail
• Champion and embed a culture of reliability across the Ads organization

Requirements:
• 5+ years of experience as a Site Reliability Engineer (SRE), Production Engineer, or similar role supporting business-critical, high-traffic services
• Write code to solve problems; proficient in one or more languages like Python, Go, or Java
• Understand modern cloud infrastructure; hands-on experience with AWS/Azure/GCP, Infrastructure as Code such as Terraform, and container orchestration systems like Kubernetes
• Understand large-scale distributed systems, their common failure modes and edge cases
• Excellent communication skills and a proven ability to build relationships with engineering partners
• Calmly navigate complex production issues, identify root causes, and implement effective, lasting solutions
• Growth mindset; relentlessly curious and passionate about scaling your expertise

Benefits:
• Health Plans
• Mental Health support
• 401(k) Retirement Plan with employer match
• Stock Option Program
• Disability Programs
• Health Savings and Flexible Spending Accounts
• Family-forming benefits
• Life and Serious Injury Benefits
• Paid leave of absence programs
• 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off
• Flexible time off for full-time salaried employees
