Principal Cloud Reliability Engineer

Location: Maryland, US

Requisition Number: 76132

Role Summary

Cloud Reliability operates as a center of excellence for cloud and reliability engineering, delivering secure, scalable, and resilient cloud platforms to teams across the firm.  The Principal Cloud Reliability Engineer is a senior individual contributor responsible for defining, building, and operating enterprise cloud foundations, with a strong emphasis on reliability, observability, and operational excellence.  This role combines enterprise technical authority with hands‑on execution, ensuring that cloud platforms—particularly AWS Landing Zone (ALZ)—enable teams to deliver end‑to‑end applications that meet the firm’s availability, resilience, and risk expectations. The Principal serves as a design authority, SRE leader, and escalation point for complex cloud and reliability challenges.

Responsibilities

The Principal Cloud Reliability Engineer owns the architecture, implementation, and reliability outcomes of enterprise cloud platforms. The role balances strategic leadership with direct contribution, driving reliability standards while remaining hands‑on in critical areas of platform engineering and incident response.

  • Lead the design, architecture, and evolution of enterprise cloud platforms, including AWS Landing Zone (ALZ).
  • Ensure cloud platforms are designed for high availability, fault tolerance, and operational resilience.
  • Design and implement cloud solutions using AWS services such as EC2, S3, ECS, EKS, ELB, RDS, Route 53, Lambda, and API Gateway.
  • Design and automate advanced AWS networking solutions including VPCs, Transit Gateway, VPC Peering, PrivateLink, and Direct Connect.
  • Build and maintain Infrastructure as Code (IaC) using Terraform, CloudFormation, Ansible, and Git‑based workflows.
  • Be accountable for Site Reliability Engineering (SRE) outcomes across cloud platforms and drive adoption of SRE best practices.
  • Standardize reusable modules and patterns that promote consistent, reliable deployments at scale.
  • Define and enforce reliability standards, including availability targets, recovery expectations, and resilience patterns.
  • Ensure instrumentation, monitoring, logging, and alerting are embedded into platforms and services.
  • Act as an escalation point for complex incidents, driving root‑cause analysis and long‑term remediation.
  • Design and implement guardrails that enable secure, reliable, self‑service cloud adoption.
  • Enable teams to own end‑to‑end services while meeting reliability and operational standards.
  • Participate in an Agile delivery model, contributing to stories and epics, tracking work in Jira, and supporting sprint releases.
  • Partner closely with Application Development, Development Services, Enterprise Architecture, and Enterprise Security teams.
  • Mentor engineers while promoting a culture of operational excellence and tech modernization to drive client value.

Business Knowledge

  • Deep understanding of how reliability and availability impact business outcomes and client experience.
  • Ability to balance delivery speed with risk, resilience, and operational sustainability.
  • Experience operating in regulated, risk‑aware environments with strong security and compliance requirements.
  • Ability to make decisions aligned with enterprise technology strategy while improving MTTR, reducing incidents, and strengthening platform stability.

Qualifications
Required:

  • Bachelor's degree or the equivalent combination of education and relevant experience AND 10+ years of experience designing and operating cloud infrastructure with senior‑level impact.
  • Deep hands‑on experience with AWS.
  • Expert knowledge of:
    • Cloud infrastructure, container platforms, and serverless deployments.
    • Reliability engineering concepts (availability, resilience, observability).
    • Operating systems, networking, identity and access management.
    • Infrastructure automation, CI/CD pipelines, and DevSecOps practices.
  • Proven ability to design, build, and operate enterprise‑scale, highly reliable cloud platforms.
  • Strong troubleshooting skills across infrastructure, platform, and reliability layers.
  • Experience mentoring engineers and influencing teams without formal line management.

Preferred:

  • Cloud or SRE‑related certifications.
  • Working knowledge of Azure.
  • Experience with SolarWinds DPA.

Applicants for employment in the US must have work authorization that does not now or in the future require sponsorship of a visa for employment authorization in the United States (e.g., H-1B visa, F-1 visa (OPT), TN visa, or any other non-immigrant work status).

FINRA Requirements  
FINRA licenses are not required and will not be supported for this role.   
 
Work Flexibility  
This role is eligible for hybrid work, with up to three days per week from home.  

Commitment to Diversity, Equity, and Inclusion

We strive for equity, equality, and opportunity for all associates. When we embrace the power of diversity and create an environment where people can bring their authentic and best selves to work, our firm is stronger, and we create greater value for our clients. Our commitment and inclusive programming aim to lift the experience for each associate and build allies for our global associate community. We know that a sense of belonging is key not only to your success at the firm, but also to your ability to bring your best each day.

Benefits

We invest in our people through a wide range of programs and benefits, including:

  • Competitive pay and bonuses as well as a generous retirement plan and employee stock purchase plan with matching contributions
  • Flexible and remote work opportunities
  • Health care benefits (medical, dental, vision)
  • Tuition assistance
  • Wellness programs (fitness reimbursement, Employee Assistance Program)

Our policies may change as our working lives evolve. Yet, our commitment to supporting our associates’ well-being and addressing the needs of our clients, business, and communities is unwavering.

T. Rowe Price is an equal opportunity employer and values diversity of thought, gender, and race. We believe our continued success depends upon the equal treatment of all associates and applicants for employment without discrimination on the basis of race, religion, creed, color, national origin, sex, gender, age, mental or physical disability, marital status, sexual orientation, gender identity or expression, citizenship status, military or veteran status, pregnancy, or any other classification protected by country, federal, state, or local law.

We’re driven by our purpose: To identify and actively invest in opportunities to help people thrive in an evolving world.

© 2026 T. Rowe Price. All Rights Reserved.