2024 AFROTECH Conference Attendees Only - Site Reliability Engineer (Hybrid)

Location: Maryland, US

Notice

This position is no longer open.

Requisition Number: 74234

Position Title:

External Description:

We are seeking highly motivated Site Reliability Engineers (SRE). As an SRE, you will play a crucial role in ensuring the availability, latency, performance, efficiency, and stability of our critical infrastructure, which supports a range of data platforms, applications, and services. You will collaborate closely with development teams to implement and maintain reliable and scalable systems while adhering to industry best practices and security standards.

Responsibilities

Availability :

Proactively monitor and proactively identify potential issues that could impact the availability of our systems.
Implement and maintain automated alerting mechanisms to notify the appropriate parties of potential outages or performance degradation.
Collaborate with development teams to design and implement solutions that enhance system resilience and reduce downtime.

Latency:

Analyze performance metrics to identify and resolve latency bottlenecks in our infrastructure.
Implement performance optimization techniques and tools to improve the overall responsiveness of our systems.
Work with development teams to ensure that new features and code changes do not introduce performance regressions.

Performance:

Develop and maintain metrics dashboards to track key performance indicators (KPIs) for our critical systems.
Identify performance trends and anomalies that may indicate potential issues or areas for improvement.
Recommend and implement performance optimization strategies to enhance the overall efficiency of our systems.

Efficiency:

Optimize resource utilization and minimize unnecessary expenditure on IT infrastructure.
Collaborate with development teams to optimize resource allocation for new applications and services.

Release Management:

Participate in the release planning process to ensure that software releases are conducted smoothly and without disruptions.
Develop and implement automated deployment and rollback procedures to mitigate risks associated with software updates.
Monitor the performance of new releases and address any issues that arise promptly.

Monitoring:

Design, implement, and maintain a comprehensive monitoring infrastructure to track the health and performance of our systems.
Analyze monitoring data to identify potential issues and proactively troubleshoot problems before they impact users.
Develop and implement alerts and notifications for critical events to ensure timely intervention.

Emergency Response:

Respond promptly to incidents and work collaboratively to resolve them in a timely manner.
Analyze root causes of incidents to identify and implement preventive measures to minimize their recurrence.
Document incident responses and lessons learned to enhance our incident handling processes.
Participate in capacity planning exercises to anticipate future workloads and make proactive recommendations to expand or optimize infrastructure resources.
Stay abreast of emerging technologies, trends, and industry best practices in the field of site reliability engineering and contribute to the continuous improvement of our practices and tools.
Work with development teams to review architecture design to ensure high availability and proper disaster recovery strategy
Collaborate with reliability and infrastructure engineering team in T Rowe Price to build synergy in tooling for the implementation of observability, tracing, and alerting

Qualifications

Required:

Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience)
Experience as a Site Reliability Engineer or equivalent in a similar role.
Experience in monitoring, analyzing, and optimizing the performance of large-scale distributed systems.
Linux systems administration, including managing servers, operating systems, and network configurations.
Scripting and automation skills, preferably with experience in Bash, Python, or similar languages.
Familiarity with AWS.
Experience with DevOps tools and practices, such as GitLab CI/CD, and Docker.
Troubleshooting and problem-solving skills with a knack for identifying and resolving complex technical issues.
Ability to work independently and as part of a collaborative team, effectively communicating technical concepts to both technical and non-technical stakeholders.
A passion for maintaining high availability, performance, and reliability of critical systems in a fast-paced environment

Preferred:

Experience in Financial Services industry
Knowledge of key concepts such as ACID, Normalization, and Transactions
Experience with practical APIs and abstractions

FINRA Requirements

FINRA licenses are not required and will not be supported for this role.

Work Flexibility

This role is eligible for hybrid work, with up to three days per week from home.

City:

State:

Community / Marketing Title: 2024 AFROTECH Conference Attendees Only - Site Reliability Engineer (Hybrid)

Company Profile:

Location_formattedLocationLong: Maryland, US

CountryEEOText_Description: Commitment to Diversity, Equity, and Inclusion: We strive for equity, equality, and opportunity for all associates. When we embrace the power of diversity and create an environment where people can bring their authentic and best selves to work, our firm is stronger, and we create greater value for our clients. Our commitment and inclusive programming aim to lift the experience for each associate and builds allies for our global associate community. We know that a sense of belonging is key not only to your success at the firm, but also to your ability to bring your best each day. Benefits: We invest in our people through a wide range of programs and benefits, including: • Competitive pay and bonuses as well as a generous retirement plan and employee stock purchase plan with matching contributions • Flexible and remote work opportunities • Health care benefits (medical, dental, vision) • Tuition assistance • Wellness programs (fitness reimbursement, Employee Assistance Program) Our policies may change as our working lives evolve. Yet, our commitment to supporting our associates’ well-being and addressing the needs of our clients, business, and communities is unwavering. T. Rowe Price is an equal opportunity employer and values diversity of thought, gender, and race. We believe our continued success depends upon the equal treatment of all associates and applicants for employment without discrimination on the basis of race, religion, creed, color, national origin, sex, gender, age, mental or physical disability, marital status, sexual orientation, gender identity or expression, citizenship status, military or veteran status, pregnancy, or any other classification protected by country, federal, state, or local law.