Principal Infrastructure Services (SRE) at Northern Trust

Pune, Maharashtra, India

Northern Trust Logo
Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Financial Services, Banking, FinTechIndustries

Requirements

  • Bachelor's degree or equivalent experience
  • 10+ years in systems engineering with a focus on reliability, systems operations, and software engineering
  • 5+ years as a Team lead or a hands-on Technical Manager role that can engage and deliver projects to completion
  • Strong proficiency in programming languages such as Python, Go, Ruby, Java, etc
  • Experience with both on-prem and cloud solutions
  • Experience with containerization
  • Demonstrated ability to design and implement systems that ensure observability with associated dashboards
  • Deep understanding of distributed systems, networking, and modern software architectures
  • Excellent problem-solving skills and ability to handle complex technical challenges
  • Strong dedication to customer needs, with excellent communication and the ability to build lasting relationships, alongside the capability to articulate complex reliability strategies in a clear and impactful manner
  • Prior experience delivering Infrastructure as Code via a CI/CD pipeline

Responsibilities

  • Lead the design and architecture of providing reliability, scalability, and performance of critical complex systems
  • Develop and maintain automation scripts and tools to streamline operations and reduce manual tasks
  • Oversee system performance transparency
  • Collaborate with root cause analysis and implement measures to prevent recurrence of issues
  • Design and implement comprehensive monitoring and observability solutions to proactively detect and address issues prior to them impacting our business
  • Develop and maintain dashboards and alerts to provide real-time insights into system health
  • Identify opportunities for improving system reliability through process enhancements and technical solutions
  • Create and maintain detailed documentation of systems, processes, and procedures
  • Communicate effectively with stakeholders across different teams and levels within the organization
  • Manage and prioritize multiple projects and initiatives related to reliability and performance improvements
  • Collaborate with product, development, and operations teams to align SRE efforts with overarching business goals

Skills

Site Reliability Engineering
SRE
Observability
Automation
DevOps
System Architecture
Capacity Planning
Performance Management
Deployment
Release Engineering
Incident Response
Root Cause Analysis
Linux
Python
Prometheus
Grafana
Terraform

Northern Trust

About Northern Trust

N/AHeadquarters
N/AYear Founded
N/ACompany Stage

Land your dream remote job 3x faster with AI