Strong program management skills to work with cross-functional infrastructure and application owners to assess, design, and implement technical solutions for the Disaster Recovery (DR) program
Ability to troubleshoot, identify root cause, and seek formal solutions with other teams
Hands-on experience with data center, networking, security, storage, virtualization, database, and middleware technologies
Solid understanding of program management and leadership skills to engage various teams
Expertise in evaluating and designing complex IT infrastructure solutions across a vast range of technologies
Responsibilities
Independently implements technical solutions and delivers projects end-to-end working autonomously, while providing updates to stakeholders
Performs analysis of complex functional and business requirements. Prepares and delivers solutions for others. Leads design activities, working with architects from various technical teams
Evaluates, develops, and oversees resiliency strategy across development and engineering teams
Advances the DR automation effort of applications and business capabilities
Collaborates closely with teams participating in strategic planning, facilitating cross-team solution reviews, and communicating strategy direction
Maintains an effective technical network across technical SMEs and architects for multiple service areas
Configures and integrates interfaces with existing systems with the DR Orchestration tool. Integrates project management, disaster recovery, and functional business expertise to create customized solutions
Works with teams to assess and implement high availability and seamless failover resiliency mechanisms across multiple layers of the application and infrastructure stacks
Develops documentation for onboarding of applications onto the toolset, creates training modules, and applies project manager expertise to ensure project milestones are met
Applies new technologies and designs highly complex infrastructure and software solutions
Maintains accurate documentation of disaster recovery plans, procedures, and incident response protocols
Leads disaster recovery efforts in the event of a disruption, coordinating the response and recovery activities
Develops and provides training to employees on disaster recovery and business continuity procedures to enhance preparedness
Stays updated on industry best practices, emerging technologies, and evolving threats to continually improve disaster recovery capabilities