Job Details


Software Engineer


Disaster Recovery Engineer  San Francisco, CA  Posted: 7/26/2019
Job Description

Job ID#:

8801

Job Category:

Software Engineer

Position Type:

Staff Aug

Duration:

6mos+ CTH


Details:

We are looking for an exceptional Disaster Recovery Engineer. As an engineer on the disaster recovery team, you'll be tasked with creating technical recovery plans and designing and completing large-scale disaster recovery tests. You will be helping engineering teams perform chaos in their systems, collect evidence of successful testing, and drive remediations to completion. As a Disaster Recovery Engineer you will need to ensure DR is effective in a high availability environment, consisting of both colocation data centers and cloud infrastructure.

Scope of work includes:
* Engaging with engineers across the organization to design disaster recovery tests
* Lead large-scale failure tests across a wide range of products
* Write load and chaos scripts to meet DR test requirements
* Assist the Disaster Recovery Program Manager with completing disaster recovery plans
* Expanding the scope of the current program to cover all corporate and production systems
* Scope and gather system resilience information
* Work with external auditors on disaster recovery test results and documentation
* Working closely with engineering teams to validate their recovery solutions

 
Job Requirements

 
Details:

Experience:
* Strong background leading complex failure tests in a cloud environment
* An understanding of chaos engineering and how to do chaos testing
* 3+ years coding experience preferably in Python or Go
* 3+ years working in system fault tolerance, resilience engineering, or chaos engineering
* Experience working in a DevOps environment and an understanding of the engineering lifecycle including test automation, bug tracking, and design documentation
* Ability to lead teams through complex engineering problems and technical recovery tests
* Ability to present recovery test results to senior leaders and drive continual improvement on recovery solutions
* Cloud experience preferably AWS




 

Already have an account? Log in here