Senior Resilience Manager

Posted Apr 19

GitHub is looking for an experienced Resiliency professional with a technical bent to lead the continued growth and maintenance of GitHub’s Business Continuity & Disaster Recovery program. In addition to filling the role of a senior manager, this role will be uniquely positioned to build relationships and liaise across the enterprise and will work in close collaboration with the Director of Technical Risk. GitHub is committed to doing right by our customers and developing a BCDR program where risk is managed in a meaningful and sensible way that aligns with our business.

If you have a demonstrated record in Business Continuity and Disaster Recovery, have experience collaborating with service owners, teams, and diverse business organizations to drive enterprise objectives, and want to contribute to making the world's largest software development platform more resilient, we want to hear from you!

About the Role:

As part of the Technical Risk Team, this role will lead the Resilience Program. You will work closely with multiple groups including software engineering, infrastructure, product, security operations, and audit to develop and execute sound disaster recovery processes and technical controls to meet customer needs and address internal business objectives.

This is an excellent opportunity for an experienced manager to have a hand in elevating disaster recovery as a business and sales enabler, and to integrate a deep understanding of product and business into the resilience program space. This is a team effort, so bringing your team members, leadership, and customers along for the ride is integral to your success. Inclusion, transparency, and teamwork are central to the team's culture - we lift each other up to be successful!

The selected candidate should be comfortable interfacing with engineering, product and security services as well as business and operational organizations. The role will work with respective stakeholders and owners to prioritize the business continuity and disaster recovery efforts with the highest potential impact and support the operationalization of those efforts. Our ideal candidate takes an extremely pragmatic approach to disaster recovery, functions as part of a growing team, and is able to balance the needs of a very dynamic engineering culture with the need to protect the company and customer data.

Past experience specifically in Disaster Recovery planning supporting cataloged services in an IT, Software, Finance, Government or other similarly complex organization will stand out.


You will be the lead of the program and collaborate with your direct reports as well as the Technical Risk Director on the build out, maintenance and ongoing testing and improvement of business continuity and disaster recovery plans across GitHub. As part of this role, you will:

  • Act as a funnel of information to the rest of the Resilience and Technical Risk Teams.
  • Support the full life cycle of business continuity and disaster recovery planning, training and testing efforts of GitHub’s services and business functions.
  • Track progress of services and business functions against resilience goals, requirements and standards.
  • Coordinate with GitHub’s Site Reliability team and service owners to ensure resilience metrics are captured in and are extractable from GitHub’s service catalog in a meaningful way.
  • Advocate for the use of best practices in business continuity and disaster recovery for GitHub’s services and business functions while factoring their prioritizations and offering balanced solutions where possible.
  • Bring attention to potential high-impact, low-cost opportunities or innovations to increase resiliency across GitHub.
  • Coordinate and partner with risk functions across GitHub to (1) continuously identify risks to resiliency and (2) support mitigation work in response to identified risks via assessments and audits.
  • Provide a high level of customer service when supporting services, teams and businesses in their efforts.
  • Support the planning, execution and review of enterprise table top exercises.
  • During disruptive or potentially disruptive events, support workplace security in coordinating crisis management.
  • Act as the people manager for the team with weekly 1:1s, individual development planning, performance reviews, hiring, etc.

Required Experience:

  • 4+ years of people management experience, including experience hiring and building a team.
  • 2+ years of experience managing a distributed remote team while customizing team processes for maximum productivity and minimal burnout.
  • 4+ years of leadership experience with teams that build and maintain BCPDR resilience programs and tooling.
  • 5+ years of experience in the fields of business continuity and disaster recovery.
  • Practical experience with one or more continuity frameworks (e.g. ISO 22301) and familiarity with conforming to a continuity standard methodology in an evidenced manner.
  • 5+ years conducting and reporting on business impact analysis and other assessment activities with technical stakeholders and subject matter experts.
  • 4+ years of experience recommending redundant, fault-tolerant solutions to services and business functions.
  • 4+ years of experience specifically in Disaster Recovery planning supporting cataloged services in an IT, Software, Finance, Government or other similarly complex organization.
  • 5+ years of experience participating in the conduct of drills, tests and exercises for both individual services in a Disaster Recovery setting (e.g. failover testing) and in a business continuity setting (e.g. a table top exercise walking through manual workarounds within a business function).
  • 3+ years of experience building project plans and tracking completion, negotiating commitments and escalating on blocking issues constructively.
  • Experience standing up and/or administering applications and tooling with a growth mindset for learning scripting and automating processes.
  • Proven communication skills and ability to understand the value and drivers behind adjusting style and tone for a given audience, including technical and non-technical peers across the company.
  • Independent motivation and a high comfort level with asynchronous work environments, written communication, and use of chat tools.

Preferred Experience and Approach:

  • Your first and foremost job is to put your employees and team members first by building and maintaining a safe psychological environment based on GitHub’s Leadership Principles: Customer Obsessed - Trust by Default - Ship to Learn - Own the Outcome - Growth Mindset - Global Product, Global Team - Anything is Possible - Practice Kindness
  • Mastery at digging into problems, answering questions, and assisting colleagues both within the Resilience Team and across the company.
  • Experience or familiarity working with GitHub's products and services.
  • Proven success in developing and using metrics/KPIs to assess, report on and improve program performance.
  • Experience in an iterative, transparent environment where work is shared in draft stages.
  • Proficiency in working under ambiguous situations, with demonstrated drive to bring clarity using communication and independent research of existing documentation and resources.
  • Experience with a team-centric mindset. Drawn to collaboration with a belief that we create a better result together.
  • Expertise in functioning as a business-to-technology translator, helping bridge the business view of the Resilience Program to technical engineering and operations staff and vice versa.
  • Demonstrated confidence in ability to say "I don't know, but I will find out!" with a strong desire to learn.
  • Continuity certification (CBCP, MBCP, etc.) from the Disaster Recovery Institute (DRI) or an equivalent entity.

(Colorado only) Minimum salary of 104,400 to maximum 221,500 + bonus + equity + benefits.
Note: Disclosure as required by sb19-085 (8-5-20) of the minimum salary compensation for this role when being hired in Colorado.

Who We Are:

GitHub is the developer company. We make it easier for developers to be developers: to work together, to solve challenging problems, and to create the world’s most important technologies. We foster a collaborative community that can come together, as individuals and in teams, to create the future of software and make a difference in the world.

Leadership Principles:

Customer Obsessed - Trust by Default - Ship to Learn - Own the Outcome - Growth Mindset - Global Product, Global Team - Anything is Possible - Practice Kindness

Why You Should Join:

At GitHub, we constantly strive to create an environment that allows our employees (Hubbers) to do the best work of their lives. We've designed one of the coolest workspaces in San Francisco (HQ), where many Hubbers work, snack, and create daily. The rest of our Hubbers work remotely around the globe.

We are also committed to keeping Hubbers healthy, motivated, focused and creative. We've designed our top-notch benefits program with these goals in mind. In a nutshell, we've built a place where we truly love working, and we think you will too.

GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don't discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there's any way we can make the interview process better for you; we're happy to accommodate!

Please note that benefits vary by country. If you have any questions, please don't hesitate to ask your Talent Partner.

Misc Info For This Position:

Remote Mix: 100% Remote

Job Type: Full-Time

Regional Restrictions: USA Only

San Francisco, CA
United States