Site Reliability Engineer, Lead in Washington, DC at Booz Allen Hamilton Inc.

Date Posted: 4/21/2018

Job Description

Job Number: R0019372

Booz Allen Hamilton has been at the forefront of strategy and technology for more than 100 years. Today, the firm provides management and technology consulting and engineering services to leading Fortune 500 corporations, governments, and not-for-profits across the globe. Booz Allen partners with public and private sector clients to solve their most difficult challenges through a combination of consulting, analytics, mission operations, technology, systems delivery, cybersecurity, engineering and innovation expertise.

Site Reliability Engineer, Lead

Key Role:

Work as a site reliability engineer and systems lead, applying deep and broad technical knowledge and fluent command of the shell, and grow a team of engineers in charge of the resiliency and performance of the production infrastructure while designing and implementing innovations that improve software engineering velocity, infrastructure resiliency and security, and data availability. Act as the subject matter expert (SME) and maintain responsibility for multiple proprietary and open source technologies, including developing training, mentoring, defining standards for configuration, monitoring, reliability, and performance, and coordinating and performing major upgrades with minimal downtime. Provide an expert’s perspective on the capabilities and limits of the multi-datacenter production infrastructure and collaborate with software development teams to ensure the needs of IT operations are incorporated into release plans while building administrative functions, metrics, and troubleshooting best practices into the platform's code base. Assess live performance and stability issues in production, formulate plans to prevent recurrences, develop and manage metric collection and reporting, and establish and manage on-call rotations for the team.

Basic Qualifications:

-5+ years of experience with designing and managing services in a distributed, Internet-scale Linux environment

-4+ years of experience with scripting in Shell, Perl, or Python

-2+ years of experience with managing a team of system administrators or infrastructure engineers

-1+ years of experience with configuration management in Salt or an equivalent technology, including Ansible or Puppet

-TS/SCI clearance; willingness to take a polygraph exam

-BS degree and 8 years of experience with IT

Additional Qualifications:

-Knowledge of edge cases and risk mitigation strategies to be applied to every change

-Ability to motivate technology and process innovations

-Ability to comprehend how code, processes, and systems fit together quickly

-Ability to disseminate technical details one minute and career feedback the next

-Ability to advance multiple projects simultaneously

-BS degree in CS or a related technical field

-CISSP, RHCE, or Saltstack Certification


Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information; TS/SCI clearance is required.

Integrating a full range of consulting capabilities, Booz Allen is the one firm that works by its clients' side to solve their toughest problems and help them achieve their missions. Booz Allen is committed to delivering results that endure.

We are proud of our diverse environment. EOE, M/F/Disability/Vet.

