Site Reliability Engineer, Senior in Washington, DC at Booz Allen Hamilton Inc.

Date Posted: 10/27/2018

Job Snapshot

Job Description

Job Number: R0027638

Site Reliability Engineer, Senior

Key Role:
Work as a site reliability engineer and systems lead, including leveraging deep and broad technical knowledge, commanding the shell fluently, and growing a team of engineers in charge of the resiliency and performance of the production infrastructure while designing and implementing innovations that improve software engineering velocity, infrastructure resiliency and security, and data availability. Act as the subject matter expert (SME) and maintain responsibility for multiple proprietary and open source technologies, including developing training, mentoring, defining standards for configuration, monitoring, reliability, and performance, and coordinating and performing major upgrades with minimal downtime. Provide an expert’s perspective on the capabilities and limits of the multi-data center production infrastructure and collaborate with software development teams to ensure the needs of IT operations are incorporated into release plans while streamlining administrative functions, metrics, and other troubleshooting best practices into the platform's code base. Assess live performance and stability issues in production, analyze the formulation of a plan to prevent further recurrences, develop and manage metric collection and reporting, and establish and manage on-call rotations for the team.

Basic Qualifications:
-5+ years of experience with designing and managing services in a distributed Internet-scale Linux environment
-4+ years of experience with scripting in Shell, Perl, or Python
-2+ years of experience with managing a team of system administrators or infrastructure engineers
-1+ years of experience with configuration management in Salt, Ansible, or Puppet
-TS/SCI clearance with a polygraph
-BS degree and 8 years of experience with IT

Additional Qualifications:
-Knowledge of edge cases and risk mitigation strategies to be applied to every change
-Ability to motivate technology and process innovations
-Ability to comprehend how code, processes, and systems fit together quickly
-Ability to disseminate technical details one minute and career feedback the next
-Ability to advance multiple projects simultaneously
-BS degree in CS or a related technical field
-CISSP, RHCE, or Saltstack Certification

Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information; TS/SCI clearance with polygraph is required.

We’re an EOE that empowers our people—no matter their race, color, religion, sex, gender identity, sexual orientation, national origin, disability, or veteran status—to fearlessly drive change.


Your Career is Waiting.

Get job alerts. Learn about new work and upcoming events. Share open roles with friends and colleagues.
Our Talent Network is your opportunity hub.

Get Answers and Access.

Need more information? Find it in our FAQs.

Application already in-process? Log in to keep going.