Data Engineer, Lead in Beavercreek, OH at Booz Allen Hamilton Inc.

Date Posted: 5/14/2018

Job Snapshot

Job Description

Job Number: R0028559

Data Engineer, Lead

Key Role:

Design, implement, and manage databases and data delivery systems, including transform it into beautiful insights, analyses, and reports. Exhibit expertise in database design and implementation tools, including entity-relationship data modelling and SQL, distribute computing architectures, operating systems, storage technologies, and memory management. Create structure and value out of complex and ambiguous technical challenges with little guidance using networking techniques. Exhibit knowledge of structured and unstructured data, streaming and batch data processing, including ETL, data wrangling, data ingest, and data access.


Basic Qualifications:

-2+ years of experience with data engineering and wrangling

-Experience with distributed computing technologies, including Hadoop, Hive, Spark, and AWS EMR

-2+ years of experience in building highly scalable ETL pipelines, including Kylo, NiFi, Spark or Cloud native or open source data technologies

-Experience in modeling, organizing, structuring and sustaining physical data assets in the context of a Data Lake architecture

-Knowledge of relational and dimensional modeling techniques

-Ability to obtain a security clearance

-HS diploma or GED


Additional Qualifications:

-Experience in applying principles, best practices and trade-offs of schema design to various types of database technologies, including relational, columnar, graph and NoSQL

-Experience with traditional Data Warehousing and business intelligence tools, techniques and technologies

-Experience with implementing Web services, including SOAP and RESTful APIs using microservices architectures and design patterns, including the 12 Factor App methodology

-Experience with implementing rapid response query solutions on Big Data platforms leveraging Solr, Elasticsearch, or scalable query engine implementations

-Experience in implementing batch and real-time Big Data integration frameworks and applications, in private or public Cloud, preferably AWS, using various technologies, including Hadoop, Spark, Impala, debugging, identifying performance bottlenecks and finetuning those frameworks

-Experience with SQL, including Python, Scala and Java

-Knowledge of relational and dimensional modeling techniques

-Ability to conduct research on emerging technologies and industry trends, independently

-Amazon Certified Solutions Architect and Associate


Clearance:
Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information.

We’re an EOE that empowers our people—no matter their race, color, religion, sex, gender identity, sexual orientation, national origin, disability, or veteran status—to fearlessly drive change.

Your Career is Waiting.

Get job alerts. Learn about new work and upcoming events. Share open roles with friends and colleagues.
Our Talent Network is your opportunity hub.


Get Answers and Access.

Need more information? Find it in our FAQs.

Application already in-process? Log in to keep going.