Position: Hadoop Admin

Location: Washington DC

Duration: Contract to hire

Description :

Independently installs and maintains Big Data (Cloudera, Horton Works, etc.) clusters in high available, load balanced configuration across multiple (Production, User Acceptance, Development) environments. Under general supervision, manage Big Data Administration activities, technical documentation, system performance support, and internal customer support. May provide input into the development of Systems Architecture for mission critical corporate development projects. The incumbent will work with Solutions Architects, Infrastructure Architects, Lead Big Data Administrator, Big Data Supplier and Developers to setup environments and support the development teams. Cloudera Administration
install/Configure/Maintain Apache Cloudera tools like Hive, Impala, Sqoop, Flume, Kafka, HBase, SOLR and File formats like Avro, Parquet. Develop the scripts to Monitor the services on top of the Clusters and send out the Alerts if necessary. Working on the user Incidents who have issues with the Cloudera Services and assisting them accordingly. Developing the Best practices documentation to the Application Developer partners to educate how to use the Cloudera Services. Write the shell scripts to monitor the health check of Hadoop daemon services and respond accordingly to any warning or failure conditions. Monitoring the health of all the services running in the production cluster using the Cloudera Manager. Performing/Accessing the databases, Metastore tables and writing Hive, Impala queries using HUE. Setting up the High Availability of the Services like Hue, Hive, HBase REST, SOLR and IMPALA on top of the all new clusters that were built on the BDPaas Platform. Responsible for monitoring the health of the Services on top of the all clusters. Working closely with different teams like Application development team, Security team, Platform Support to identify and implement the Configurational changes that are needed on top of the cluster for better performance of the services.
apache Kafka – strong Administration & troubleshooting skills
Kafka Streams API
stream processing with KStreams & Ktables
Kafka integration with MQ
Kafka broker management
Topic management
offset management
Apache Nifi – Administration
Flow management
registry server management
controller service management
Nifi to kafka integration
Nifi to Hbase integration
Nifi to solr integration
Flume – integration with Kafka , Nifi & IBM MQ
Hbase – administration
database management
solr – administration
managing Logging level
managing shards
managing shard high availability
collection management
troubleshooting resource intensive & long running solr queries

– provided by DiceTracking

To Apply: https://www.jobg8.com/Traffic.aspx?d6px1HJYfSMmKUGsLp3NAwq