TECHEAD is celebrating over thirty-five years of incredible heritage, talent, and accomplishments! To learn more about TECHEAD, visit us at TECHEAD.com or on Glassdoor.
Cloudera Administrator
12 Month Contract
Remote
We are seeking a highly skilled Cloudera Big Data Administrator to join our team. This is an administrative role, not a developer position, focused on managing and optimizing Cloudera-based big data infrastructure in a cloud environment. The ideal candidate will have experience setting up and managing Cloudera clusters and related services such as NiFi, SOLR, HBase, Kafka, and Knox within CDP Public Cloud v7.2.17 or higher. The role involves ensuring high availability of services, monitoring cluster health, and collaborating with cross-functional teams to improve performance and resolve issues.
Key Responsibilities:
- Cloudera Cluster Management: Build and maintain Cloudera clusters on the BDPaaS platform, ensuring optimal configuration and high availability of critical services like Hue, Hive, HBase, SOLR, and Impala.
- Service Monitoring and Health Checks: Use Cloudera Manager to monitor the health of services running in production clusters, and implement shell scripts for proactive health checks and failure response.
- Database Management: Perform administrative tasks on Hadoop databases and metastore tables, and interact with data through Hive and Impala queries in Hue.
- Cross-Team Collaboration: Work closely with application development, security, and platform support teams to implement necessary configuration changes for enhanced performance and stability.
- Cluster Configuration & Troubleshooting: Assist in configuring and troubleshooting Kafka, HBase, SOLR, and other Cloudera components to ensure smooth operations and optimal resource utilization.
- Kafka Administration: Oversee Kafka broker management, topic/offset management, and integration with other systems (e.g., IBM MQ), ensuring high availability and efficient stream processing.
- NiFi Administration: Manage Apache NiFi, including flow management, controller service configuration, and integration with systems like Kafka, HBase, and SOLR.
- Security & Compliance: Implement and enforce security policies across the Cloudera cluster, including Kerberos and Ranger for access control and auditing.
- Automation & Scripting: Develop and maintain automation scripts for monitoring, backup, and maintenance tasks, using tools like Python and Shell scripting.
- Upgrades & Patching: Perform regular updates and upgrades to the Cloudera environment, including CDP, CDSW, Hive, Spark, and NiFi, to maintain the latest features and security patches.
- Big Data Pipeline Design: Assist with the design and implementation of big data pipelines and automated data flows using Python, R, and NiFi for analytics and machine learning.
- Troubleshooting and Performance Tuning: Address and resolve performance issues, deadlocks, replication issues, and other database problems within the Hadoop ecosystem. Implement query optimization and performance tuning strategies.
- Streaming Technology Support: Support and manage streaming technologies such as Kafka, Spark, and Kudu within the cluster.
Qualifications:
- Expertise with Cloudera CDP (Public Cloud v7.2.17 or higher), including managing and configuring services like Hive, Spark, NiFi, and Impala.
- Proven experience with Apache Kafka administration, including broker management, topic/offset management, and integration with external systems (e.g., IBM MQ).
- Strong hands-on experience in NiFi administration, including flow management, registry server management, and integration with Kafka, HBase, and SOLR.
- Experience with HBase database management, including troubleshooting and optimization.
- Proficient in managing SOLR configurations, including shard management, high availability setup, and query optimization.
- Expertise with AWS services like EC2, S3, EBS, and EFS for managing cloud-based infrastructure.
- Ability to develop and maintain automation scripts (using Python or Shell scripting) to streamline operations and monitor system health.
- Strong understanding of Kerberos, TLS/SSL, and Ranger for implementing security policies in a Hadoop ecosystem.
- Experience with performance tuning, troubleshooting database integrity issues, and managing resource allocation with YARN.
- Hands-on experience managing big data pipelines and automating data flows using NiFi and Python.
- Proficiency in managing data ingestion, analytics/ML workflows, and cluster deployments in a cloud environment.
- Streaming technology expertise, particularly with Kafka and Spark, for real-time data processing.
Desirable Skills:
- Knowledge of Kudu and its integration with Spark for real-time analytics and processing.
- Experience working with Apache Ranger for securing Hadoop services and ensuring data governance and compliance.
- Familiarity with AWS Cloud architecture, EC2, S3, and related cloud services.
- Experience with cluster migration, expansion, and disaster recovery processes in a cloud environment.
TECHEAD's mission is to make our on-site associates successful by placing them in the right environment so they can grow and prosper. How we treat and respond to our clients and employees is a reflection of who we are and makes us stand out from the rest. Keeping our business focused on building and maintaining relationships with our employees and clients is the key to our success. We won't strive for anything less.
TECHEAD provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state and local laws governing non-discrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.
No second parties will be accepted.