+

We'd love to work with you!

By answering these 7 questions, we can find out if we're a match

Running Cloudera Public Cloud

Course Details Find Out More
Code ADMIN-336
Tuition (CAD) Array
Tuition (USD) Array

CDP Public Cloud Administrator Training provides participants with a comprehensive understanding of all the steps required to configure, operate, and maintain CDP Public Cloud instances. This four (4) day instructor-led course covers everything from setup to configuring various data services to execute workloads on the cloud on all major cloud providers using Cloudera Management Console. It also covers various configuration options using the web interface and automation scenarios using Ansible. On the optimization side, it covers load balancing and tuning CDP PC instances. This Cloudera training course is the best preparation for the real-world challenges faced by administrators running CDP Public Cloud.

Who Can Benefit

  • This course is best suited to cloud systems administrators and operators who have at least basic Linux and AWS/Azure/GCP experience.

Skills Gained

  • Through instructor-led discussion and interactive, hands-on exercises, you will learn how to:
  • Evaluate and select the appropriate deployment option
  • Setup CDP Public Cloud using Cloudera Management Console
  • Setup and configure various data services
  • Configure and monitor instances using Cloudera Manager
  • Optimize cluster performance and security
  • Detect, troubleshoot, and repair problems with the cluster
  • Auto scale Data Hub clusters and Data Services

Prerequisites

  • Prior knowledge of CDP, nor earlier platforms such as Cloudera’s CDH or Hortonworks HDP, is not required but will be helpful.

Course Content

Installation Overview (Quick Start)

  • Cloudera Management Console
  • CDP Credentials
  • CDP Control Plane Regions
  • Register a CDP environment
  • Cloudera Data Platform
  • Industry Trends for Big Data
  • The Challenge to Become Data-Driven
  • The Enterprise Data Cloud
  • CDP Overview
  • CDP Form Factors

CDP Architecture

  • Overview
  • Key Concepts & Components
  • CDP Runtime Overview
  • Minimum Hardware
  • Outbound Connections

Control Plane Overview

  • Accessing and Managing an Environment
  • Data Management Overview
  • Management Console
  • Dashboard
  • Environments
  • Data Lakes
  • User Management
  • Classic Clusters
  • Data Hubs
  • Data Catalog
  • Replication Manager
  • Observability

CDP CLI (Command Line Interface)

  • CDP CLI Command Line Interface
  • Installing CDP CLI / CLI Client Setup
  • CLI Modules
  • Generating an API access key / Configuring CDP client
  • Logging into the CDP CLI/SDK
  • Configuring CLI autocomplete / CLI reference /Accessing CLI help
  • CDP API overview / CDP SDK for Java overview / CDP curl overview

Managing CDP Access

  • Management Console
  • User Management
  • Create Machine User
  • User Permissions
  • Sync Users
  • Configure Groups
  • Identity Providers
  • Roles and Resource Roles
  • Global Settings
  • Audit Data Storage Credential

Data Hubs Overview

  • Data Hubs
  • Planning / Creating your DataHub Cluster
  • General Planning Considerations
  • Configuring Nodes
  • Managing Data Hub
  • Choosing the Right Hardware
  • Advanced Cluster Configuration
  • Data HubTypes
  • DataFlow
  • Data Engineering
  • • Troubleshooting

Managing Data Hubs

  • Best Practices on Data Hubs
  • Sizing Data Hubs
  • Cloudera Manager
  • DataHub Services
  • Autoscaling/Data Hub Info
  • Checking Cluster Health Status / Events and Alerts
  • Host Maintenance
  • Upgrading a Data Hub Cluster
  • Monitoring / Monitoring Features

Data Services Overview

  • • Data Services Overview
  • DataServices
  • Planning Your DataService Cluster
  • Choosing the Right Hardware / Network Considerations
  • Creating Data Services
  • DataFlow
  • Data Engineering
  • Data Warehouse
  • Operational Database
  • Machine Learning
  • Troubleshooting

DataFlow

  • DataFlow Service Overview
  • Data Ingest Overview
  • Ingesting Data using File Transfer or REST Interfaces
  • Ingesting Data Using NiFi
  • Autoscaling

Data Engineering

  • Data Engineering Service Overview
  • Apache Spark/Flink/Kafka streams Overview
  • Autoscaling

Data Warehouse

  • Data Warehouse Service Overview
  • Adding and Managing a Database Catalog
  • Adding and Tuning a Virtual Warehouse
  • Querying a Data Warehouse
  • Data Visualization
  • Monitoring & Troubleshooting

Operational Database

  • Operational Database Service Overview
  • Apache HBase/Search Overview
  • Autoscaling

Machine Learning

  • Machine Learning Service Overview
  • CML Engines
  • Requirements for CML Workspaces
  • Provisioning a CML Workspace
  • CML Auto-Scaling
  • Monitoring

Monitoring and Management

  • Monitoring and Management in CDP Public Cloud
  • Data Lake Cluster Monitoring and CDP Auditing
  • Getting Started with Monitoring in CDP
  • Monitoring with Cloudera Manager: Health Tests and Dashboards
  • Monitoring Clusters, Services, Hosts, Roles, and Activities
  • Troubleshooting Cluster Configuration and Operation

Data Management

  • SDX -Security and Governance
  • Security Concepts
  • Access Cloud Storage
  • Data Lake Security: SDX
  • Apache Ranger
  • CDP Authorization / Authentication
  • Data Governance
  • Apache Atlas
  • Data Catalog

Observability

  • Overview
  • Support
  • Observability deployment architecture
  • Monitoring capabilities
  • Working with alerts, costs,and reports

Find Out More

  • This field is for validation purposes and should be left unchanged.