Building Data Lakes on AWS (BDLA)

 

Course Overview

In this course, you will learn how to build an operational Data Lake that supports the analysis of structured and unstructured data. You will learn about the components and functions of the services that are involved in creating a Data Lake. You will use AWS Lake Formation to build a data lake, AWS Glue to build a data catalog, and Amazon Athena to analyze data. The course presentations and exercises deepen what you have learned by analyzing several common data lake architectures.

Quem deve participar

This course is designed for:

  • Solutions Architects
  • Big Data Developer
  • Data Architects and Analysts
  • Other data analysis experts

Certificação

Este curso é parte das seguintes certificações:

Pré- requisitos

We recommend that participants in this course meet the following requirements:

  • Good practical knowledge of key AWS services such as Amazon Elastic Compute Cloud (EC2) and Amazon Simple Storage Service (S3)
  • Experience with a programming or scripting language
  • First knowledge of the Linux operating system and the command line interface
  • Notebook required to take part in the exercises, tablets are not suitable

Objetivos do Curso

What you will learn in this course:

  • Collect large amounts of data with services like Kinesis Streams and Firehose and store data securely and long term in Amazon Simple Storage Service.
  • Create a metadata index of your data lake.
  • Choose the best tools to capture, store, process, and analyze your data in Data Lake.
  • Applying the knowledge in hands-on labs where hands-on experience can be gained by building a complete solution.

Conteúdo do curso

The course covers the following concepts:

  • The key services for building a serverless Data Lake architecture
  • A data analysis solution that addresses the capture, storage, processing, and analysis workflows
  • Repeatable deployment of templates to implement a Data Lake solution
  • Create a metadata index and enable search
  • Set up a large data transfer pipeline for multiple data sources
  • Data transformation using simple functions triggered by events
  • Data processing using the appropriate tools and services for the application
  • Available options for optimized analysis of processed data
  • Best practices for deployment and operations

Preços & Delivery methods

Treinamento online

Duração
1 dia

Preço
  • Solicitar orçamento
Classroom training

Duração
1 dia

Preço
  • Solicitar orçamento

Click on town name or "Online Training" to book Agenda

Instructor-led Online Training:   Este é um curso Instructor-Led Online
This is a FLEX course, which is delivered both virtually and in the classroom.

Europa

Alemanha

Este é um curso FLEX. Hamburgo Inscrever
Treinamento online Fuso horário: Central European Time (CET) Inscrever
Este é um curso FLEX. Munich Inscrever
Treinamento online Fuso horário: Central European Summer Time (CEST) Inscrever

Reino Unido

Treinamento online Fuso horário: British Summer Time (BST) Inscrever

Suíça

Este é um curso FLEX. Zürich Inscrever
Treinamento online Fuso horário: Central European Time (CET) Inscrever
Este é um curso FLEX. Zürich Inscrever
Treinamento online Fuso horário: Central European Summer Time (CEST) Inscrever

itália

Treinamento online Fuso horário: Central European Time (CET) Inscrever
Treinamento online Fuso horário: Central European Summer Time (CEST) Inscrever