Topic: A Big Data Hadoop and Spark project for absolute beginners (9/2020)  (Read 101 times)

Posted by: mitsumi (Global Moderator)

A Big Data Hadoop and Spark project for absolute beginners
Video: .mp4 (1280x720, 30 fps(r)) | Audio: aac, 44100 Hz, 2ch | Size: 1.92 GB
Genre: eLearning Video | Duration: 37 lectures (4 hours, 58 mins) | Language: English

Hadoop, Spark, Python, PySpark, Scala, Dataproc, AWS S3 Data Lake, Glue, Athena

What you'll learn

    Big Data, Hadoop, and Spark from scratch using Python and Scala. You will also learn how to use free cloud tools to get started with Hadoop and Spark programming in minutes. Additionally, you will find two bonus projects: an AWS data lake solution and a Machine Learning classification model.

Requirements

    Students should have some programming background and some knowledge of SQL queries.

Description

A bank is launching a new credit card and wants to identify prospects it can target in its marketing campaign.

It has received prospect data from various internal and third-party sources. The data has various issues, such as missing or unknown values in certain fields. The data needs to be cleansed before any kind of analysis can be done.

Because the data set is very large, with billions of records, the bank has asked you to use Big Data technologies, Hadoop and Spark, to cleanse, transform, and analyze it.
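
The cleansing step described above can be sketched in plain Python to show the logic before it is scaled out with Spark. This is only an illustration under stated assumptions, not the course's code: the field names (`id`, `age`, `job`), the sample records, the helper names, and the decision to treat empty strings and "unknown" as missing are all hypothetical.

```python
# Hypothetical markers for "missing or unknown" values (an assumption).
MISSING_MARKERS = {"", "unknown", "n/a"}


def is_missing(value):
    """Return True if a field value should be treated as missing."""
    return value is None or str(value).strip().lower() in MISSING_MARKERS


def cleanse(records, required_fields, defaults=None):
    """Drop records missing a required field; fill other gaps from defaults."""
    defaults = defaults or {}
    cleaned = []
    for record in records:
        if any(is_missing(record.get(f)) for f in required_fields):
            continue  # a prospect without its required fields is unusable
        fixed = dict(record)  # copy so the input records are not mutated
        for field, fallback in defaults.items():
            if is_missing(fixed.get(field)):
                fixed[field] = fallback
        cleaned.append(fixed)
    return cleaned


# Illustrative sample data only.
prospects = [
    {"id": 1, "age": 34, "job": "admin"},
    {"id": 2, "age": None, "job": "unknown"},
    {"id": 3, "age": 51, "job": "Unknown"},
]

cleaned_prospects = cleanse(
    prospects, required_fields=["age"], defaults={"job": "other"}
)
print(cleaned_prospects)
```

In PySpark the same idea would typically be expressed with DataFrame operations such as `dropna` and `fillna` rather than a Python loop, so that the work is distributed across the cluster.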

What you will learn:

    Big Data, Hadoop concepts

    How to create a free Hadoop and Spark cluster using Google Dataproc

    Hadoop hands-on - HDFS, Hive

    Why there was a need for Spark

    Python basics

    PySpark RDD - hands-on

    PySpark SQL, DataFrame - hands-on

    Project work using PySpark and Hive

    Scala basics

    Spark Scala DataFrame

    Project work using Spark Scala

    Google Colab environment

    Bonus project - Applying Spark transformations to data stored in AWS S3 using Glue and viewing the data using Athena

Prerequisites:

    Some basic programming skills

    Some knowledge of SQL queries

Who this course is for:

    Beginners who want to learn Big Data or experienced people who want to transition to a Big Data role
