Big Data Fundamentals (BDF) – Contenuti

Contenuti dettagliati del Corso

Giorno 1

  • Course introduction
  • Fundamentals of Big Data
    • What is Big Data?
    • The V’s of Big Data
    • Data at Rest and Data in Transit (Batch vs Stream)
  • The Big Data Pipeline
    • Why a pipeline
    • Big Data and Machine Learning
    • Big Data Architecture
    • Decoupling the Architecture
  • Collecting Data
  • Data and Datastore
    • Storage solutions for huge amount of data
    • DEMO: scalable storage solutions
    • Databases, SQL and NoSQL
    • Introduction to Graph DB
    • Database as a Service, benefits
    • DEMO: Database as a service
  • Big data Processing and Analytics
    • Introduction to big data Processing and Analytics
    • How to perform simple querying
    • Ad hoc analytics
    • DEMO: ad hoc analytics

Giorno 2

  • Data Warehouse
    • On premises vs Managed data warehouse
  • Data Lake
    • Introduction to Data Lake
    • Single source of truth
    • Data Lake solutions
  • Hadoop & Map Reduce Fundamentals
    • Hadoop EcoSystem
    • Map Function and Reduce Function
    • MapReduce VS RMDBS
    • Hadoop Frameworks
    • Hadoop in the cloud
    • DEMO: Hadoop in the cloud (EMR & Dataproc)
  • Serverless Pipeline
    • What serverless means
    • Why going serverless
    • DEMO: creting a serverless pipeline
  • Data Visualization
    • Business Intelligence tools
    • Elastic Search, Logstash and Kibana
    • DEMO: how to visualize data
  • Big Data Solutions in Real World
  • Typical Business Use Case