/
Tech-study-notes

Google Cloud Platform - Professional Data Engineer

Note: Exam attended and passed in September 2023.


Data Lifecycle in Data Engineering

  1. Ingest: pull in the raw data, such as streaming data from devices, on-premises batch data, app logs, or mobile-app user events and analytics
  2. Store: the retrieved data needs to be stored in a format that is durable and easily accessible
  3. Process and analyze: the data is transformed from raw form into actionable information
  4. Explore and visualize: convert the results of the analysis into a format that is easy to draw insights from and to share with colleagues

Some of the :

GCP Services
Some of the available services at each step (source)

Storage

GCP offers various storage solutions for different use cases:


Compute

Compute services for data processing:


Data Processing

Services for data transformation and processing:


Machine Learning

ML services for data engineers:


Data Visualization

Tools for data visualization and exploration:


Data Transfer

Services for moving data in and out of GCP:


Management

Tools for managing GCP resources:


IAM (Identity and Access Management)

Security and access control:


Sample Questions