Role Summary:
GCP Data Engineer with 2–3 years of experience in building and supporting data pipelines on Google Cloud Platform. The resource will work on data ingestion, transformation, and loading into BigQuery and support reporting/analytics teams. The role involves Python, SQL, and workflow scheduling.
Key Skills:
Google Cloud Platform (GCP)
BigQuery
Cloud Storage (GCS)
Cloud run and Cloud Function
Pub/Sub (basic knowledge)
Python
SQL
ETL/ELT concepts
JSON & file processing (CSV/Parquet)
Git basics
Roles & Responsibilities:
Develop and maintain ETL pipelines using Python and SQL.
Load and transform data into BigQuery.
Create and manage Cloud run and Cloud Function.
Perform data validation and fix data issues and perform reprocessing/backfills.
Monitor jobs and handle failures.
Coordinate with upstream/downstream teams during failures.
Work with data scientists to provide clean datasets.
Optimize queries and improve pipeline performance
Support deployment and production issues.