All jobs

[Remote] Data Engineer - GCP

100% Remote Full-time Open now

Note: The job is a remote job and is open to candidates in USA. The Data Sherpas is a dynamic team focused on building innovative and scalable data solutions on Google Cloud Platform (GCP). They are seeking an experienced Google Cloud Data Engineer to design, develop, and manage scalable data pipelines and data infrastructure, ensuring data availability, accuracy, and performance for business insights and machine learning models.

Responsibilities

  • Design, build, and maintain scalable and reliable data pipelines using Cloud Dataflow, Cloud Pub/Sub, and Cloud Composer
  • Develop ETL/ELT processes to process and transform large volumes of structured and unstructured data
  • Optimize data pipeline performance, scalability, and reliability
  • Ensure data processing and ingestion workflows are monitored and meet performance SLAs
  • Design and implement data storage solutions using BigQuery, Cloud Storage, and Firestore
  • Optimize data structures and partitioning for performance and cost efficiency
  • Ensure data security, integrity, and availability in all storage solutions
  • Manage data lifecycle policies and archiving processes
  • Develop data transformation processes using BigQuery, Apache Beam, and Cloud Functions
  • Implement data quality checks, validation rules, and monitoring solutions
  • Support real-time and batch data processing needs
  • Integrate data from multiple sources, including APIs, databases, and third-party applications
  • Automate data ingestion, transformation, and export using tools like Cloud Composer and Cloud Functions
  • Ensure data consistency across different environments and systems
  • Work closely with data scientists and analysts to understand data needs and business goals
  • Provide technical guidance and best practices to the data engineering and business teams
  • Collaborate with security and compliance teams to ensure data governance standards are met
  • Monitor data pipeline performance and troubleshoot issues in real-time
  • Analyze data pipeline failures and implement fixes to prevent recurrence
  • Set up logging and monitoring using Stackdriver and Cloud Monitoring

Skills

  • Bachelor's degree in Computer Science, Data Engineering, or a related field; Master's degree is a plus
  • 3+ years of experience in data engineering, with at least 2+ years working with Google Cloud Platform
  • Google Professional Data Engineer certification is required
  • Strong proficiency with GCP services such as BigQuery, Cloud Dataflow, Cloud Composer, Cloud Pub/Sub, Firestore, and Cloud Functions
  • Hands-on experience with big data tools and frameworks such as Apache Beam, Hadoop, Spark, or Flink
  • Proficiency in programming languages such as Python, Java, or Scala
  • Strong knowledge of SQL, data modeling, and query optimization
  • Experience with CI/CD tools and version control (e.g., Git, Cloud Build)
  • Strong understanding of data governance, security, and compliance requirements
  • Ability to manage large-scale data processing and real-time data pipelines
  • Excellent problem-solving, analytical, and communication skills
  • Experience with machine learning pipelines and AI/ML model deployment
  • Familiarity with Terraform and Infrastructure as Code (IaC) principles
  • Experience with NoSQL databases and key-value stores on GCP
  • Knowledge of containerization and orchestration using Google Kubernetes Engine (GKE)

Benefits

  • Competitive salary and performance-based incentives.
  • Comprehensive health, dental, and vision coverage.
  • Professional development and training opportunities (including GCP certification).
  • Flexible work environment and remote work options.

Company Overview

  • The Data Sherpas emerge as a beacon of expertise and innovation in the rapidly evolving digital era. It was founded in 2010, and is headquartered in San Francisco, California, USA, with a workforce of 11-50 employees. Its website is https://www.thedatasherpas.com.
  • Apply To This Job

    You might also like

    [Remote] Traffic Performance Analyst

    100% Remote Full-time

    [Remote] Account Executive, Enterprise Automotive

    100% Remote Full-time

    [Remote] Sr. Partner Sales Manager, Financial Services

    100% Remote Full-time

    [Remote] Concept Designer

    100% Remote Full-time

    [Remote] Business Development Manager - East Coast US - Biospecimens & IVD

    100% Remote Full-time

    [Remote] Senior Full-Stack Engineer (React / Node.js)

    100% Remote Full-time

    [Remote] Sr ProServe Account Executive - Financial Services, NAMER FINANCIAL SERVICES

    100% Remote Full-time

    [Remote] Senior Account Executive

    100% Remote Full-time

    [Remote] Senior Account Executive

    100% Remote Full-time

    [Remote] Software Development Engineer, Amazon Data Firehose

    100% Remote Full-time

    Experienced Full Stack Data Entry Clerk – Remote Internship for High School Students at blithequark

    100% Remote Full-time

    Experienced Online Data Entry Specialist for Teens – Entry-Level Opportunity in Data Management at arenaflex

    100% Remote Full-time

    ATP- Social Behavior Change Senior Advisor for SDI - remote

    100% Remote Full-time

    Associate Director of Research Administration

    100% Remote Full-time

    Licensed Property & Casualty Insurance Agent - Remote USA

    100% Remote Full-time

    N-IDOH-Public Health Preparedness Field Coordinator - Remote with travel

    100% Remote Full-time

    Counsel, Marketing Legal - Games, Consumer Products & Experiences

    100% Remote Full-time

    Global Supply Chain Manager, Electricals

    100% Remote Full-time

    Nurse Practitioner Behavioral Health

    100% Remote Full-time

    Distributed Systems Engineer (L5), Content Engineering at Netflix in usa remote • los gatos, california, united states of america

    100% Remote Full-time