
Data Engineer with PySpark and Google BigQuery

Remote · USA Full-time New today

Position: Data Engineer with PySpark/Google BigQuery

Location: Dallas, TX (On-site)

JD: The PySpark/Google Cloud Platform Data Engineer will create, deliver, and support custom data products, as well as enhance and expand team capabilities. They will analyze and manipulate large datasets for the enterprise, activating data assets to support Enabling Platforms and analytics. Google Cloud Data Engineers will be responsible for designing the transformation and modernization on Google Cloud Platform using Google Cloud Platform services.

Responsibilities:

- Build frameworks and pipelines on Google Cloud Platform using Dataproc, PySpark, Kafka, and Pub/Sub.
- Implement schedules, workflows, and tasks for Cloud Composer/Apache Airflow.
- Create and manage data storage solutions using Google Cloud Platform services such as BigQuery, Cloud Storage, and Cloud SQL.
- Monitor and troubleshoot data pipelines and storage solutions using Google Cloud Platform's Stackdriver and Cloud Monitoring.
- Develop efficient ETL/ELT pipelines and orchestration using Dataprep and Google Cloud Composer.
- Develop and maintain data ingestion and transformation processes using PySpark.
- Automate data processing tasks using scripting languages such as Python or Bash.
- Ensure data security and compliance with industry standards by configuring IAM roles, service accounts, and access policies.
- Automate cloud deployments and infrastructure management using Infrastructure as Code (IaC) tools such as Terraform or Google Cloud Deployment Manager.
- Participate in code reviews, contribute to development best practices, and use Developer Assist tools to create robust, fail-safe data pipelines.
- Collaborate with Product Owners, Scrum Masters, and Data Analysts to deliver user stories and tasks and ensure deployment of pipelines.

Experience required:

- 7+ years of application development experience on one of the core cloud platforms (AWS, Azure, or Google Cloud Platform).
- At least 1 year of Google Cloud Platform experience.
- Experience working in Google Cloud Platform based Big Data deployments (batch/real-time) leveraging PySpark, BigQuery, Google Cloud Storage, Pub/Sub, Data Fusion, Dataproc, and Airflow.
- At least 3 years of coding experience in Python/PySpark and strong proficiency in SQL.
- Experience extracting, loading, transforming, cleaning, and validating data, and designing pipelines and architectures for data processing.
- Experience architecting and implementing next-generation data and analytics platforms on Google Cloud Platform.
- Experience working with Agile and Lean methodologies.
- Experience working with either a MapReduce or an MPP system at any size or scale.
- Experience working in a CI/CD model to ensure automated orchestration of pipelines.
