Google Cloud DevOps / Site Reliability Engineer (SRE) Job at Purple Drive, Alpharetta, GA

a2tRYXlZZDcvdXZSZ3JLWitLcGptd0dL
  • Purple Drive
  • Alpharetta, GA

Job Description

Role: Google Cloud DevOps / Site Reliability Engineer (SRE)

Location: Alpharetta, GA
Experience: 8-12 Years (Senior Level)

Job Summary

We are seeking an experienced Google Cloud DevOps / SRE Engineer to design, build, and operate highly reliable, scalable, and secure cloud infrastructure on Google Cloud Platform (GCP) . The ideal candidate will bring deep Linux expertise, strong cloud networking and security knowledge, and hands-on experience with automation, CI/CD, and Kubernetes-based deployments. This role plays a critical part in ensuring system reliability, performance, and operational excellence across large-scale distributed systems.

Key Responsibilities

Cloud Infrastructure & Platform Engineering

  • Design, deploy, and manage cloud infrastructure using Google Cloud Platform services including Compute Engine, GKE, VPC, IAM, Cloud Storage, and Cloud SQL.

  • Architect and support highly available, scalable, and fault-tolerant systems on GCP.

  • Implement and manage Shared VPCs, VPC peering, firewall rules, load balancers, DNS, and VPN tunnels .

DevOps & Automation

  • Build and maintain CI/CD pipelines using Jenkins (Declarative & Scripted) and GitHub Actions .

  • Automate infrastructure provisioning and configuration using Terraform , including module development, remote state management, dependency handling, and DRY principles.

  • Implement modern deployment strategies such as Canary releases and Blue/Green deployments .

  • Manage container artifacts using Docker and Helm .

Site Reliability & Operations

  • Ensure high availability, performance, and reliability of production systems.

  • Troubleshoot complex system issues including CPU, memory, disk I/O bottlenecks , kernel issues, and system boot failures.

  • Analyze logs and metrics to proactively identify and resolve performance and stability issues.

  • Support incident response, root cause analysis, and post-incident reviews.

Linux Systems Engineering (Must Have)

  • Demonstrate deep hands-on expertise with Linux systems (RHEL, Ubuntu, CentOS).

  • Perform kernel tuning, system optimization, storage management (LVM), and systemd administration.

  • Maintain OS-level security, patching, and performance best practices.

Security & Identity Management

  • Implement and troubleshoot Cloud IAM , service accounts, and Workload Identity Federation .

  • Enforce least privilege access and security best practices across environments.

  • Partner with security teams to maintain compliance and secure cloud operations.

Collaboration & Process

  • Work closely with application teams, architects, and security stakeholders.

  • Participate in on-call rotations and incident management processes.

  • Contribute to operational documentation, runbooks, and best practices.

Required Skills & Qualifications

Must-Have Skills

  • Strong hands-on experience with Google Cloud Platform (GCP) .

  • Deep expertise in Linux systems engineering (RHEL, Ubuntu, CentOS).

  • Proficiency in at least one programming language: Python, Go (Golang), or Java .

  • Strong troubleshooting and debugging skills across infrastructure and application layers.

  • Hands-on experience with Terraform for infrastructure as code.

  • Experience with CI/CD pipelines using Jenkins and/or GitHub Actions.

  • Kubernetes experience with GKE , Docker, and Helm.

Preferred Qualifications

  • GCP Certifications:

    • Google Professional Cloud DevOps Engineer

    • Google Professional Cloud Architect

  • CKA (Certified Kubernetes Administrator) .

  • Experience supporting large-scale distributed systems and microservices architectures .

  • Familiarity with ITIL processes , Change Advisory Board (CAB) workflows, and incident management .

Soft Skills

  • Strong analytical and problem-solving abilities.

  • Excellent communication skills with the ability to collaborate across teams.

  • Ownership mindset with a focus on reliability and continuous improvement.

  • Ability to work in fast-paced, production-critical environments.

Job Tags

Remote work,

Similar Jobs

Cox Media Group

Senior Producer - KIRO TV Job at Cox Media Group

 ...Location:WA-Seattle Job Title: Senior Producer - KIRO TV Position Overview KIRO 7 News is searching for a bold, forward-thinking Senior Producer who can both serve our audience and help elevate our newsroom. This is not a fixer role it's a forward... 

Sysco

Diesel Fleet Mechanic Technician II Job at Sysco

 ...systems (including regeneration systems), intake systems, electrical systems, brake systems, HVAC systems and lift-gate hydraulic, mechanical and electrical systems.Follow procedures including documenting all work performed on work orders.Learn and develop efficiency... 

Gordon Food Service

CDL-A Route Delivery Driver Job at Gordon Food Service

 ...Schedule : 5 day work week between Monday-Saturday. Start time 3 AM-5 AM until route finish Pay : Guaranteed minimum of $1200/ week Daily base pay, plus component pay Paid for every mile driven, stop made, and case delivered Total Rewards at... 

Apex Staffing, Inc.

Front Desk - Financial Services Job at Apex Staffing, Inc.

 ...and produces report Maintain and retain office files consistent with firm/professional...  ...management applications Prepare and produce back-office support demands for basic client...  ...s degree preferred Experience in banking/financial services required Experience... 

MASONICARE

Housekeeper Job at MASONICARE

 ...independent and assisted living. Housekeeper - Essential Duties and Responsibilities: Receive instructions as to area and specific work assignment; assemble necessary cleaning supplies and equipment for transporting to the designated area. Routine duties include, but...