Site Reliability Engineer (SRE)

Remote · USA Full-time New today

Job Description

Location: Full remote, EU timezone (CET +/- 2 hours)

Start Date: As soon as possible

Languages: English required

We are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the reliability, performance, and scalability of our production systems. You'll work closely with engineering teams to automate operations, improve monitoring, and design resilient systems.

Responsibilities:

  • Design, implement, and maintain scalable, resilient AWS infrastructure
  • Develop and manage CI/CD pipelines and infrastructure-as-code (Terraform or similar)
  • Set up and optimize monitoring, alerting, and incident response processes
  • Proactively identify and resolve performance, reliability, and security issues
  • Collaborate with development teams to integrate SRE best practices into their workflows
  • Conduct post-mortems and root cause analyses on incidents
  • Participate in on-call rotations to support 24/7 system reliability

Requirements:

  • 5+ years of experience as an SRE or in a similar role
  • Deep knowledge of AWS services (EC2, ECS, RDS, Lambda, S3, etc.)
  • Proficient in infrastructure-as-code tools (Terraform, CloudFormation, etc.)
  • Solid experience with Linux systems administration and networking concepts
  • Strong programming/scripting skills (Python, Bash, Go, etc.)
  • Experience with CI/CD tools (GitLab CI, Jenkins, etc.)
  • Familiarity with observability tools (Prometheus, Grafana, Datadog, etc.)

Nice To Have:

  • Experience with container orchestration (ECS, EKS, or Kubernetes)
  • Understanding of security best practices in cloud environments
  • Exposure to incident management frameworks (SRE handbook, etc.)

Why Join Us:

  • 100% remote work with flexible hours
  • High-impact role with autonomy and ownership
  • Collaborative and international engineering team
  • Cutting-edge tech stack with a strong focus on reliability and automation

Originally posted on Himalayas

