All roles

Staff Site Reliability Engineer / DevOps

Remote · USA Full-time New today

Almedia is the fastest-growing advertising company in Europe, according to the Financial Times. Based in the heart of Berlin, we offer mobile game and app developers unparalleled returns from rewarded user acquisition, engineering the future of UA with our data-driven approach and community of over 50 million users.

We are at the forefront of a shift in the mobile app advertising landscape, changing the way people find and engage with apps. The industry is adapting around Almedia’s approach, and we are building a team that can push us even further.

Staff Site Reliability Engineer / DevOps

Berlin (preferred) or Remote

About you

  • An SRE or DevOps engineer with hands-on experience in high-traffic production systems

  • Strong in Linux, databases (MySQL, Postgres, MongoDB, Redis), and networking fundamentals

  • Comfortable with Kubernetes, CI/CD pipelines, and observability tools like Datadog

  • A self-starter who thrives in scaling environments and can work independently without PMs

  • Pragmatic, able to balance prevention, maintenance, and firefighting when needed

Your mission is to

  • Take ownership of uptime and reliability for a platform serving 50M+ users

  • Build robust monitoring, alerting, and incident response practices

  • Improve CI/CD pipelines and enable safe deployments (blue-green, canary)

  • Partner with engineers across teams to fix pain points in infra, tooling, and reliability

  • Bring initiatives that make the platform automatically reliable, cost-efficient, and scalable

Your impact

  • Collaborate with engineering teams to improve operational workflows and resilience

  • Design smart alerts, improve observability, and drive better performance monitoring

  • Lead incident response, including on-call, and drive improvement with blameless postmortems

  • Build safer delivery methods and improve deployments with Kubernetes and GitLab pipelines

  • Report directly to the CTO and act as the primary reliability leader in the company

Your toolkit

  • Linux, networking (TCP/IP), and distributed systems troubleshooting

  • Databases: MySQL, Postgres, MongoDB, Redis

  • Kubernetes, GitLab pipelines, CI/CD best practices

  • Observability tools like Datadog, OpenTelemetry, or ELK stack

  • Nice-to-haves: RabbitMQ, Kafka, Terraform, Ansible, GCP, Datadog

What makes this role exciting

  • Be the first senior SRE hire with ownership of reliability across the entire platform

  • Shape infrastructure and processes for a scale-up growing beyond 100 FTE

  • Work on a product serving millions of users worldwide with real engineering challenges

  • Gain autonomy while collaborating with strong product and engineering teams

  • Join a culture that values pragmatism, initiative, and continuous improvement

Why Almedia?

  • Own Our Growth: We offer all Berlin-based employees equity in Almedia to truly be a part of our success.

  • Scale With Almedia: Grow alongside a startup that has been profitable from day one.

  • Central Berlin Office: Work from a fully-stocked modern office built for collaboration, accessible from all around Berlin.

  • Other Benefits: Transport subsidy, breakfasts and lunches, language learning, Urban Sports Club, and more.

  • We Listen: We regularly add to our benefits through rigorous employee feedback.

We believe in fostering talent, evaluating all skill levels during the hiring process, and providing a clear path for growth. Almedia is an equal opportunity employer. We embrace and celebrate diversity, and encourage individuals from all backgrounds to apply.

Apply to this Job

Related roles

General Opening

Remote · USA Full-time

Data Scientist

Remote · USA Full-time

Full-Time Intern – People Operations (HR)

Remote · USA Full-time

Director, Business Development (IT Agency Staffing and Recruitment)

Remote · USA Full-time

Associate, Recruiting

Remote · USA Full-time

Account Executive, Entertainment Advocacy

Remote · USA Full-time

Sr Housekeeper

Remote · USA Full-time

Sr Accountant I

Remote · USA Full-time

Junior Security Engineer

Remote · USA Full-time

Senior/Staff Software Engineer, Compliance (KYC)

Remote · USA Full-time

[Remote] Enterprise Risk Analyst II

Remote · USA Full-time

Experienced Full Stack Software Engineer – Cloud Application Development with Operational Schedule Data Store (OSDS) Expertise at Southwest Airlines (Remote)

Remote · USA Full-time

Digital Marketing Specialist – Growth and Retention (ON-SITE) – Los Angeles, CA

Remote · USA Full-time

People Experience Program Manager, Employee Listening & ERGs

Remote · USA Full-time

Software Product Manager

Remote · USA Full-time

Capital One: Senior Data Engineer – Capital One Software (Remote)

Remote · USA Full-time

Experienced Remote Customer Support Specialist – Technical Troubleshooting and Customer Service Expert for blithequark

Remote · USA Full-time

Experienced Insurance Agent – Sales, Customer Service, and Career Growth Opportunities at arenaflex

Remote · USA Full-time

Management Consultant – ERP Selection

Remote · USA Full-time

Rev Ops Deal Desk Analyst Contractor

Remote · USA Full-time