[Remote] Senior Cloud Ops Engineer, Flickr
Note: This is a remote position open to candidates in the USA. Flickr is a mission-focused company dedicated to building a better world through photography. They are seeking a Senior Cloud Ops Engineer to manage multiple priorities and drive the design, development, and maintenance of complex cloud-native software applications, while improving operational practices and collaborating cross-functionally to enhance system performance and user experience.
Responsibilities
- Contribute to the design and delivery of scalable, reliable, cloud-native systems using AWS (ECS, Lambda, and related services), with a strong focus on automation and operational excellence
- Own complex technical and operational initiatives from problem definition through execution, anticipating risks and driving solutions to completion
- Take a leadership role in on-call operations, independently managing incidents, coordinating response efforts, and improving system reliability through root cause analysis and follow-through
- Contribute to improving engineering and operational practices, including deployments, monitoring, alerting, and incident response
- Partner cross-functionally with engineering, product, and support teams to deliver solutions that improve both system performance and user experience
- Drive technical discussions and decision-making, clearly articulating trade-offs and aligning solutions with broader organizational goals
Skills
- Strong experience building and operating distributed systems in AWS, with a focus on modern, cloud-native and serverless architectures
- Proven ability to lead projects end-to-end, including defining approaches, breaking down ambiguous problems, and delivering impactful solutions
- Minimum 5 years of experience in SRE, DevOps, infrastructure, or a related field, with demonstrated ownership of production systems
- Deep understanding of production systems, including monitoring, alerting, incident response, and debugging in live environments
- Experience improving and scaling operational processes, with an emphasis on reliability, efficiency, and automation
- Ability to go beyond standard best practices to develop pragmatic, effective solutions to complex problems
- Strong communication skills, with the ability to influence decisions and collaborate across teams
- Comfortable collaborating and sharing knowledge with peers
- Experience building or operating systems involving media uploads, processing pipelines, or large-scale content delivery (e.g., images, video, or other high-volume assets)
- Comfortable working across a mix of modern and legacy systems, and able to ramp up quickly in unfamiliar technologies (e.g., PHP, Vespa, or similar large-scale production systems)
- A strong sense of ownership of systems and outcomes, especially in high-impact or high-pressure situations
- A systems thinker who understands how their work impacts reliability, scalability, and the broader business
- Proactive in identifying gaps and driving improvements without waiting for direction
- Confident leading during incidents and bringing clarity to ambiguous or complex situations
Benefits
- Health, dental, vision insurance with 100% premium coverage for you and dependents
- Health Savings Account contributions covering 90%+ of annual deductible
- 401(k) with company match and immediate vesting
- Professional development and learning opportunities
- Remote work support (internet, fitness, coworking space reimbursements)
- Company-sponsored therapy and coaching sessions
- Unlimited PTO (and we mean it!)
- Flexible spending accounts available
- Company-sponsored phone plan
- A fully remote work environment
- An experimental 4-day work week
Company Overview