Adobe Systems GmbH

Cloud Reliability Engineer

Job

  • Level
    Experienced
  • Job field
    IT, DevOps, Security
  • Employment
    Full-time
  • Contract type
    Permanent employment
  • Location
    Hamburg
  • Work model
    Hybrid, Onsite
  • Job summary

    In this position, you improve the reliability and scalability of microservices on Kubernetes and AWS, build effective monitoring solutions, and automate infrastructure processes with IaC tools such as Terraform.

    Your role in the team

    • Improve the reliability, scalability, performance, security, and cost-efficiency of the platform's microservices running on Kubernetes and AWS.
    • Build and maintain strong observability using metrics, logs, traces, dashboards, and meaningful alerting. Use monitoring solutions like Prometheus, New Relic, Grafana, and Splunk. This helps us detect and understand issues before customers do.
    • Own infrastructure-as-code and automated delivery with Terraform, Kubernetes, Helm, ArgoCD, and CI/CD pipelines - keeping infrastructure across AWS repeatable, consistent, reviewable, and auditable.
    • Drive down toil with AI-assisted and agentic automation - auto-remediation, self-healing workflows, and LLM-generated runbooks and IaC - rather than hand-crafting one-off scripts, so the team's effort compounds.
    • Help grow a shared automation platform that tackles auto-remediation, self-healing workflows, and infrastructure-as-code - where AI accelerates the build, and every contribution compounds the team's capability.
    • Partner with engineering teams, for example to forecast capacity from usage trends or to adopt new technologies, so that the platform scales to meet growing demand.
    • Contribute to the security and compliance posture of the platform, partnering with collaborators on controls, evidence, and audit readiness throughout daily reliability work.
    • Help set the bar for how the team uses AI in operations - choosing where agentic and LLM-assisted tooling adds real leverage, and where human judgment must stay in the loop.
    • Participate in a healthy, sustainable on-call rotation, and help continuously improve our runbooks and operational practices.
    • Collaborate across Adobe's global Reliability organization to advance the shared mission of "delivering better software faster."

    What we expect from you

    Education

    • A Bachelor's degree or higher in Computer Science, a related field, or equivalent experience. We value demonstrated ability over specific credentials.

    Qualifications

    • A modern, AI-forward mindset: you reach for agentic and LLM-assisted tooling to do the work, and you have the judgment to know where it accelerates you and where humans must stay in the loop.
    • Enough programming ability to read, debug, and contribute to services and tooling. These are largely Java/Spring services, so comfort reading and debugging Java is valuable, and Python is a strong advantage for automation and tooling.
    • Working knowledge of web services and supporting technologies including HTTP, JSON, REST, and service-to-service networking (e.g. proxies, load balancers, service meshes).
    • Exposure to the data stores that enable these services such as MongoDB, Cassandra, or DynamoDB is helpful, as Reliability Engineering manages these together with our Database Reliability team.
    • Strong communication and collaboration skills, and a genuine commitment to teamwork, shared ownership, and continuous improvement.
    • Professional working proficiency in English. German is a plus, given our Hamburg base, but not required.

    Experience

    • Several years of professional experience operating, scaling, or building distributed systems in production (SRE, DevOps, platform, or backend engineering backgrounds all welcome).
    • Hands-on production experience with AWS and with container orchestration on Kubernetes (plus tooling like Docker, Helm, and ArgoCD).
    • Practical experience with infrastructure-as-code, ideally Terraform, and with modern GitOps-based CI/CD workflows.
    • Experience with monitoring and observability solutions - for example Prometheus, New Relic, Grafana, or Splunk.
    • Enough software development experience to read, debug, and contribute to services, automation, and tooling. This includes Python and Golang for our own toolset, as well as Java/Spring for the services we support.

    What we offer

    • Real impact at scale: your work directly enables millions of users to create and collaborate.
    • A global, diverse team that brings different perspectives to hard problems.
    • 2-3 on-site team days per week at our Hamburg office at the Fischmarkt, with flexibility to work from home.
    • Growth opportunities in a rapidly expanding team, where you can take on more responsibility over time.

    Benefits

    Health, fitness & fun

    More net pay

    Work-life integration

    Job locations

    • Hamburg location

      Germany

    About your employer

    Adobe Systems GmbH

    Adobe is a workplace recognized around the world for its outstanding quality. You will be surrounded by colleagues who help one another grow through our unique Check-In approach, in which regular feedback flows freely.

  • Company type
    Established company
  • Work model
    Full Remote, Hybrid, Onsite
  • Industry
    Internet, IT, Telecom
  • Dev Reviews (by devworkplaces.com)
    Overall: 4.0 (1 review)
    • Working conditions: 4.4
    • Career growth: 3.7
    • Engineering: 3.6
    • Culture: 4.5
    Diversity
    Suitable for all persons (m/f/d)
