Logo envelio GmbH

Site Reliability Engineer

Job

  • Level
    Erfahren
  • Job Feld
    IT, System
  • Anstellung
    Vollzeit
  • Vertragsart
    Unbefristetes Dienstverhältnis
  • Ort
    Köln
  • Arbeitsmodell
    Full Remote, Hybrid, Onsite
  • Job Zusammenfassung

    In dieser Rolle kümmerst du dich um die Wartung und Optimierung von Kubernetes-Clustern in Cloud- und On-Premise-Umgebungen, automatisierst Prozesse mit Infrastructure-as-Code und verbesserst die Systemzuverlässigkeit durch Überwachung und Sicherheitsmanagement.

    Job Technologien

    Deine Rolle im Team

    • You maintain Kubernetes clusters across multiple clouds and on-premise environments, ensuring they are reliable, secure, and cost-effective.
    • You develop and maintain infrastructure-as-code (Terraform, SaltStack) to manage 100+ customer instances with layered configuration.
    • You design and maintain observability (monitoring, alerting, SLOs) so that production issues surface early and are resolved quickly.
    • You own and evolve secrets management, certificate automation, and security tooling across the platform.
    • You reduce operational toil through automation, better tooling, and solid runbooks.
    • You participate in incident response, root cause analysis, and drive follow-ups so the same issues do not reoccur.
    • You collaborate with development squads and the Operations team to improve the overall reliability of the IGP.

    Unsere Erwartungen an dich

    Qualifikationen

    • You are comfortable with Linux administration, networking, and distributed systems.
    • You have worked with configuration management tools like SaltStack, Ansible, or Chef.
    • You understand monitoring and observability and have worked with tools like Datadog, Prometheus, or Grafana.
    • You communicate effectively in asynchronous, remote-first environments.
    • You are curious, enjoy learning, and are open to using AI tools in your daily work.
    • You are business-fluent in English (Level C1).
    • Nice to have: German language skills.

    Erfahrung

    • You have proven experience running production workloads on Kubernetes in a cloud or hybrid environment.
    • You have hands-on experience with infrastructure-as-code tools such as Terraform or CloudFormation.
    • You have experience with container and orchestration technology (Docker, Kubernetes, Helm) in production.
    • You have experience as a software developer, ideally with languages like Python or Go.

    Unser Angebot

    • Join us fully remote #LI-Remote or at our lovely office in Cologne in a hybrid working mode.
    • Option for remote work from abroad (up to three months per year from anywhere in the EU or the USA).
    • State of the art technology and modern tech stack.
    • Excellent hardware equipment (16 inch MacBooks, 2 screens at your workplace).
    • 30 holidays + 3 corporate holidays.
    • Support for your health through sports membership cooperations.
    • Flexible use of a monthly mobility budget (e.g. Jobrad, public transport).
    • Time and resources for individual growth.
    • Envelio pension plan.
    • Regular company and team events.

    Benefits

    Gesundheit, Fitness & Fun

    Themen mit denen du dich im Job beschäftigst

    Job Standorte

    • Standort Köln

      Nordrhein-Westfalen

      Deutschland

    Das ist dein Arbeitgeber

    envelio GmbH

    envelio GmbH

    Die envelio GmbH, mit Sitz in Köln, ist ein innovatives Clean-Tech Softwareunternehmen, das eine Plattform zur Automatisierung und Digitalisierung von Stromnetzplanung bietet. Es unterstützt Verteilnetzbetreiber bei der Integration erneuerbarer Energien.

    Description

  • Unternehmenstyp
    Startup
  • Arbeitsmodell
    Full Remote, Hybrid, Onsite
  • Branche
    Energiewirtschaft, Umwelt
  • Logo envelio GmbH

    Site Reliability Engineer

    Ort
    Köln
    Arbeitsmodell
    Full Remote, Hybrid, Onsite
    Diversität
    Für alle Personen geeignet (m/w/d)

    Weitere Jobs