




Job Summary: We are seeking a role focused on ensuring the stability, monitoring, and performance of our platform in hybrid environments, with an emphasis on rapid incident resolution. Key Highlights: 1. Incident Management and Rapid Resolution to ensure service continuity. 2. Kubernetes cluster and hybrid environment administration and optimization. 3. Implementation of automation (CI/CD) and Infrastructure as Code. ### **Job Information** Date Opened **05/19/2026**Job Type **Permanent**Industry **IT Services**Work Experience **1\-3 years**City **Madrid**State/Province **Madrid**Country **Spain**Zip/Postal Code **28001**### **Job Description** The role focuses on ensuring the stability, monitoring, and performance of our platform across hybrid environments (Azure, AWS, On\-Premise). This is not merely a deployment role but one centered on service assurance and efficient "firefighting", ensuring network or infrastructure outages are resolved in the shortest possible time (low MTTR). **Key Responsibilities** * **Incident Management and Rapid Resolution:** Immediate response to network, performance, or platform availability incidents. Agile diagnosis and remediation of critical failures. * **Platform Architecture and Management:** Administration and optimization of **Kubernetes** clusters (AKS, Rancher) and hybrid environments (Azure, AWS, On\-Premise). * **Automation and Infrastructure as Code:** Implementation of CI/CD pipelines (Jenkins, ArgoCD) and infrastructure management using Terraform and Ansible. * **Monitoring and Observability:** Design and maintenance of the monitoring stack (ELK, Prometheus, Grafana) to detect anomalies before they impact end users. * **Configuration and Security Management:** Management of repositories (Nexus), message queues (RabbitMQ, ActiveMQ), and security tools (Wiz). * **Documentation and Continuous Improvement:** Use of Jira and Confluence for incident tracking and documentation of resolution runbooks. ### **Requirements** **Mandatory Requirements** * Solid experience with **Kubernetes** (AKS, Rancher) and container orchestration. * Proficiency in orchestration and deployment tools: **Helm, ArgoCD, Jenkins**. * Proven experience in hybrid environments: **Azure, AWS, and On\-Premise management**. * Strong networking knowledge and ability to troubleshoot connectivity and performance issues. * Hands-on experience with Infrastructure-as-Code tools: **Terraform, Ansible**. * Proficiency in monitoring stacks: **ELK, Prometheus, Grafana**. * Relational database management: **PostgreSQL, Oracle**. * Message queue and messaging systems: **RabbitMQ, ActiveMQ**. * Familiarity with modern automation tools (n8n) and project management tools (Jira, Confluence). **Desirable** * Experience with cloud security tools (Wiz). * Knowledge of backup management and disaster recovery (Velero). * Familiarity with large language models or AI integration (Claude) into DevOps processes.


