Perfil buscado (Hombre/Mujer)
Responsibilities
• Document and share any work knowledge, incident experiences, and any improvements with the team
• Participate in mentoring and coaching new and junior employees or new members of team
• Participate and contribute to the community of knowledge sharing of SRE practices. For example, Solution Monitoring program
• Implement solution monitoring and observability monitoring, automate detections and responses
• Implement SLI and SLO measurements and monitoring in our Solution Monitoring
• Conduct Service improvement actions and review with the team using data from SLI and SLO
• Troubleshoot incidents, post-incidents analysis, perform root cause analysis
• Implement workarounds to avoid recurrence of incidents, improvements to monitoring detection
• Implement Observability monitoring and perform distributed tracing analysis of applications
• Deployment of new application releases to the preproduction and production environments
• Participate and contribute to automation in deployment, automated testing, and monitoring detection
• Collaborate with SQC team on testing automation deployment
• Collaborate with DevOps on continuous delivery
• Participate in solution FAT process and performance tests
• Drive the solution SAT process in collaboration with Development, DevOps, Platform teams
• Participate in the planning and review sessions with Development, DevOps, Platform teams
• Expand and grow the technical knowledge, skillsets, and expertise expected of an SRE
• Create and document any artifacts related to SRE practices, for example, good practices or patterns or customized dashboards or workarounds or troubleshooting methods, solution monitoring and observability improvements.
• Empresa en pleno crecimiento|Empresa internacional
Knowledge, skills:
Language skills: Fluent in English
Others: Good written and verbal communication
Technical skills:
• Experience in troubleshooting or debugging applications and complex systems
• Understand application tracing and log analysis
• Strong knowledge of Linux and VM
• Hands-on experience in Shell Scripts
• Experience in application deployment, and deployment tools (e.g. Jenkins)
• Experience in programming and development at least one programming language (e.g. Python, C, Java, etc).
• Experience with incident resolution and root cause analysis and incident management
• Knowledge and experience in using JIRA, a ITSM ticketing tool and any documentation tools (e.g. Wiki)
• Knowledge and experience in Nagios and Splunk or similar tools
• Knowledge and experience in Dockers, OpenShift, Kubernetes or similar technologies
• Knowledge and experience in automation (e.g. Ansible) will be an advantage
Empresa internacional
50% remoto
beneficios
Idiomas: Inglés (Alto)
Nivel Profesional: Empleado
CVs inscritos en el proceso: 10
Regístrate como candidato en Tecnoempleo.com y vincula tu CV a las ofertas de empleo.
Crea tu cuenta gratis