Senior Site Reliability Engineer

Чк Its Partner Ltd.
Занятость Полная занятость
Полная занятость
Адрес Казахстан, Алматы
Описание вакансии

Responsibilities:

  • Perform and improve system surveillance, server deployment, patch management, incident management, and automation
  • Contribute to implementation of systems and services according to documentation standards and guidelines
  • Contribute together with team and architects on creating roadmaps and strategy for our services in collaboration with corresponding owning organization
  • Contribute in continuous improvement of services scalability, redundancy, stability, availability, automation, and supportability
  • Participate and contribute together with the team in our Agile planning.
  • Review new microservices before release to production to ensure they follow best practices, including supportability and monitoring
  • Raise risks on technical changes, projects, releases, or existing implementations
  • Assess security vulnerabilities and determine resolutions in collaboration with team specialists
  • Review and perform hardware lifecycle requests concerning hardware and software maintenance and upgrades
  • Drive excellence in all activities and improve for the future in collaboration with Product Owner and COAO Manager
  • Document services and procedures and manage technical debt backlog
  • Python coding and scripting
  • Admin tools: Ansible, Jira, Confluence, Bitbucket, Git

What we expect from you:

  • Proven experience as System engineering (SRE)
  • Experienced in system automation, preferably Ansible
  • Proficient in Linux and open source, able to perform advanced troubleshooting and root cause analysis in large-scale environments
  • Experienced in scripting languages such as Bash, Python, or similar
  • Knowledge of Docker, Kubernetes, cloud computing, and databases is beneficial
  • Deep understanding of infrastructure systems and willingness to learn new technologies, including 3rd party software
  • Analytical and problem-solving skills with the ability to assess complex IT systems and propose effective solutions
  • Monitoring tools: grafana, nagios, grafana, Prometheus
  • Cloud, virtualization and hardware: Azure, AWS, Terraform, vSphere, Docker

Conditions:

  • Salary in US dollars.
  • Medical insurance and coverage of sports activities.
  • Remote job at international project.
  • Great team and colleagues.
  • Knowledge sharing.
  • Corporate culture with people-oriented approach.
Требования
Опыт Более 6 лет
Условия работы
График работы Полный день
Добавлено вчера
Для связи с работодателем или просмотра контактов нажмите на кнопку