Key responsibilities:
Supporting Linux-based servers and ensuring smooth operation of the service,
Participating in shifts and resolving problems and incidents on a highly loaded system with an SLA of 99.999%,
Working closely with the engineering and architecture teams to maintain production reliability,
Capacity planning and cloud infrastructure cost optimization.
Requirements:
Good knowledge of the Linux environment, TCP/IP, network routing, and DNS,
Hands-on troubleshooting experience with UNIX-based operating systems,
Good communication skills, with English at an intermediate level or above,
Experience with Kubernetes and the AWS cloud,
Ability to use Python/Golang/Shell to automate work and develop internal tools.
Would be a plus:
Experience with IaC (infrastructure as code) tools such as Terraform and Ansible,
Experience with various monitoring and logging systems, such as Zabbix, Elastic Stack (ELK), and Grafana,
Hands-on experience troubleshooting Java and C++ applications,
Experience with Apache Kafka, Nginx, and SQL and NoSQL databases.
You will gain experience in:
Working in a high-performance team in a strong IT company,
Maintaining a worldwide distributed system with a high SLA,
Troubleshooting Java and C++ applications built on enterprise technologies (NGINX, Apache Kafka, Cassandra, RabbitMQ, MongoDB, GridGain, Coherence, etc.),
Automation, CI/CD, IaC,
Integration between private and public clouds.
We offer:
A well-coordinated professional team,
Cutting-edge technologies, interesting and challenging tasks, a dynamic project, and great opportunities for self-realization and professional and career growth,
An additional health and life insurance package.
This role requires on-site presence at our office 4 days a week to support effective collaboration and teamwork.