SITE RELIABILITY ENGINEER (SRE)

Effective until: 31-08-2025

As an SRE, your responsibility is to ensure stable system operations through continuous monitoring, timely incident handling, performance optimization, and operational automation. You will collaborate with internal teams to deploy, operate, and improve infrastructure systems, aiming to enhance the quality of IT services.

Key Responsibilities

  • Monitor, detect, and resolve system issues based on alerts, tickets, and user feedback.
  • Build and optimize tools that support system monitoring and operations.
  • Support the deployment and configuration of systems based on actual needs.
  • Participate in root cause analysis, incident handling, and post-incident reporting.
  • Propose and implement automation solutions for operational tasks.
  • Create technical documentation and handbooks to support operational activities.
  • Collaborate with relevant teams and follow internal processes and tools.
  • Research, experiment with, and recommend new technologies to enhance IT system performance and service quality.
  • Working hours: Flexible 8-hour shifts.

Basic Qualifications

  • Bachelor's degree in Information Technology, Electronics & Telecommunications, or equivalent practical experience.
  • At least 1 year of experience in roles such as System Engineer, SRE, or DevOps Engineer.
  • Experience administering Linux systems and common Linux-based applications such as Docker, web servers, and load balancers.
  • Solid understanding of networking concepts (e.g., switches, firewalls).
  • Proficiency in scripting with Bash or Python.
  • Strong sense of responsibility and the ability to work independently, collaborate in teams, and perform well under pressure.

Preferred Qualifications

  • Experience deploying and managing OpenStack, Ceph Storage, or VMware ESXi.
  • Hands-on experience with system monitoring tools.
  • 2 years of experience working with infrastructure systems or platform services.
  • 1 year of experience participating in medium to large-scale technical projects.
  • Ability to analyze and troubleshoot complex system issues and propose appropriate technical solutions.
  • English proficiency equivalent to TOEIC 550+ or IELTS 5.0+ is a plus.

Team: SE

Company: AVTVN

Contact: hr@avtvn.com