If you are passionate about building and maintaining scalable cloud systems and solving complex technical problems, and you thrive in a fast-paced, collaborative environment, this role is for you. The Senior Cloud Operations Engineer will be responsible for designing, developing, and documenting robust cloud infrastructure and monitoring solutions. You will work closely with cross-functional teams to gather requirements, deploy cloud technologies, and help scale a high-performance platform powering global digital experiences.
Key Responsibilities:
Automate manual operational tasks to streamline workflows and improve efficiency.
Identify areas for system optimization to enhance reliability and reduce incidents.
Manage and support proactive monitoring solutions across production environments.
Analyze trends and recurring issues in live applications to prevent future occurrences.
Perform real-time diagnosis of major incidents involving backend or frontend systems.
Investigate and resolve complex, hard-to-reproduce technical problems.
Provide analysis and reports on frequently occurring live issues to inform long-term fixes.
Collaborate with developers to accelerate bug triage and resolution.
Serve as Tier 3 IT support for production infrastructure, participating in 24x7 on-call rotations.
Create and manage automated infrastructure solutions, including builds and configuration management.
Design and maintain large-scale, mission-critical systems emphasizing networking, security, scalability, redundancy, and performance KPIs.
Rapidly learn and integrate new technologies, PaaS offerings, and APIs.
Requirements:
5+ years of hands-on experience in DevOps, including Git, CI/CD pipelines, and automation design/delivery.
Strong scripting and programming experience in PowerShell, plus proficiency in one or more of the following: Python, Bash/Shell, Java, JavaScript, or Node.js.
Experience with Jenkins, Ansible, and Terraform.
Hands-on experience with AWS in a large-scale enterprise environment.
Solid understanding of containerization technologies such as Docker and Kubernetes.
Knowledge of monitoring and alerting tools (e.g., Grafana, Datadog).
Familiarity with security architecture, design principles, and best practices.
Experience with source control and change management processes.
Excellent communication skills, with the ability to explain complex technical concepts to both technical and non-technical audiences.
Strong analytical and problem-solving skills, with a customer-focused mindset.
Ability to create and maintain technical documentation, diagrams, and checklists for internal knowledge sharing.
Highly organized, with the ability to prioritize and manage multiple tasks in a fast-paced environment.
Job Type: Full-time
Pay: RM15,000.00 - RM16,000.00 per month
Benefits:
Health insurance
Maternity leave
Opportunities for promotion
Professional development
Work Location: In person