If building and maintaining scalable cloud systems and solving complex technical challenges is your passion, this is the perfect opportunity for you. We are looking for a
Senior Cloud Operations Engineer
who is reliable, collaborative, and thrives in a fast-paced environment. In this role, you will design, develop, and document cloud infrastructure and monitoring solutions. You'll work closely with cross-functional teams to deploy and optimize cloud technology that supports large-scale, mission-critical platforms.
Key Responsibilities:
Automate manual operations tasks to streamline processes and reduce manual effort.
Identify and implement improvements to enhance system reliability and minimize incidents.
Manage and support proactive monitoring solutions across production environments.
Investigate and analyze recurring issues in live applications to prevent repeat occurrences.
Perform real-time diagnostics on live systems (backend/frontend) to resolve major incidents.
Analyze and troubleshoot complex or difficult-to-reproduce system problems.
Provide detailed analysis and reporting on frequently occurring live issues.
Collaborate with developers to provide detailed insights for faster bug resolution.
Serve as Tier 3 IT support for production infrastructure, participating in 24x7 on-call rotations.
Create and manage automated infrastructure solutions, including builds and configuration management.
Ensure high availability, security, scalability, and performance across on-prem and cloud systems.
Continuously explore and integrate new technologies, PaaS offerings, and APIs.
Requirements:
Minimum
5 years of hands-on experience
in DevOps, automation, and CI/CD environments.
Strong experience with
PowerShell
, plus other scripting or programming languages such as
Python, Bash/Shell, Java, JavaScript, or Node.js
.
Proficiency with
Jenkins, Ansible, and Terraform
.
Deep understanding of
AWS
cloud infrastructure, ideally within a large-scale enterprise environment.
Knowledge of
Docker, Kubernetes
, and container orchestration technologies.
Experience with monitoring and alerting tools such as
Grafana
and
DataDog
.
Strong grasp of
security architecture, networking, and system scalability
principles.
Understanding of
source control and change management
practices (e.g., Git).
Excellent communication skills with the ability to explain complex issues to both technical and non-technical audiences.
Strong analytical, problem-solving, and documentation skills -- able to create clear technical references and guides.
Ability to
multitask and prioritize
effectively in a dynamic, fast-paced environment.
Job Type: Full-time
Pay: RM4,500.00 - RM5,500.00 per month
Benefits:
Health insurance
Maternity leave
Opportunities for promotion
Professional development
Work Location: In person
Beware of fraud agents! do not pay money to get a job
MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.