If building and maintaining scalable cloud systems and solving complex problems is your passion, this role is for you. We are seeking a
Cloud Operations Engineer
who is reliable, collaborative, and thrives in a fast-paced environment. You will be responsible for
designing, developing, and maintaining our cloud infrastructure and monitoring solutions
, ensuring the reliability, performance, and scalability of platforms powering video advertising worldwide.
Key Responsibilities
Automate manual operations tasks to streamline processes and reduce manual effort.
Identify opportunities to improve system reliability and reduce incidents.
Manage and support proactive monitoring solutions across production environments.
Analyze trends in recurring application issues and facilitate investigations to prevent reoccurrence.
Perform real-time diagnosis on product codebase (backend/frontend) during major live incidents.
Troubleshoot and resolve complex, hard-to-reproduce problems.
Provide detailed analysis and reporting on frequently occurring live problems.
Collaborate with developers to accelerate bug triage and resolution.
Participate in
Tier 3 IT support
for production infrastructure as part of a 24/7 on-call rotation.
Create and manage automated infrastructure solutions, including builds and configuration management.
Ensure systems adhere to best practices in
networking, security, redundancy, scalability, monitoring, and performance
.
Stay current with new technologies, PaaS offerings, and APIs, integrating them into solutions when beneficial.
Requirements
5+ years
of hands-on experience in
DevOps and Cloud Operations
, including Git, CI/CD environments, and system automation.
Strong scripting and programming experience with
PowerShell
and at least one of:
Python, Bash/Shell, Java, JavaScript, Node.js
.
Experience with automation tools such as
Jenkins, Ansible, and Terraform
.
Hands-on experience with
AWS
in a large-scale enterprise environment (other cloud platforms a plus).
Experience with
Docker, Kubernetes, and container orchestration technologies
.
Familiarity with monitoring and alerting tools such as
Grafana, DataDog, or similar
.
Strong understanding of
security architecture, change management, and source control practices
.
Ability to create and maintain clear technical documentation, diagrams, and checklists for team use.
Strong problem-solving and systems-thinking mindset, with the ability to communicate effectively with both technical and non-technical stakeholders.
Proven ability to prioritize and multitask in a
fast-paced, mission-critical environment
.
Job Type: Full-time
Pay: RM15,000.00 - RM16,000.00 per month
Benefits:
Health insurance
Maternity leave
Opportunities for promotion
Professional development
Work Location: In person
Beware of fraud agents! do not pay money to get a job
MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.