Site Reliability Engineer Based In Johor Bahru

Johor Bahru, M01, MY, Malaysia

Job Description

Salary: MYR 12,000 - MYR 14,000 Per Month


Location: Johor Bahru, Johor



Requirements:



Strong experience with Linux systems and distributed computing fundamentals

Proven experience in troubleshooting application issues with focus on performance and connectivity

Familiarity with networking concepts and effective troubleshooting techniques

Experience in Bash/Shell scripting or automation for system administration tasks

Experience in programming languages such as Python, Golang, Java, or similar (added advantage)

Demonstrated experience in system architecture and design, prioritizing reliability and scalability

Understanding of SRE principles including SLOs, SLIs, toil reduction, and incident post-mortems (added advantage)

Hands-on experience with cloud environments (AWS, Azure, Google Cloud)

Excellent problem-solving abilities and proactive approach to operational challenges

Ability to work independently while effectively collaborating within a team environment

Open to work in rotational shifts

Able to communicate in Mandarin


Responsibilities:



Monitor and maintain system performance to ensure stability and reliability of applications and infrastructure

Design and implement resilient system architectures that support high availability and scalability

Develop automation tools and scripts to enhance operational efficiency and reduce manual effort

Define, track, and analyze SLOs and SLIs to ensure reliability and performance meet business needs

Conduct thorough post-mortem analyses following incidents, driving continuous improvement

Collaborate with development and operations teams to establish best practices in system reliability

Troubleshoot and resolve issues related to database performance, network connectivity, and deployment failures

Ensure issues are resolved within stipulated Service Level Agreements (SLAs)

Identify and troubleshoot performance bottlenecks in applications and infrastructure

Maintain detailed documentation of processes and incident responses

Improve monitoring solutions to proactively identify and mitigate issues

Assist in deployment and configuration of new applications and services

Participate in on-call rotations and respond to critical incidents

Analyze system logs and metrics to identify trends and improvement areas


Skills Required:



Site Reliability Engineering (SRE)

Cloud Computing (AWS, Azure, GCP)

Automation

DevOps Practices

Monitoring and Alerting

Incident Management

Linux System Administration


Benefits:



Competitive Salary and commission

Collaborative working environment with multilingual teams

Full Training provided

Other benefits shared during interview


Job Type: Full-time

Pay: RM12,000.00 - RM14,000.00 per month

Benefits:

Additional leave Opportunities for promotion Professional development
Work Location: In person

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD1299096
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Johor Bahru, M01, MY, Malaysia
  • Education
    Not mentioned