Site Reliability Engineer

Malaysia, Malaysia

Job Description


Responsibilities:

  • Improve and refine the existing operations and maintenance (O&M) system, raise the professional skills of the O&M team, establish a standardized and efficient O&M system, and achieve high availability and stability for the business.
  • Lead reviews of technical solutions and of the system O&M architecture, master the relevant technical architecture and principles, proactively identify risks in proposed solutions from an O&M perspective, and provide professional solutions.
  • Participate in requirements analysis for the O&M platform, and formulate platform construction strategies and plans based on business needs and the current state.
  • As the team's technical expert, take ownership of technical solutions, design and develop core modules, and tackle technical difficulties.
  • Explore and apply emerging O&M technologies, follow and study business trends, promote the adoption of new technologies in business scenarios, and continuously improve system O&M and development capabilities.
  • Provide technical training and staff development within the team, mentor and lead team members, and build a high-performance technical team.
Requirements:

  • Bachelor's degree or above in computer science or a related major, with more than 5 years of O&M work experience. Experience in O&M architecture, O&M management, and DevOps at medium or large Internet companies is preferred.
  • Proficiency in at least one of Python and Go, with a solid programming foundation, good code style, and proficiency in commonly used software engineering methods, design patterns, data structures, and algorithms.
  • In-depth understanding of at least two of the following areas: computer architecture, the Linux kernel, distributed system architecture, virtualization technology, network communication, and system programming.
  • A well-formed view of system architecture, including the applicability, advantages, and disadvantages of different architectural approaches; a strong interest in technology; and an understanding of, and habit of following, advanced technologies in the industry.
  • Extensive experience operating and maintaining large-scale service clusters, experience with high-concurrency architectures, and familiarity with the planning and construction of high-availability and load-balancing clusters.
  • Extensive practical experience in the systematic planning and construction of an O&M system.
  • Proficiency in the principles and use of at least two mainstream middleware products, such as Redis/Codis, Kafka/RabbitMQ, Ceph/Elasticsearch, and etcd/ZooKeeper.
  • Familiarity with Jenkins, GitLab, and similar tools, with practical experience in defining and integrating CI/CD processes.
  • Familiarity with Docker/Kubernetes (k8s) container platforms and the related underlying technologies and principles is preferred.
  • Understanding of mainstream big data technologies such as Hadoop, Spark, Flink, Hive, Drill, and Druid.
  • Strong skills in driving initiatives, coordination, organization, and communication.


Job Detail

  • Job Id
    JD966409
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type
    Full Time
  • Salary
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Malaysia, Malaysia
  • Education
    Not mentioned