Requires maintaining reliable, fault tolerant cloud infrastructure to meet service availability.
This position will work closely with all other teams to optimize performance and security on cloud infrastructure, and to assist incident management in the production environments
Key responsibilities for this role include facilitating onboarding for new services, building infrastructure enhancements, and managing ongoing tasks.
Duties & Responsibilities
Troubleshoot and resolve cloud infrastructure issues, ensure proper security controls are in place.
Maintain regular backups of the system.
Able and be willing to provide on-call duty.
Develop metrics for cloud infrastructure; establish dashboards for monitoring metrics and key performance indicators.
Provision and de-provision services and ensure the service availability SLA is met.
Work with Engineering team on the capacity expansion and change request.
Job Requirements
A recognized university degree in Electrical/Electronic/Telecommunications Engineering or equivalent.
Solid understanding of system and network administration (RHEL, Open Stack, TCP/IP, DNS, VLAN, VPN, VxLAN etc).
Familiarity with penetration testing, vulnerability scanning, log/configuration management systems is preferred.
Preferable with 2 years hand-on experience in OpenStack administration and in cloud infrastructure.
Knowledge of or desire to learn containerization technologies such as Docker, Kubernetes and scripting language such as Bash, Python and Powershell.