No. Of Resource – 1 No.
Location: Oman
Industry – Bank
Email – hr@staffconnect.ae
WhatsApp – +971 52921270
Job Responsibilities
- Architect and implement container infrastructure technologies related to identity and access management, storage, networking, and telecommunications across hybrid environments (on-prem & cloud).
- Lead onboarding of new projects on OpenShift, setting the technical direction and best practices.
- Manage and support production applications across 11+ Red Hat OpenShift (OCP) clusters.
- Perform capacity planning, hardware sizing, and disaster recovery planning (active-active & active-passive).
- Update resource quotas and support microservice-based projects and application releases.
- Implement and maintain RBAC policies and Prometheus custom rules for user workload monitoring.
- Create and maintain Change Requests (CRs) for both platform and application-level configurations.
- Raise and manage support tickets with Red Hat, and coordinate updates on weekly TAM calls.
- Migrate projects between PROD, Stage, and DR clusters, including cluster decommissioning efforts.
- Configure Alert manager for platform notifications via email, SMS, Slack, etc.
- Set up and monitor CronJobs for ETCD backups to NFS mount points.
- Verify connectivity between OCP and DBs, NFS, Nexus, GitLab, and internal image registries.
- Implement Kasten for backup/restore operations integrating with S3 object storage.
- Integrate OCP with Azure Arc and Syslog servers for central logging and monitoring by security teams.
- Create custom Storage Classes integrating with NFS (Power Store, Dell Isilon).
- Integrate LDAP/Active Directory for authentication in OpenShift clusters.
- Set up Argo CD for CI/CD deployment pipelines and support application sync and deployment issues.
- Troubleshoot Kibana health status (Red, Yellow, Green) and Daemon Set issues to ensure reliable logging.
- Create and manage Docker/Podman containers, registries, and image streams (build, push, import).
- Develop and manage Ansible playbooks for automation and node management.
- Provide 1st to 3rd-level support in a global operations environment.
- Conduct patching, upgrades, and release testing across applications and infrastructure.
- Manage user access and incident requests for ongoing project needs.
- Collaborate with vendors to align with standard cloud and enterprise architecture frameworks.
- Ensure compliance with internal technology standards, security policies, and industry best practices.
- Engage with IT leadership and stakeholders to present new technology initiatives and innovation opportunities.