About the job
Service Reliability Engineering (SRE) is an important endeavor for Emirates NBD Group IT to get more out IT investments. SREs are focused on optimizing new/existing services including the underlying technology, continual assessment of cloud infrastructure including its micro services and elimination of any manual work through automation.
Senior Service Reliability Engineers (SSREs) advocate and augment the reliability engineering principles, guidelines and standards. SREs partner with Product Owners, Platform and Engineering Teams to drive the Availability, Reliability, Scalability, Usability, Recoverability of application services and technologies in the production environment. They combine engineering and development experience and an innate drive to improve existing and new systems and processes. They collaborate with Development, Platform, Operations team to build and run scalable, sustainable production services which can advance and adapt to evolving business needs.
Essential Job Requirements include:
A bachelor’s degree in Computer Engineering, Computer Science, Information Systems or other related field is highly preferred; however, equivalent work (6+ years) experience in Reliability Engineering will not be overlooked.
Passion for designing, building, and managing resilient applications and infrastructures
Experience with project management or lead technical role in large enterprise wide projects.
Extensive work experience with large sets of data and data analysis.
Ability to program (structured and OO) with one or more high level languages
Have clear understanding in dynamic resource management frameworks, cloud, server, distributed storage, networks, virtualized environments, applications, databases and associated tool sets.
An understanding and practical experience with containerization frameworks
Must be good at forecasting, statistical analysis and modeling are part of the job.
Java Spring Boot Experience
DevOps Experience/ Tools which helps to be a DevOps Engineer
AWS & AZURE
Cloud Transition Model Waterfall/Agile – CI / CD DevOps /Dev Sec Ops
Chaos Testing Automation on the MicroServices
OPENSHIFT (PaaS Platform)
RHEL ,CENTOS & UBUNTU (OS)
VIRTUALBOX & VAGRANT (Virtualization)
DOCKER (Container RUNTIME Engine).
NGINX (Performing webserver for Containers)
Knowledge on ANSIBLE AUTOMATION
KUBERNETES (Container Orchestration), HELM (Kubernetes Package Management)
ENVOY & ISTIO (Service Mesh Data and Control Planes)
HARSHICORP (Securing Credentials)
Knowledge on MicroServices Fundamentals & Patterns, Monitoring the MicroServices , Custom Alerting
Understanding of monitoring/telemetry solutions (Icinga, ELK, AppDynamics) for data ingestion and analysis
PROMETHEUS (Container Infrastructure Monitoring), ELK (Log Monitoring), RUM (Real User Monitoring), GRAFANA Monitoring Dashboard Tool
Mongo DB, Postgres, Oracle
Experience with Atlassian suite of products
AS Mentioned in the JD