Job Description
Job Overview
- The Enterprise Monitoring Engineer will manage various open-source and cloud-native monitoring tools and act as the single point of contact for operational activities and initiatives related to monitoring and observability tools.
Job Responsibilities
- Work closely with various IT teams to provide ongoing operational support for the monitoring tools (Grafana, Prometheus, ELK, InfluxDB, PromQL, Azure Monitor, AWS CloudWatch).
- Ensure that the availability of applications exceeds the defined service levels.
- Maintain the Enterprise Management solutions environment during business hours and off-hours.
- Review performance regularly, and act and communicate proactively to avoid disruption to services.
- Lead new initiatives and process improvements.
- Prepare, review, and send various KPI reports.
- Perform QA checks and review Enterprise Management systems/applications activities on a daily, weekly, and monthly basis.
- Share innovative and value-driven ideas.
- Troubleshoot and resolve application issues.
- Create and maintain the necessary Enterprise Management Systems operational procedures, training materials, and documentation.
- Ensure that changes to applications are performed within the architecture and change guidelines.
- Write scripts to automate the operational activities.
- Provide first-level support for incidents and problems.
- Ensure that incidents and problems do not impact the agreed service levels.
Education
- Bachelor of Engineering/Technology.
- ITIL, Agile, and Scrum certifications.
Experience
- 5+ years of IT experience
- Working knowledge of Grafana, Prometheus, ELK, InfluxDB, PromQL, Azure Monitor, and AWS CloudWatch.
- Working knowledge of UNIX/Windows/SQL/Oracle.
- Knowledge of AWS, OCI, and Azure.
- Familiarity with supporting applications.
- Familiarity with ITIL processes.
- Automation skills using any automation tools/scripts.
Behavioural Competencies
- Strong customer-facing skills. Strong ability to interact with internal & external customers effectively using various communication channels.
- Willingness to work night shifts, weekends, and public holidays.
- Self-motivated and able to adapt to the working environment.
- People-oriented, capable of handling stressful situations.
- Interested in keeping up to date with the latest technology trends.
- Excellent analytical, troubleshooting & vendor co-ordination skills.
- Excellent ability to learn & adapt to change.