Join us, and enhance system reliability with advanced monitoring solutions!
Krakow-based opportunity with the possibility to work 70% remotely!
As an Observability Engineer, you will be working for our client, a leading global financial services organization that focuses on enhancing critical IT infrastructure services. The project you’ll be working on is centered around monitoring and observability for one of their key applications. Your role will involve building and implementing monitoring solutions, providing consultancy, and collaborating with internal teams to ensure the performance and reliability of the system. The team is highly collaborative and committed to continuously improving the monitoring and observability services for mission-critical applications.
Your main responsibilities: Implementing and maintaining observability and monitoring frameworks
- Collaborating with application teams to set up observability for their infrastructure and applications
- Designing and optimizing dashboards, visualization, and self-healing solutions
- Building performance and tracing solutions using Spunk, AppDynamics, and ThousandEyes
- Engineering and establishing standards for functional components, including agent deployments and application tuning
- Automating operational tasks through scripting and seeking integration opportunities
- Assisting in training sessions to promote tool adoption and best practices
- Providing input for improving global monitoring and observability operating models
- Adhering to HSBC policies and raising concerns on potential issues
- Continuously evolving monitoring tooling towards a self-service automated platform
You’re ideal for this role if you have:
- 2+ years of experience working with Splunk, AppDynamics, or ThousandEyes
- Experience with application development (preferably Java) at an enterprise level
- Knowledge of cloud technologies such as AWS or GCP
- Experience with Kubernetes, OpenShift, PCF, and other architecture tech stacks
- Familiarity with monitoring and observability solutions, including server and network performance
- Practical knowledge of distributed service design, messaging protocols, and autonomous software design practices
- Strong understanding of application performance metrics and KPIs
- Experience with event management tools and operational automation like AIOps
- Ability to develop and optimize monitoring extensions using REST API
- Excellent communication skills and the ability to work independently and in team
It is a strong plus if you have:
- Experience working with ServiceNow, Confluence, and Jira
- Knowledge of machine learning and AI/ML concepts in relation to observability
- Familiarity with Elasticsearch, Grafana, Prometheus
- Experience defining and supporting monitoring dashboards for mission-critical applications
- Technical writing experience for queries, reports, and presentations
#GETREADY to meet with us!
We would like to meet you. If you are interested please apply and attach your CV in English or Polish, including a statement that you agree to our processing and storing of your personal data. You can always also apply by sending us an email at recruitment@itds.pl.
Internal number #6198
Adres:
SKYLIGHT BUILDING | ZŁOTA 59 | 00-120 WARSZAWA
BUSINESS LINK GREEN2DAY BUILDING | SZCZYTNICKA 11| 50-382 WROCŁAW
Kontakt:
INFO@ITDS.PL
+48 883 373 832