Engineer the future of global finance. At Citi, our Tech team doesn't just support finance - we are helping to redefine it. Every day, $5 trillion crosses through our network. We do business in 180+ countries operating at a scale few can match. From deploying advanced AI to helping shape global markets, we build systems that matter. Look to join a team where your work helps influence economies, your ideas can drive innovation and outcomes, and your growth is backed by mentorship, continuous learning and flexibility with potential hybrid work opportunities. Help solve real-world challenges that touch millions and get the opportunity to build the future of finance with Citi Tech.
The SRE Observability Specialist is a hands on expert, delivering the future of Observability across Services Technology. This role is a part of a central SRE enablement team within Services Production, working closely with SREs, developers, and platform teams to embed telemetry, implement SLOs, and build meaningful visualizations for key production flows - particularly in critical Payments Business.
The ideal candidate will have deep technical knowledge, a collaborative mindset, and the ability to translate strategy into scalable engineering outcomes. You will also act as a bridge between Services Technology teams and central infrastructure/CTO teams, prioritising observability needs from line of business teams and driving improvements. A strong understanding of observability tooling, evolving AI/ML capabilities, and enterprise tooling ecosystems will be essential.
This role requires providing a technological support solution for Project Orion which provides end to end payment monitoring like building an end to end payments dashboard, toil reduction, transformation of legacy monitoring into an observability based monitoring solution, requiring good understanding of different payments taxonomy (ACH, wires, instant payments, etc.). Strong commercial awareness, technical credibility, and excellent communication skills are essential to negotiate internally, influence peers, and drive change. Some external communication may be necessary.
Key Responsibilities
- Define the roadmap for Engineering enablers for Project Orion team aligned with enterprise reliability and SRE Services organization goals.
- Translate organization strategy into an actionable delivery plan in partnership with Services Products, Operations & Engineering functions, delivering incremental, high value milestones.
- Understand critical business services functional scope and translate into end to end monitoring solutions.
- Deliver against the observability roadmap for Services Technology by building scalable, reusable telemetry solutions.
- Periodic review and analyse application monitoring toil and collaborate with stakeholders and remediate them as per organization goal.
- Create and maintain dashboards and visualisations for critical client journeys, including real time flows across payments.
- Guide line of business teams in implementing SLIs/SLOs, golden signals, and effective alerting to support operational excellence.
- Support integration and adoption of observability tooling across on prem, public cloud (AWS/GCP), and containerised environments (ECS, Kubernetes).
- Customise shared dashboards and observability components in partnership with CTI and other central engineering functions, ensuring usability and flexibility.
- Provide technical support and implementation guidance to SREs and developers facing integration or tooling challenges.
- Effectively manage the observability book of work for Services Technology and drive initiatives to reduce MTTD and improve recovery outcomes.
- Serve as a key connection point between line of business SREs and central infrastructure functions by gathering tooling feedback, surfacing systemic issues, and influencing platform enhancements via the Services Observability Forum.
- Stay current with observability trends, including AI/ML driven insights, anomaly detection, and emerging OSS practices, and assess their applicability.
- Maintain strong knowledge of observability platform features and vendor offerings to advise teams and maximise the value of tooling investments.
- Foster AI adoption by building use cases performed by Orion L1 functions and remediation using Citi AI tech stack.
Qualifications
- Experience in SRE, observability engineering, or platform infrastructure roles focused on operational telemetry.
- Hands on experience in observability tools and stacks such as Grafana, Prometheus, OpenTelemetry, ELK, Splunk, and similar platforms.
- Deep understanding of SLIs, SLOs, error budgets, and telemetry best practices in high availability environments.
- Proven ability to troubleshoot integration issues and support observability across hybrid platforms (on prem, cloud, containers).
- Experience building dashboards aligned to business outcomes and incident workflows, especially in critical flows like payments.
- Familiarity with modern observability tooling ecosystems, including AI/ML capabilities, trace correlation, baselining, and alert tuning.
- Strong interpersonal and collaboration skills - able to operate across federated engineering teams and central infrastructure groups.
- Experience in enablement or platform teams with a track record of scaling best practices across diverse business units.
Education
- Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
What we'll provide you
By joining Citi, you will not only be part of a business casual workplace with a hybrid working model (up to 2 days working at home per week), but also receive a competitive base salary (which is annually reviewed), and enjoy a whole host of additional benefits such as:
- 27 days annual leave (plus bank holidays)
- A discretionary annual performance related bonus
- Private medical care & life insurance
- Employee assistance programme
- Pension plan
- Paid parental leave
- Special discounts for employees, family, and friends
- Access to an array of learning and development resources
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, colour, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi's EEO Policy Statement and the Know Your Rights poster.