Senior Site Reliability Engineer

  • iManage
  • City, Belfast
  • 11/05/2026
Full time Information Technology Telecommunications Java Python

Job Description

Senior Site Reliability Engineer - iManage

SRE is part of a global organization that leverages the latest technology to communicate with our colleagues across the globe. We organize ourselves into distributed teams - SRE teams are anchored to iManage offices across the globe. Tuesdays and Thursdays are dedicated to in office collaboration, rapid innovation, and developing a sense of belonging at iManage. Mondays and Fridays are reserved for focus time to get things done. Have the best of both work styles in a workplace that is intentional about belonging, collaboration, and accomplishment.

Being a Senior Site Reliability Engineer at iManage means

You are an engineer, a builder, and a systems thinker. You'll create middleware and platform guardrails that empower developers to innovate quickly and reliably. You combine deep technical judgment with empathy to eliminate customer pain, especially when working with enthusiastic teams stewarding the world's most privileged data.

You uplift those around you, act as a subject matter expert, mentor others, and drive change. You chase contributing factors over root causes, value code over documentation, and documentation over process. You'll engage in and often lead architectural discussions, reduce toil, and deliver scalable, resilient platforms that support our customers and organization.

As a Senior SRE, you'll help scale our cloud platform, collaborate across teams to promote standardization and resiliency, and participate in on call rotations. You'll become a key voice in observability, change management, and service scalability, providing guidance during complex technical decisions and high impact events.

iManage is experiencing explosive growth in its flagship cloud product. We're seeking senior software and systems engineers specializing in reliability and platform services to join our transformative cloud journey. This requires rethinking technical decisions with a beginner's mindset and a focus on resilience and sustainability. If you write code, think in systems, embrace complexity and automation, and are passionate about service resilience and scalability - we want to talk to you.

sRE Responsibilities
  • Eliminate TOIL through automation and software development.
  • Partner cross functionally with application teams and internal stakeholders.
  • Create a modern, cloud native platform that is resilient, cost effective, and secure by default.
  • Scale cloud infrastructure to support our Kubernetes based ecosystem.
  • Maintain the freshness and utility of platform services.
  • Improve the security posture of our products.
  • Design automation, orchestration, observability, and disaster readiness into our products.
  • Participate in production support and on call rotations, providing senior level guidance during critical events.
  • Lead incident management and post incident retrospectives, coaching teams in these practices.
Qualifications
  • Experience writing design documents, postmortems, and refactoring application code.
  • Built automation to reduce operational burden or developed internal SaaS tools.
  • Ability to advocate for SRE principles (e.g., SLOs vs SLAs) and introduce them effectively.
  • Experience in public cloud or hosted datacenter environments (Azure and AKS preferred).
  • A passion for collaborative teamwork and influencing reliability best practices across teams.
Bonus Points
  • Hands on experience with Linux server stacks (Ubuntu/Debian preferred).
  • Knowledge of cloud provisioning platforms (Terraform preferred).
  • Exposure to configuration management tools (Chef preferred).
  • Experience with containerization/clustering technologies (Docker preferred).
  • Familiarity with observability and alerting tools (Prometheus/Grafana or ELK/EFK).
  • Practical experience with CI/CD pipelines and rollout strategies.
  • A bachelor's degree (or equivalent experience) in Computer Engineering or related field.
  • Proficiency in one or more programming languages (e.g., Java, Python, Golang).
  • Familiarity with scripting languages (e.g., PowerShell, Bash, Python, Ruby).
Benefits
  • Creating an inclusive environment where you're encouraged to help shape the culture.
  • Market leading salary determined through a fair and consistent process, equitable for all employees.
  • Annual performance based bonus.
  • Enhanced parental leave (20 weeks for primary and 10 weeks for secondary caregiver at 100% pay).
  • Matching pension contribution (up to 6%).
  • Private medical insurance and cash plan.
  • Group life cover, income protection, and critical illness protection.
  • Flexible time off policy, 25 days of annual leave with additional flexibility.
  • Wellness days each year to prioritize mental health and well being.
  • Access to RethinkCare, a global behavioral health platform.

We welcome those who come with a growth mindset and a hunger for learning; if you are excited about this role but your past experience doesn't align perfectly with every qualification, we encourage you to apply anyway.

iManage is committed to providing an excellent candidate experience and will never ask you to engage in recruitment activity via text and exclusively communicate from emails using domain. If you have any concerns or questions about communications you have received, please send them to so our team members can review.

iManage provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.