Pantheon has been at the forefront of private markets investing for more than 40 years, earning a reputation for an innovative approach to investing in secondaries, co investments, and primary fund investments, as well as capital formation across commingled funds, evergreen vehicles and customized solutions. Our specialist investment capabilities span multiple strategies across private equity, infrastructure and real assets, and private credit. Through our collaborative and committed culture, we find new ways to solve complex problems together and deliver innovative investment opportunities across private markets. Pantheon currently manages approximately $82.3 billion in AUM across all its strategies, serving more than 750 institutional and 638 private wealth clients worldwide.
Pantheon is in the process of building a cloud native, AI ready Data Platform based on the Databricks Lakehouse architecture, enabling analytics, operational use cases, and advanced ML/AI workloads. We require an experienced and passionate hands on Senior Data Engineer to design and implement new data pipelines for adaptation to business and/or technology changes. This role will be integral to the success of this program and establishing Pantheon as a data centric organisation.
You will be working with a modern Azure tech stack and proven experience of ingesting and transforming data from a variety of internal and external systems is core to the role.
You will be part of a small and highly skilled team, and you will need to be passionate about providing best in class solutions to our global user base.
Key Responsibilities
- Design, build, and maintain scalable, secure, and high performance data pipelines on Azure, primarily using Azure Databricks, Azure Data Factory, and Azure Functions.
- Develop and optimise batch and streaming data processing solutions using PySpark and SQL to support analytics, reporting, and downstream data products.
- Implement robust data transformation layers using dbt, ensuring well structured, tested, and documented analytical models.
- Collaborate closely with business analysts, QA teams, and business stakeholders to translate data requirements into reliable technical solutions.
- Ensure data quality, reliability, and observability through automated testing, monitoring, logging, and alerting.
- Lead on performance tuning, cost optimisation, and capacity planning across Databricks and associated Azure services.
- Implement and maintain CI/CD pipelines using Azure DevOps, promoting best practices for version control, automated testing, and deployment.
- Enforce data governance, security, and compliance standards, including access controls, data lineage, and auditability.
- Contribute to architectural decisions and provide technical leadership, mentoring junior engineers and setting engineering standards.
- Produce clear technical documentation and contribute to knowledge sharing across the data engineering function.
Knowledge & Experience Required Essential Technical Skills
- Python and PySpark for large-scale data processing.
- SQL (advanced querying, optimisation, and data modelling).
- Azure Data Factory (pipeline orchestration and integration).
- Azure DevOps (Git, CI/CD pipelines, release management).
- Lakehouse architecture (Databricks Unity Catalog, Delta Lake optimisation techniques such as Z ordering, liquid clustering).
- Data modelling (star schemas, data vault, or lakehouse aligned approaches).
- Data quality, testing frameworks, and monitoring/observability.
- Strong problem solving ability and a pragmatic, engineering led mindset.
- Experience in Agile SW development environment.
- Excellent communication skills, with the ability to explain complex technical concepts to both technical and non technical stakeholders.
- Leadership and mentoring capability, with a focus on raising engineering standards and best practices.
- Significant commercial experience (typically 5+ years) in data engineering roles, with demonstrable experience designing and operating production grade data platforms.
- Strong hands on experience with Azure Databricks, including cluster configuration, job orchestration, and performance optimisation.
- Proven experience building data pipelines with Databricks and Azure Data Factory; integrating with Azure native services (e.g. Data Lake Storage Gen2, Azure Functions).
- Advanced experience with Python for data engineering, including PySpark for distributed data processing.
- Strong SQL expertise, with experience designing and optimising complex analytical queries and data models.
- Practical experience using dbt in a production environment, including model design, testing, documentation, and deployment.
- Experience implementing CI/CD pipelines using Azure DevOps or equivalent tooling.
- Data as a Product mindset.
AI / ML / GenAI Enablement
- Enable ML/AI workloads on the Databricks data platform.
- Collaborate with AI Product Team to deliver use cases.
- Enable RAG pipelines / vector storage patterns to support AI products.
Desired Experience
- Financial services industry or private market experience.
- Development with coding agents (e.g., Anthropic Claude Code, OpenAI Codex, etc).
This job description is not to be construed as an exhaustive statement of duties, responsibilities, or requirements. You may be required to perform other job-related duties as reasonably requested by your manager.
Pantheon is an Equal Opportunities employer. We are committed to building a diverse and inclusive workforce so if you're excited about this role but your past experience doesn't perfectly align we'd still encourage you to apply.
We are committed to ensuring that all candidates have an equal opportunity to participate in the recruitment process. If you require any reasonable adjustments to accommodate your needs, please use this space to describe the nature of the adjustments you require.