Senior Database Administrator

  • Kraken
  • 23/05/2026
Full time Information Technology Telecommunications

Job Description

The opportunity
  • Scale and tune high-throughput PostgreSQL clusters that support continued global expansion and new product initiatives.
  • Own PostgreSQL reliability fundamentals: WAL behavior, checkpoints, autovacuum, query planning, locking, replication lag, backup/restore, and capacity planning.
  • Help build and operate vector data capabilities, including pgvector and dedicated VectorDB platforms where appropriate.
  • Define production patterns for embeddings, approximate nearest neighbor indexes, metadata filtering, recall/latency tradeoffs, reindexing, data freshness, and drift management.
  • Strengthen high-availability, disaster-recovery, PITR, and backup approaches through sound design and regularly validated procedures.
  • Reduce manual operational work by building automation, improving process consistency, and enabling safe, low-friction database workflows.
  • Improve observability and alert quality by championing meaningful metrics, reducing noise, and ensuring operational clarity across PostgreSQL and vector workloads.
  • Enhance database security through robust access controls, disciplined patching and upgrade practices, encryption, auditability, and secure operational patterns.
  • Contribute to modern platform initiatives, including containerized environments, infrastructure-as-code workflows, GitOps, reproducible deployments, and self-service database operations.
  • Partner with service, AI, and platform teams to drive better performance patterns, operational readiness, data hygiene, and safe use of vector search across the engineering organization.
  • Participate in on-call rotations with a long-term focus on making on-call predictable, well instrumented, and shaped by preventative engineering.
Skills you should HODL
  • 5+ years operating PostgreSQL in high-volume production environments, including performance tuning, replication, backup/restore, upgrades, and incident troubleshooting.
  • Strong understanding of PostgreSQL internals and operations: MVCC, transaction isolation, locks, WAL, checkpoints, autovacuum, bloat, statistics, query planner behavior, partitioning, and index strategy.
  • Hands on experience with high availability and read scaling: streaming replication, replication slots, failover, lag management, PITR, backups, and disaster recovery drills.
  • Experience with connection pooling and traffic management for PostgreSQL, especially PgBouncer, HAProxy, Kubernetes service routing, or comparable patterns.
  • Practical VectorDB or vector search experience, such as pgvector, Qdrant, Milvus, Weaviate, OpenSearch vector search, or similar systems.
  • Ability to reason about vector index types and operational tradeoffs, including HNSW/IVFFlat style indexes, recall, latency, memory, ingestion throughput, metadata filters, rebuilds, and versioned embeddings.
  • Practical experience with CI/CD, GitOps, and Infrastructure as Code workflows. Terraform experience is ideal.
  • Solid cloud, Linux, storage, and networking fundamentals.
  • Experience with containers and orchestration platforms, including building container images and managing Kubernetes workloads at scale.
  • Strong security instincts around access control, credential lifecycle, encryption, auditability, upgrade processes, and safe operational workflows.
  • Observability expertise: monitoring, alerting hygiene, SLOs, dashboards, query level visibility, and readiness for incident response.
  • Strong communication and collaboration skills with the ability to partner with stakeholders, negotiate long term plans, write formal documentation, and tie success to specific metrics and KPIs.
Nice to haves
  • Experience with SRE methodologies such as error budgets, operational reviews, reliability programs, and failure exercises.
  • Strong scripting or programming ability, preferably Python, Go, or Rust, used to build automation and internal tools.
  • Hands on experience with GitOps tooling such as ArgoCD, GitHub Actions, or GitLab CI.
  • Exposure to multi region, multi datacenter, or active/passive disaster recovery designs.
  • Experience supporting AI, retrieval, personalization, fraud, search, or recommendation systems that depend on embeddings or hybrid search.
  • Interest in cryptocurrency or decentralized systems.

Please note, applicants are permitted to redact or remove information on their resume that identifies age, date of birth, or dates of attendance at or graduation from an educational institution.

We consider qualified applicants with criminal histories for employment on our team, assessing candidates in a manner consistent with the requirements of the San Francisco Fair Chance Ordinance.

As an equal opportunity employer, we don't tolerate discrimination or harassment of any kind. Whether that's based on race, ethnicity, age, gender identity, citizenship, religion, sexual orientation, disability, pregnancy, veteran status or any other protected characteristic as outlined by federal, state or local laws.