GitLab

Data System Architect, Data Engineering & Monetization

Remote

Compensation: Not Specified
Experience Level: Expert & Leadership (9+ years)
Job Type: Full Time
Visa: Unknown
Industries: Software Development, DevSecOps, AI

About GitLab

GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating human progress. Our platform unites teams and organizations, breaking down barriers and redefining what's possible in software development. Thanks to products like Duo Enterprise and Duo Agent Platform, customers get AI benefits at every stage of the SDLC.

The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier, with all team members expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact. GitLab is where careers accelerate, innovation flourishes, and every voice is valued. Our high-performance culture is driven by our values and continuous knowledge exchange, enabling our team members to reach their full potential while collaborating with industry leaders to solve complex problems. Co-create the future with us as we build technology that transforms how the world develops software.

An Overview of This Role

Join GitLab as Data System Architect to drive our strategic data platform evolution. You'll architect scalable, distributed solutions that transform how we manage and leverage data across our SaaS and self-managed deployments, supporting enterprise-scale growth and innovation.

What You’ll Do

  • Drive architectural vision for scalable, distributed data systems across SaaS and self-managed deployments, designing database stack solutions that meet OLTP/OLAP performance and scalability requirements
  • Define enterprise data product standards and governance frameworks including data lineage, SDLC, versioning, and compliance practices for regulated environments
  • Build governed, monetizable data services and APIs that support both internal operations/analytics and external SaaS product offerings with semantic structure
  • Partner with product and engineering teams to embed modern agentic and AI-driven patterns into data infrastructure and customer-facing solutions
  • Architect event-driven systems and cross-stack orchestration supporting hybrid transformations through tools like Argo, Airflow, and Kubernetes with unified metadata-rich telemetry
  • Design end-to-end data lifecycle architecture covering integration, pipelines, transformation workflows, and consolidated metadata systems across multiple platforms
  • Establish CI/CD best practices for data systems ensuring reliable deployment, monitoring, and maintenance across diverse deployment models
  • Transform ambiguity into strategic roadmaps and lead complex technical engagements where data architecture creates competitive differentiation

What You’ll Bring

  • Experience architecting large-scale distributed data systems in complex, regulated domains with unified platforms integrating cloud-native compute, orchestration, and semantic modeling
  • Demonstrated leadership building multi-modal data services with strong developer experience principles, focusing on monetization, governance, and data product lifecycle management
  • Hands-on expertise with modern data stack technologies including Python, Docker, Airflow, Trino, Postgres, distributed query engines, and graph-based metadata systems, and with integrating them into the GitLab ecosystem of Ruby on Rails and Go services
  • Advanced knowledge bridging cloud and on-premises deployments with automation, developer self-service focus, and data integration through connector marketplaces
  • Deep understanding of data processing paradigms and standards including synchronous vs. asynchronous processing, schema management, logical data modeling, and formats like OpenTelemetry, OpenMetadata, and OpenLineage
  • Experience with AI-driven architectures and emerging technologies including model orchestration

Skills

Data Architecture
Distributed Systems
Data Platform Evolution
SaaS
Self-managed Deployments
OLTP
OLAP
Data Governance
Data Lineage
SDLC
Data Monetization
API Design

GitLab

Unified DevOps platform for software development

About GitLab

GitLab offers a DevOps platform that simplifies the software development process by providing a single application for collaboration, visibility, and speed. The platform integrates various tools needed for software development, which helps teams manage their projects more efficiently without juggling multiple tools. This allows companies to concentrate on enhancing their products instead of spending too much time on builds. GitLab serves a wide range of clients, including large corporations from different industries, demonstrating its versatility. The company operates on a subscription-based model, where clients pay for access to the platform, which includes features for continuous integration and deployment. GitLab also provides free trials and regularly updates its platform to deliver ongoing value to its users. By customizing its offerings and partnering with other technology providers, GitLab aims to enhance its ecosystem and drive revenue.

Headquarters: San Francisco, California
Year Founded: 2014
Total Funding: $421.8M
Company Stage: IPO
Industries: Consulting, Enterprise Software
Employees: 1,001-5,000

Benefits

Spending Company Money
Equity Compensation
Life Insurance
Financial Wellness
Paid Time Off
Growth and Development Benefit
GitLab Contribute
Business Travel Accident Policy
Immigration
Employee Assistance Program
Incentives
All-Remote
Part-time contracts
Meal Train
Fertility & Family Planning
Parental Leave

Risks

AI-powered coding assistants like Claude pose a competitive threat to GitLab's platform.
Potential sale to Datadog may lead to strategic shifts misaligned with customer expectations.
Integration of Oxeye may distract from GitLab's core DevOps offerings.

Differentiation

GitLab offers a unified DevOps platform, reducing complexity in software development.
The platform integrates tools for collaboration, visibility, and speed, enhancing development processes.
GitLab's open-source model fosters continuous innovation with a large developer community.

Upsides

Acquiring Oxeye enhances GitLab's cloud security, appealing to security-conscious enterprises.
Partnership with Ooredoo Kuwait expands GitLab's influence in the telecommunications sector.
Potential sale to Datadog could create strategic synergies and expand market reach.
