Senior Software Engineer, Data
Flex- Full Time
- Senior (5 to 8 years)
Candidates should have 8+ years of experience designing and building scalable, distributed systems. Strong programming skills in Java, Scala, or C++ with an emphasis on performance and reliability are required. A deep understanding of distributed transaction processing, concurrency control, and high-performance query engines is essential. Experience with open-source data lake formats such as Apache Iceberg, Parquet, or Delta, along with challenges associated with multi-engine interoperability, is necessary. Familiarity with data governance, security, and access control models in distributed data systems is also important.
As a Senior Software Engineer, you will design and implement scalable, distributed systems to support Iceberg DML/DDL transactions, schema evolution, and partitioning. You will architect systems that integrate Snowflake queries with external Iceberg catalogs and various data lake architectures. Your role will involve developing high-performance solutions for catalog federation and collaborating with Snowflake’s open-source team and the Apache Iceberg community. Additionally, you will work on core data access control features for Polaris and contribute to the managed Polaris service, ensuring external query engines can interact with Iceberg tables. Finally, you will build tooling and services that automate data lake table maintenance for enhanced performance.
Data management and analytics platform
Snowflake provides a platform called the AI Data Cloud that helps organizations manage and analyze their data. This platform allows users to store and process large amounts of data efficiently, offering services like data warehousing, data lakes, data engineering, data science, and data sharing. Snowflake's system works by uniting data from different sources, enabling secure sharing and performing various types of analytics. What sets Snowflake apart from its competitors is its ability to operate seamlessly across multiple public clouds, allowing users to access their data from anywhere. The company's goal is to help businesses leverage their data for better decision-making by providing a flexible subscription-based service that scales according to their needs.