Analytics Engineer
Machinify- Full Time
- Mid-level (3 to 4 years)
Candidates should have at least 3 years of experience as a data engineer and 8 years of any software engineering experience, including data engineering. Proficiency in at least one programming language commonly used in Data Engineering, such as Python, Scala, or Java is required. Experience with distributed processing technologies and frameworks like Hadoop, Flink, and distributed storage systems such as HDFS or S3 is essential. Expertise with ETL schedulers like Airflow, Dagster, or Prefect is also necessary, along with a solid understanding of Spark and the ability to write, debug, and optimize Spark code.
The Analytics Data Engineer will design, build, and manage data pipelines, ensuring user event data is integrated into the data warehouse. They will develop canonical datasets to track key product metrics, work collaboratively with various teams to understand their data needs, and implement robust systems for data ingestion and processing. Additionally, they will participate in data architecture and engineering decisions while ensuring data security, integrity, and compliance with industry standards.
Develops safe and beneficial AI technologies
OpenAI develops and deploys artificial intelligence technologies aimed at benefiting humanity. The company creates advanced AI models capable of performing various tasks, such as automating processes and enhancing creativity. OpenAI's products, like Sora, allow users to generate videos from text descriptions, showcasing the versatility of its AI applications. Unlike many competitors, OpenAI operates under a capped profit model, which limits the profits it can make and ensures that excess earnings are redistributed to maximize the social benefits of AI. This commitment to safety and ethical considerations is central to its mission of ensuring that artificial general intelligence (AGI) serves all of humanity.