Data Scientist
AE StudioFull Time
Junior (1 to 2 years)
Candidates must possess a current/active TS/SCI security clearance and be willing to obtain a CI polygraph. A Bachelor's degree in data science, computer science, engineering, statistics, GIS, or a related discipline is required, with an additional 2 years of experience accepted as a substitute for the degree. The role demands 5 years of professional experience in data science, analytics, or data engineering, proficiency in Python and SQL, and familiarity with R. Experience with data wrangling libraries like pandas and PySpark, working with APIs or batch processing tools, and familiarity with ETL pipelines and orchestration tools such as Apache Airflow or Clairvoyant are necessary. Candidates should be comfortable working with large datasets in distributed environments like Spark, have a foundational understanding of containers like Docker for deploying data workflows, and possess exposure to cloud platforms like AWS, Azure, or GCP, particularly in data-related services. Strong attention to detail and documentation practices are also essential.
The Mid-Level Data Scientist will collaborate with senior data scientists to prepare, clean, and process structured and unstructured datasets for analysis. Responsibilities include automating ETL workflows and streamlining repetitive data preparation tasks using Python, SQL, and scripting tools, as well as operating within big data ecosystems using tools like Spark, Hadoop, or their cloud-native equivalents. The role involves assisting in the development and deployment of data pipelines, implementing basic statistical analyses, visualizations, and reporting, and maintaining detailed documentation of data preparation methods, scripts, and pipeline configurations. Additionally, the position supports the integration of data into downstream modeling and LLM/NLP workflows, contributes to operationalizing data products, APIs, and internal data access tools, and works within containerized environments and CI/CD pipelines. Exposure to knowledge management systems supported by LLMs and assistance in tagging or curating training datasets for LLM models are also part of the role.
Earth intelligence and space infrastructure solutions
Maxar Technologies specializes in Earth intelligence and space infrastructure, providing essential solutions for both government and commercial clients. The company offers services that help clients monitor and understand changes on our planet, including global broadband communications and advanced capabilities for space exploration. Maxar utilizes its extensive experience and commercial technology to deliver solutions that are fast, scalable, and cost-effective. Unlike many competitors, Maxar focuses on delivering precise and reliable data that supports informed decision-making and strategic planning. The company's goal is to generate value through contracts and partnerships, ensuring clients have the information they need to navigate complex global challenges.