Software Data Engineer
Axsome Therapeutics Inc- Full Time
- Senior (5 to 8 years), Junior (1 to 2 years)
Candidates are required to possess over 5 years of experience leveraging Python for scripting, automation, data analysis, data engineering, and/or data science, along with over 3 years in life sciences, either through bench work in Biopharma, bioinformatics, or within a vendor space. Applicants should demonstrate a passion for science and a commitment to building solutions that make data more accessible to end-users, coupled with intellectual curiosity and resilience. Excellent communication skills and attention to detail are also essential.
The Scientific Data Architect will translate scientific data workflows into solutions utilizing the Tetra Data Platform, owning and implementing solutions ranging from programmatic interrogation of instrument output files to data structure design using JSON schemas and Python-based parser development. They will integrate lab software via APIs, create data visualizations using tools like Jupyter Notebooks, Streamlit, and plotting libraries, and collaborate with customers to understand requirements and test solutions. This role involves proactively communicating implementation progress, facilitating sprint planning, and assisting the product team in prioritizing the roadmap based on customer feedback. Additionally, the architect will lead retrospective meetings and identify opportunities for productization.
Cloud platform for scientific data management
TetraScience offers a cloud-based platform called the Scientific Data Cloud, which helps biopharmaceutical companies manage and harmonize their scientific data for research and development, quality assurance, and manufacturing. The platform connects various lab instruments and software, streamlining data management and significantly reducing task completion time. TetraScience's vendor-neutral and open design allows it to work with any lab equipment, making it a flexible solution in the life sciences sector. The company's goal is to enhance scientific outcomes by preparing data for artificial intelligence and machine learning applications.