Data Scientist II, Tissue Engineering
Valo HealthFull Time
Mid-level (3 to 4 years), Senior (5 to 8 years)
Candidates must have over 3 years of experience in Python and SQL and possess a PhD degree in biology, chemistry, or a related field. Extensive wet lab experience, particularly with HPLC and Mass Spec, and familiarity with specific LC instrument software from Waters, Thermo Fisher, Cytiva, Shimadzu, Sciex, or Agilent are required. Experience with data plotting and dashboarding tools like Streamlit, Tableau, or Jupyter Notebook is a plus. Excellent communication, attention to detail, confidence, proactive problem-solving skills, high bandwidth for managing multiple projects, intellectual curiosity, creative thinking, and a team-oriented, hands-on approach are essential.
The Scientific Data Engineer will research data acquisition strategies for scientific lab instrumentation and develop file parsers for instrument output files. Responsibilities include designing and building data models, data pipelines, unit tests, integration tests, and reusable utility functions. The role involves cross-analyzing instrument data to design common data model components and building visualizations, reports, and dashboards using tools like Streamlit, Tableau, and Jupyter notebooks. A key aspect of the role is driving customer value by testing solutions to ensure they meet requirements and deliver tangible benefits.
Cloud platform for scientific data management
TetraScience offers a cloud-based platform called the Scientific Data Cloud, which helps biopharmaceutical companies manage and harmonize their scientific data for research and development, quality assurance, and manufacturing. The platform connects various lab instruments and software, streamlining data management and significantly reducing task completion time. TetraScience's vendor-neutral and open design allows it to work with any lab equipment, making it a flexible solution in the life sciences sector. The company's goal is to enhance scientific outcomes by preparing data for artificial intelligence and machine learning applications.