Software Engineer – Web Crawling
Woflow | Full Time
Mid-level (3 to 4 years)
Candidates should have experience designing, building, and maintaining web scrapers. Familiarity with advanced scraping techniques, including fingerprint management (cookies, headers, user-agent rotation, proxies), is required. Experience handling dynamic content, complex DOM structures, and session/cookie lifecycles is also necessary. The role requires the ability to develop monitoring solutions and alerting frameworks. Staying up to date with industry trends and evolving bot-detection tactics is important.
The Web Scraping Engineer will be responsible for refactoring and maintaining existing web scrapers to improve reliability and efficiency, implementing best coding practices, and utilizing advanced techniques to avoid detection. This role involves collaborating with cross-functional teams to gather requirements and ensure data quality, as well as providing support and documentation to internal stakeholders. Additionally, the engineer will monitor and troubleshoot scraper performance, diagnose bottlenecks, and drive continuous improvement by proposing new tooling and methodologies.
Data analysis and research services provider
YipitData provides data analysis and research services by converting large amounts of raw data from various sources into clear and actionable insights. These insights help clients, primarily investors and corporate entities, understand market trends and company performance. YipitData operates on a subscription model, offering clients regular datasets and reports that include updates on various sectors such as autos, marketplaces, and logistics. Each subscription provides tailored information with three layers of coverage to meet specific client needs. The company differentiates itself by offering deep dives into topics, continuous tracking of metrics, and data integration best practices, ensuring clients have access to relevant and detailed information. The goal of YipitData is to empower clients to make informed decisions based on comprehensive data analysis.