About Us
We are a data-driven company that leverages web scraping at scale to provide actionable insights for businesses. Our Scraper Team is the backbone of our data acquisition strategy, building and maintaining sophisticated systems that collect, process, and structure data from across the web.
Job Overview
As a Backend Engineer on our Scraper team, you will support our data acquisition infrastructure by maintaining existing systems and implementing improvements to our web scraping platform. You will work under the guidance of Senior Backend Engineers to ensure the reliability and performance of our data collection services.
Key Responsibilities
Perform daily maintenance of scraper systems to ensure continuous operation
Troubleshoot and fix parser issues as they arise
Write and execute tests to validate scraper functionality
Implement new features and improvements as directed by Senior Backend Engineers
Assist with code reviews and documentation
Collaborate with team members to resolve technical challenges
Respond to emergency situations when scrapers fail or encounter issues
Participate in the implementation of platform-specific scrapers
Help optimize data extraction processes for efficiency and reliability
Contribute to automated monitoring and alerting solutions
Required Qualifications
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience
3+ years of experience in backend development
2+ years of experience in TypeScript
Basic understanding of HTTP/HTTPS protocols and web technologies
Knowledge of testing methodologies and practices
Familiarity with version control systems (Git)
Basic understanding of database systems, particularly NoSQL databases (MongoDB, Firestore, etc.)
Understanding of HTML, CSS, and DOM manipulation
Preferred Qualifications
Experience with web scraping libraries and tools (Puppeteer, Playwright, Selenium)
Knowledge of HTML, CSS, and JavaScript for web page analysis
Understanding of data processing and ETL concepts
Experience with automated testing frameworks
Familiarity with cloud infrastructure (AWS, GCP, or Azure)
Experience with containerization technologies (Docker)
Exposure to monorepo development environments
Basic understanding of anti-bot technologies and detection avoidance
What We Offer
Competitive salary and benefits package
Mentorship from experienced Senior Backend Engineers
Hands-on experience with cutting-edge data acquisition technologies
Collaborative team environment
Professional development opportunities
Flexible work arrangements
Clear career path to Senior Backend Engineer
Opportunity to work on impactful projects with real-world applications
Important Note
We are committed to ethical scraping practices and respect for website terms of service. Our scraping activities follow legal guidelines and industry best practices.