Get daily remote job opportunities in your inbox

No middlemen, no spam, no infinite scrolling.

Get relevant job opportunities, one email at a time.

Unsubscribe at any time.

Data Engineer (Web Scraper) Intern @Clootrack Software Labs Private Limited

[Hiring] Data Engineer (Web Scraper) Intern @Clootrack Software Labs Private Limited

Mar 31, 2025 - Clootrack Software Labs Private Limited is hiring a remote Data Engineer (Web Scraper) Intern. 💸 Salary: unspecified. 📍Location: India.

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

We're looking for a skilled Web Scraping Data Engineer (Intern) to design and implement robust data extraction systems. In this role, you'll develop scalable crawling architectures to collect high-quality data while ensuring compliance with ethical standards and data regulations.

  • Design and maintain efficient web crawling systems using frameworks like Scrapy, Playwright, or Selenium
  • Implement data processing pipelines to clean, normalize, and structure extracted content
  • Optimize crawling strategies to improve efficiency while respecting website policies
  • Develop monitoring systems to identify and resolve scraping issues quickly
  • Deliver high-quality datasets for analysis and model training
  • Implement storage solutions for large-scale data management
  • Ensure compliance with data regulations and ethical scraping practices

Qualifications

  • Strong Python programming experience
  • Good to know SQL
  • Hands-on experience with web scraping tools (BeautifulSoup, Scrapy, Selenium)
  • Proficiency with HTML, JavaScript, and HTTP protocols
  • Experience with data processing libraries (pandas, PySpark)
  • Familiarity with Linux/UNIX environments
  • Knowledge of version control systems and code review practices
  • Strong problem-solving abilities and attention to detail
  • Excellent communication skills (written and verbal English)

Requirements

  • Familiarity with AI frameworks (Hugging Face, LangChain, OpenAI)
  • Familiarity with LLM training pipelines and data requirements
  • Experience with text data augmentation and synthetic data generation

Preferred Qualifications

  • Experience with large-scale distributed crawling systems
  • Knowledge of proxy management and anti-bot evasion techniques
  • Familiarity with cloud platforms (AWS, GCP, Azure)
  • Experience with containerization (Docker, Kubernetes)

Benefits

  • Opportunity to work on cutting-edge data collection projects
  • Collaborative environment with talented engineers
  • Competitive compensation package
  • Professional growth and development opportunities

Similar Remote Jobs

More jobs at Clootrack Software Labs Private Limited

More Software Development jobs

More jobs in India

Before You Apply
📍 Be aware of the location restriction for this remote position: India
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Data Engineer (Web Scraper) Intern @Clootrack Software Labs Private Limited
Software Development
Salary 💸 unspecified
Remote Location
India
Job Type unspecified
Posted Mar 31, 2025
Apply for this position Unlock 54,631 Remote Jobs
📍 Be aware of the location restriction for this remote position: India
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Data Engineer (Web Scraper) Intern Apply for this position Unlock 54,631 Remote Jobs
×
  • Unlock 54,631 hidden remote jobs.
  • Your shortcut to remote work. Apply before everyone else.
  • Click and apply. No middlemen, no hassle.

We’re not like the other sites. Come see why!

50% off in April 2025
  • Single payment
  • Lifetime access
  • Filter by location/skills/salary…
  • Create custom email alerts
  • Private Slack Community