Robust Data collection/scraping powered by AWS

Completato Pubblicato 5 anni fa Pagato alla consegna
Completato Pagato alla consegna

Requirements:

1. Continuously and reliably scrape and collect job posting data from websites like indeed, dice, careerbuilder, Monster, etc. (any one or two would be sufficient). The best solution would be rotating among those sites.

2. It queries jobs based on a randomly generated combination of keywords, such as "java, Dallas Texas".

3. It should be disruption-free and utilize AWS Spot EC2 instance to power the scraping. That means, the solution should include programmatically create a spot instance and start working there.

4. The collected data should be saved to a central server, in a format of zipped csv file or Mongodb.

Servizi web di Amazon NoSQL Couch & Mongo Parallel Processing Web Scraping

Rif. progetto: #16990869

Info sul progetto

4 proposte Progetto a distanza Attivo 5 anni fa

Assegnato a:

zekovicm

Hi there,I am Miljan,web scraping expert from Bosnia & Herzegovina,Europe. I have carefully gone through with your requirements and I would like to help you with this job ! I can start immediately and finish it within Altro

$155 USD in 3 giorni
(71 valutazioni)
6.7

4 freelance hanno fatto un'offerta media di $176 per questo lavoro

mantislin

Hi sir, This is Lin and I am scraping expert, i have checked all details for your project. can we discuss more info then i can provide example data for you? Please message me then we can discuss more ASAP. Altro

$172 USD in 5 giorni
(260 valutazioni)
7.5
cyberskytech

We are a small team of experienced IT professionals who excel in System Engineering, DevOps, Cloud, Web Development and Cyber Security. Our primary goal is to provide the best solutions for the least cost. Managed I Altro

$155 USD in 10 giorni
(3 valutazioni)
2.2