
Data Scraping
Upwork
Remoto
•3 hours ago
•No application
About
hey, I need someone to get clean data from these sites without automation. although these are 3700 listings approx. I am even happy with 2000 high quality ones. Delivery is needed in 2-3 days in separate csv files for each link. might need a pdf file alongside each csv aswell. Looking forward to chat about what ideal profile of each high quality entry/lead should look like. format is just name of shop, address in Dehradun and the phone number. Links to scrape from: 1/ https://www.justdial.com/Dehradun/General-Stores-in-Haridwar-Bypass-Road/nct-10227851?trkid=160-remotecity-fcat-catsle&term=General+Stores&filters=%5B%7B%22e%22:%22100%22,%22v%22:%5B%22Relevance%22%5D%7D%5D&filtersApplied=%5B%7B%22mv%22:%2210000%22,%22v%22:%5B%22Relevance%22%5D%7D%5D&checkin=1758672000&checkout=1758758400 (general stores) 153+ Listings 2/ https://www.justdial.com/Dehradun/Printing-Services/nct-10855100 (printing services dehradun) 850+ Listings 3/ https://www.justdial.com/Dehradun/Plumbing-Contractors/nct-10378056 (plumbing contractors) 298+ Listings 4/ https://www.justdial.com/Dehradun/Paan-Shops/nct-11271038?trkid=20905-remotecity&term=Paan%20Shops (paan shops) 194+ Listings 5/ https://www.justdial.com/Dehradun/Stationary-Shops/nct-10453443?trkid=37685-remotecity-fcat-catsle&term=stationary%20shops (stationary shops) 1110+ listings 6/ https://www.justdial.com/Dehradun/Cleaning-Services/nct-10101229?trkid=2037-remotecity&term=Cleaning%20Services (cleaning services) 229+ Listings 7/ https://www.justdial.com/Dehradun/Electricians/nct-10184166 (electricians) 1076+ Listings 8/ https://www.google.com/maps/search/small+general+stores/@30.3076208,77.9645844,13.33z/data=!4m2!2m1!6e6?entry=ttu&g_ep=EgoyMDI1MDkxNy4wIKXMDSoASAFQAw%3D%3D 9/ https://www.google.com/maps/search/shops/@30.3359653,77.962457,19.54z/data=!4m2!2m1!6e6?entry=ttu&g_ep=EgoyMDI1MDkxNy4wIKXMDSoASAFQAw%3D%3D 10/ https://www.google.com/maps/search/shops/@30.3342679,77.9605894,17.69z/data=!4m2!2m1!6e6?entry=ttu&g_ep=EgoyMDI1MDkxNy4wIKXMDSoASAFQAw%3D%3D