Yellow Pages Scraper
Yellow Pages Scraper is a tool that collects data from Yellow Pages using multi-threaded access over connections routed through proxy servers. The app can search for and check proxies with its own built-in functionality. Multi-threading lets the app do its job very fast.
One of our customers needed to collect data from Yellow Pages. Ordinary scrapers did not succeed.
In this case the solution is based on multi-threading, which makes all operations faster. The app also uses several sources of proxy servers so that Yellow Pages Scraper does not get banned. Some threads may hang while the scraper is working; we solve this problem with a system of timers running in separate threads. As a result, we have a very smart and powerful scraping tool: Yellow Pages Scraper.
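The combination of worker threads, rotating proxies, and timers can be sketched roughly as follows. This is a minimal illustration in Python, not the app's actual code: the names `fetch_with_watchdog` and `scrape`, the `fetch(page, proxy)` callback, and the timeout and retry values are all assumptions made for the example.

```python
import itertools
import threading
from concurrent.futures import ThreadPoolExecutor


def fetch_with_watchdog(fetch, page, proxy, timeout):
    """Run fetch(page, proxy) in a helper thread and wait at most `timeout` seconds."""
    outcome = {}

    def target():
        try:
            outcome["data"] = fetch(page, proxy)
        except Exception as exc:  # a dead proxy counts as a failed attempt
            outcome["error"] = exc

    worker = threading.Thread(target=target, daemon=True)
    worker.start()
    worker.join(timeout)  # the timer: stop waiting once the time limit passes
    if worker.is_alive() or "error" in outcome:
        return None  # hung or failed; the caller retries with another proxy
    return outcome.get("data")


def scrape(pages, proxies, fetch, timeout=10.0, workers=8, max_attempts=3):
    """Fetch every page in parallel, switching proxy after each failed attempt."""
    proxy_cycle = itertools.cycle(proxies)
    lock = threading.Lock()

    def next_proxy():
        with lock:  # the cycle is shared by all worker threads
            return next(proxy_cycle)

    def job(page):
        for _ in range(max_attempts):
            data = fetch_with_watchdog(fetch, page, next_proxy(), timeout)
            if data is not None:
                return page, data
        return page, None  # every attempt hung or failed

    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(job, pages))
```

In this sketch a hung request is abandoned rather than killed, because Python threads cannot be stopped from outside; the helper thread is marked as a daemon so it does not keep the program alive.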
Yellow Pages Scraper can perform the following actions:
- saves its status every minute,
- saves results after processing each page,
- can restore its work from any point,
- lets the user set up points and statuses, which makes it possible to re-read some categories/cities/pages and to lock some categories/cities for scanning,
- controls its workflow by itself; the statuses of the connections and of the data collector are displayed on the right panel.
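The saving and restoring of status listed above can be illustrated with a small checkpoint file. This is only a sketch under assumptions: the JSON layout, the `(category, city, page_number)` tuples, and the function names are hypothetical and not taken from the app. It saves after every page; a once-a-minute save would call `save_status` from a timer instead.

```python
import json
import os
import tempfile


def save_status(path, status):
    """Write the status atomically so a crash cannot leave a half-written file."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as fh:
        json.dump(status, fh)
    os.replace(tmp, path)  # swap the finished file into place in one step


def load_status(path):
    """Return the saved status, or an empty one on the first run."""
    if not os.path.exists(path):
        return {"done": [], "locked": []}
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)


def pending_pages(all_pages, status):
    """Pages still to scrape: not finished yet and not in a locked category."""
    done = {tuple(p) for p in status["done"]}  # JSON stores tuples as lists
    locked = set(status["locked"])
    return [p for p in all_pages if p not in done and p[0] not in locked]


def mark_done(path, status, page):
    """Record a finished page and persist the status right away."""
    status["done"].append(list(page))
    save_status(path, status)
```

With this layout, re-reading a page amounts to removing its entry from `done`, and locking a category amounts to adding its name to `locked`.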