WebHarvy is an application that automatically extracts data from web pages and saves the extracted content in different formats.
With WebHarvy, capturing data from web pages is as easy as navigating to the pages that contain the data and clicking on the items to be captured. WebHarvy automatically identifies patterns of data occurring in web pages. You can extract data such as product catalogues or search results from websites in many categories, including real estate, e-commerce, academic research, entertainment, and technology. When a website spreads data such as search results across multiple pages, WebHarvy can automatically crawl those pages and extract data from each of them.
Here are some key features of "SysNucleus WebHarvy":
- Easy to use: start scraping within minutes
- Extract data from multiple pages/categories/keywords
- Save extracted data to file or database
- Built-in scheduler and proxy support
- Point-and-click interface
- Regular expression support
- Automatic pattern detection
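
WebHarvy offers these features through its point-and-click interface, so no programming is needed. Purely to illustrate the idea behind the regular-expression and export features, here is a minimal Python sketch that pulls a price out of captured text and writes the result as CSV. It is not WebHarvy's implementation; the sample text, the pattern, and the function names (extract_prices, to_csv) are all hypothetical.

```python
import csv
import io
import re

# Hypothetical text captured from product listings (made-up sample data).
CAPTURED = [
    "Acme Kettle - Price: $24.99",
    "Acme Toaster - Price: $39.50",
    "Acme Blender (price on request)",
]

# Assumed pattern for this sample: a dollar amount with two decimal places.
PRICE_RE = re.compile(r"\$(\d+\.\d{2})")


def extract_prices(lines):
    """Return one row per line; the price is empty when no match is found."""
    rows = []
    for line in lines:
        match = PRICE_RE.search(line)
        # Keep only the product name: text before " - " or before " (".
        name = line.split(" - ")[0].split(" (")[0]
        rows.append({"name": name, "price": match.group(1) if match else ""})
    return rows


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["name", "price"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_prices(CAPTURED)))
```

Rows without a matching price are kept with an empty field rather than dropped, so the exported file still lists every captured item.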