Tool Name:
Tool Version: 4.0.4.2
Tool Type: Shareware
Tool Cost In: 299 US$
Tool Target Platform: Windows
Tool OS Support: Win2000,WinXP,Win7 x32,Win7 x64,Windows 8,Windows 10,WinServer,WinOther,WinVista,WinVista x64
Limitations: Free C# developer license for testing and evaluation before deployment: http://ironsoftware.com/csharp/webscraper/licensing/iron-webscraper-eula-license.html
Tool Info URL: Click to view
Video 1: Link for download
Video 2: Link for download
Download 1: Click to download
Download 2: Click to download
|
|
Short Description: Iron Webscraper makes C# development of screen scraping and data-mining applications possible by providing a C#/VB interface for developers to write web scraping workflows that mimic human browsing behavior. Available as a .Net DLL & Nuget package.
Long Description 1: The C# web-scraper framework is a sophisticated set of C# screen scraping classes (also available on Nuget) which make extracting data form web applications and turning it into .Net objects, JSON, CSVs and Spreadsheets enjoyable. The Iron WebScraper DLL to your project brings in behind the scenes management of threading, document parsing, proxies, headers and cookies so that you can focus on linear scraping logic which is easy to code and debug.
Long Description 2: The web-scraper for C# allows .Net developers to create logical that extract content from web applications and turn it into JSON, spreadsheets, C# objects or even SQL using simple C# and Linq code.
Iron WebScraper is a web scraping library for the .Net 4.5 and Core platform which allows developers to use clean, simple logic to reverse any web resource back into C# objects or SQL. It can extract pages using set-by-step (if-this-then-that) workflows, effortlessly scraping and parsing html, javascript, xml, RSS, pdfs and office documents on the internet or local intranets back into useful structured data.
This leaves the developer with clean, efficient web-scraping applications which are easy to understand and debug.
The C# Web Scraping Library is extremely polite, ensuring that no domain or IP address has too many concurrent requests. It intelligently throttles both client and server side looking for excessive CPU usage and slowing to an appropriate pace. In addition, it can obey robots.txt directives including bot specific crawl rates and limitation. The exact urls and content types to be strapped can be set using logical workflows and regex/wildcard rules.
Screen-scraping is made easier with identity control, automatically managing threads, rate limits, urls, duplicates, retries, proxies, headers and cookies into a an army of virtual browser which can mimic human behavior and even client buttons, fill in forms or log in behind security walls. This is useful for migrating legacy systems, populating enterprise search facilities and for statistical competitive analysis
Full documentation, support and downloadable DLLS for the C# Web Scraper are available from http://ironsoftware.com/csharp/webscraper/ , in addition to links to a .Net 4.5+ Nuget package with full Azure and Mono compatibility.
|