About Octoparse

Octoparse is a cloud-based web data extraction solution that helps users extract relevant information from various types of websites. It enables users from a variety of industries to scrape unstructured data and save it in different formats including Excel, plain text and HTML.

Users can click an element on a web page to select the type of data to extract. Octoparse allows users to run multiple extraction tasks simultaneously. Tasks can be scheduled to run at regular intervals, or they can be run in real time. Users can also scrape product comments and reviews and social media channels to collect information on consumer sentiment.

Octoprase's Wizard mode provides users step-by-step instructions to extract data, while the Advanced mode provides advance...


Read More

Supported Operating System(s):

Windows 7, Windows XP, Windows 8, Windows 10

55 Reviews of Octoparse

Average User Ratings

Overall

4.64 / 5 stars

Ease-of-use

4.5

Value for money

4.5

Customer support

4.5

Functionality

4.5

Ratings Snapshot

5 stars

(44)

44

4 stars

(6)

6

3 stars

(3)

3

2 stars

(0)

0

1 stars

(2)

2

Likelihood to Recommend

Not likely

Very likely

Showing 1 - 5 of 55 results

March 2017

Andy from Koillis-suomen puu LTD

Company Size: 501-1,000 employees

Industry: Commercial Real Estate

Time Used: Less than 6 months

Review Source: Capterra


Ease-of-use

5.0

Value for money

5.0

Customer support

5.0

Functionality

5.0

March 2017

Octoparse Review

The software is much easier to use, visually appealing, and on going customer support as well as tutorials have been created with the user in mind. Octoparse Web Scraper Experience: I have been looking professional web scraper for about two months now. I did try so many software's. Some was hidden mist! Most did not work at all. Then I did end up to get Octoparse web scraper! Wau! That cloud base software was exactly what I was looking for! This software really works. Software works even with some of the complex website. I definitely recommend! I use Octoparse on a daily basis and at my organization. there is no smoother way of web scraping! The software has never given me any issues. I think nobody can find better software to scrape data from web. Software It works exactly as expected. Octoparse is easy to use interface no experience scrapping websites is needed - but can do a lot. Octoparse software It has enabled me to ingest a large number of data point and focus my time on statistical analysis vs. data collection. It has safe me some much time! Same jobs would take me hours before and now data is collected in few minutes! When I need a quick way to grab structured web data, Octoparse software will be my first choice. It took time to learn the tool, but when you master it - there are lot's of powerful features. In just few days I managed to extract product information from thousands of products with very little effort! Using Octoparse to scrape a lot of data we needed was MUCH faster than custom building any solution. The user interface is intuitive, pricing very reasonable, and support was outstanding! To put it simply, if you've ever found a website where you wished that you could copy/paste hundreds of records from. Then Octoparse is your solution. It can automatically collect complete content structures such as product catalogs or search results. It's very user-friendly, yet sophisticated enough to extract data from highly dynamic websites. Octoparse software is data extraction tool that anyone can use to get data from the web. You'll never have to write a web scraper again and can easily create APIs from websites that don't have them. Octoparse software can handle interactive maps, calendars, search, forums, nested comments, infinite scrolling, authentication, dropdowns, forms, Javascript, Ajax and much more. Octoparse software is really the tool for your company!

Pros

easy to use.

Cons

Cost

March 2017

F. from C. Diffusion

Company Size: 2-10 employees

Industry: Retail

Time Used: Less than 6 months

Review Source: Capterra


Ease-of-use

4.0

Value for money

4.0

Customer support

4.0

Functionality

5.0

March 2017

I wish I had discovered this jewel years ago...

I have been crawling and parsing websites for a while, with use of php and cUrl. Years after years, it sounded clear that my extracting routines running on my server were more and more difficult to maintain in a good working shape. In fact, websites regularly change minor things on their pages, and in the best case, you wouldn't get anymore some or all of the awaited data, in the worse case, absolutely inaccurate data. Then came for me (and I must admit, my limited skills) THE hammer : AJAX ! Yes, html + Javascipt + css + dom... and the dynamic pages that don't load at first sight, that wait for you to click on a button, that just show as you scroll down, that exchange static pictures urls with javascipt dynamically shown pictures.. In two word : a nightmare ! So, I had to find a way to still be able to extract my needed data, without having to pass an engineer degree in information technology... had to be fast, had to be robust ! I gave a try to some scraping tools, and my final choice was made to Octoparse. Several reasons for it : easy to set up lots of tutorials to start easily Ajax is handled as easy as a basic html url... as if it wouldn't be any ajax routines on the pages. It's really what make me give a try... because I was unable to access the most important part of the data I needed... hidden behind an 'Display' Ajax button that I wasn't able to deal with (with php / cUrl) 10 tasks are offered for free, and as far I know, won't be public tasks as it's the case with some of Octoparse competitors Smart Mode and Wizard mode make it easy to find the data, often at first sight. Sometimes you need to find alternate ones... but Octoparse tries to do it for you. But of course, the Advanced Mode is the most important part ... and you don't need to start with it : Start with smart, or with wizard, and then Edit in Advanced Mode... and extract with accuracy what you need. I've been using kind of Xpath for years with php... but here, its easy and clear. You can even save a data extraction configuration files, to be used in new project, or elsewhere. The only drawback I have noticed, is that Octoparse uses mostly children/children/children xpath ways, that seems, to me, less robust than locations with specific attributes like class, id, or others, when Wizard Mode is used. But you can make it more robust and edit it in the advanced mode. It should definitively help me to gain a lot of time... and money (as far as I'm able to set up the APIs

Pros

Barely, you can start to use it easily without never having heard about xPath

Cons

Not one single API link in free mode, not one possibility to upload a single - even limited - task in the cloud, to test the speed difference with local extraction...

August 2019

Masita Dwi Mandini from Infimap

Verified Reviewer

Company Size: 11-50 employees

Industry: Research

Review Source: Capterra


Ease-of-use

1.0

Value for money

1.0

Customer support

1.0

Functionality

1.0

August 2019

Dont ever try the trial version

The trial version likes to create a trap for their user. They send a notification on the expired trial version on 7 days before and 2 minute before. For me is crazy to get 2 minute before notification. So if you kind a person how likes to forget the expired time of your trial version don't ever try this app, not worthty.

Pros

They have some template on crawling specific website

Cons

I try to crawl some literature databased from google scholar using their template, but it doesn't work well since google detected it as a robot. So totally crawl all the data was imposible, the system will stopped before finished the entired data crawling.

June 2020

Sarah from Phoenix Finders Group

Company Size: 2-10 employees

Industry: Consumer Services

Time Used: Less than 2 years

Review Source: Capterra


Ease-of-use

4.0

Value for money

4.0

Customer support

5.0

Functionality

5.0

June 2020

Great for data scraping

Octoparse lets our company free up time to do other projects. We are able to set up data projects in Octoparse and it goes right to work doing what we could never do in the amount of time it takes. We love using Octoaparse for our data scraping needs.

Pros

There are a lot ways to scrape data from several different sites. Ocotparse also offers lots of tutorials if you are unsure of how to scrape data from certain sites.

Cons

There are some features that might be hard to understand how to use if you do not have a basic understanding of coding. Ocotparse does provide many how to guides to help those that do not have a good background in coding.

March 2017

Andrew from Private

Time Used: Free Trial

Review Source: Capterra


Ease-of-use

5.0

Value for money

4.0

Customer support

5.0

Functionality

5.0

March 2017

Easy to start using, no coding required.

It took me about a day to look into all available web scrapers. At the end stopped on Octoparse for couple reasons. Pros: - Installs on Windows, so I could use spare Windows Server for scraping. No nodejs learning or programming needed. - GUI was simple to understand, can dump a list of links that need to be scraped, select content on the page that needs to go into Excel spreadsheet and click start. That's it, no need to select specific HTML divs or write regex code. Don't know how, but this was the only scraper that could analyze and grab a specific text on the page without setting any rules, the other scrapers I've tried had a hard time and had to make complicated rules. - During scraping opens the pages in a real browser, so Javascript, AJAX websites would work as well. - You can export to Excel, directly to SQL, MYSQ or Oracle database, CSV, TXT or HTML file. - You can also back up your scraped data to Octoparse as a backup, will be saved with your task. - Configuration and scraper apps run in different programs. If one suddenly would to shut down because of some error, other Octoparse tasks would still continue to work as nothing has happened. Cons: - Had a hard time adding a list of 50000 links into the queue, but not a problem because you can have multiple tasks 30-40K links in my case, just divide links between those tasks. - Did not say anywhere that it was saving the tasks to their servers, so that's why probably has trouble with large tasks. On the other hand, this one is also a Pro, because you can create tasks on your computer and load them up on your server just by restarting the app. Overall: You can have 2 active tasks running at the same time for free, if you want more, you can upgrade to a paid version. It takes about a second to open a page, so roughly you can scrape one page per second per task. Overall this worked better than great. Did not have to ask our devs to write a scraper, the time I spent creating the scraper would be the same amount of time I would spend discussing with our devs how to scrape the content. And now devs are asking me for stats on scraped data, not the other way around. If you do any marketing and wish to gather data for stats or just create your database from any website, super easy to do, recommend it.