Table of Contents
IntroductionWhen dealing with web scraping, the most challenging part you may face is getting the exact data you want to scrape. You may want, for instance, to scrape data from the Google search results page for certain keywords and set up separate projects for each keyword. But a simpler way is to use a tool that will automate web scraping of the Google search results and help you extract data for each keyword. Depending on your project’s tasks and complexity, you can try a free keyword scraper available, a paid solution, or a custom tool developed specifically for your needs. We are going to check how to search up keywords on a website with the help of ParseHub, Octoparse, and WebHarvy tools.
Web Scraping Based on Keywords with ParseHubFirst of all, you need to install ParseHub, it’s free. The next step is to set up a project in it: Click on “New Project”, indicate the URL of the site you are going to scrape data from. With the help of settings, in the “Starting Value” section, define the keyword list to search through. Then go back to your project with “Back to Commands”, click the sign “+” that is next to your “page” selection, click on “Advanced” and then on the “Loop” command. Click “+” next to “For each item in keywords” and select “Begin New Entry”. This command will by default be given a name “list1”, then click “+” next to the “list1” tool and then “Select”. With this command, click directly on the search bar of the targeted site to select it. Then make the scraper add the keyword for each result next to it (click on the “+” sign next to the command “list1” and select the “extract” command).
Get Keywords from Website with OctoparseAt times, you may face the need to interact with a site when extracting data. It is especially actual for the cases when you need a list of keywords to be searched through, but you don’t want to enter them into a search box one by one. With the built-in Octoparse browser, you can interact with the web page by pointing and clicking, just like using any ordinary browser. The basic steps to input the text presuppose the following actions:
- Selecting the input field on the webpage of the browser.
- “Enter text” selection.
- Entering keywords into “Text list” to generate a loop search action. Every word will be automatically entered in the list into the search box, one at a time.
- By Clicking on the “Loop Item” box one can see the keywords that have been put displayed in “Loop Item”.
- Then choosing a keyword, and clicking on the “Enter Text” you can see that the chosen word is put in the search box.
- Clicking on “Click Item”.