How-To: Extract HTML Page Titles with Inspyder Power Search
A quick guide to capturing all the page titles on a website.
Step 1: Enter your Root URL
Run Power Search and enter the URL of the website you want to work on. For this
example, we'll use 'www.inspyder.com'.
Step 2: Enter your Query
Enter the following into the Query field (no quotes): "<title>#PageTitle#</title>".
Set the Query mode as Wildcard.
This query will extract any text found on each webpage between the <title>
tags
and store it into a field in the results called "PageTitle".
Later, we can export the results and open them in Excel for further analysis.
Step 3: Configure Additional Settings
Ensure that "Ignore Case" and "Include HTML" Query Options are checked. Ignore case
tells Power Search to match the <title> tags
in upper or lower case. Include HTML tells Power Search not to remove the HTML tags
before using our query on the page.
Step 4: Click Go!
Click the Go button to start the crawl. Power Search will go through each page on
the site and capture all the text found between HTML <title> tags. You can stop
the crawl at any time by clicking the red "X" button. When the crawl is completed
you can export the results by clicking "Project | Export Results..." from the main
menu.

More Tutorials