This page describes in detail how eValid Site Analysis processes a WebSite search. It assumes that you are already familiar with the terms and eValid Site Analysis options mentioned.

Basic Mapping Process
A WebSite is mapped by creating a worklist of all the URLs specified on the starting page. Each URL is then visited and, if it resides on the same WebSite/sub-WebSite as the starting page, it is in turn mapped and the URLs it contains are added to the worklist.

Any link that is OFF the starting WebSite/sub-WebSite is visited (to check for its existence) but is not mapped.

Processing continues from the current worklist until every URL has been visited once. At the end of the search process the completed worklist is used as the basis for the map reports.
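
The mapping loop described above can be sketched as a breadth-first walk over a worklist. This is only an illustration in Python, not eValid's actual implementation: the map_site function, the fetch_links callback, and the example.com URLs are all hypothetical, and "on the same WebSite/sub-WebSite" is assumed here to mean that the URL begins with the starting prefix.

```python
from collections import deque


def map_site(start_url, site_prefix, fetch_links):
    """Visit every reachable URL once; map only pages under site_prefix.

    fetch_links(url) is a hypothetical callback that returns the URLs
    found on a page. A real crawler would fetch and parse the page here.
    """
    worklist = deque([start_url])
    seen = {start_url}   # guarantees each URL is queued only once
    visited = []         # every URL visited, in visiting order
    mapped = []          # on-site URLs whose links joined the worklist

    while worklist:
        url = worklist.popleft()
        visited.append(url)
        if not url.startswith(site_prefix):
            # Off-site: visited to check existence, but not mapped.
            continue
        mapped.append(url)
        for link in fetch_links(url):
            if link not in seen:
                seen.add(link)
                worklist.append(link)
    return visited, mapped


# Hypothetical in-memory site standing in for real HTTP requests.
PAGES = {
    "http://example.com/docs/": [
        "http://example.com/docs/a.html",
        "http://other.example.org/",
    ],
    "http://example.com/docs/a.html": [
        "http://example.com/docs/",
        "http://example.com/docs/b.html",
    ],
    "http://example.com/docs/b.html": [],
    "http://other.example.org/": ["http://other.example.org/deep.html"],
}

visited, mapped = map_site(
    "http://example.com/docs/",
    "http://example.com/docs/",
    lambda url: PAGES.get(url, []),
)
```

In this sketch the off-site page is visited but its own link (deep.html) never enters the worklist, which mirrors the "visited but not mapped" rule.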

Limits to Mapping
The following settings impose or relax limits on the Basic Mapping Process:

  1. Choose between Search for All Links or Search with Limitations.

    DEFAULT: Search for All Links.

    This means that links with ALL EXTENSIONS and ALL BROWSER PROTOCOLS are searched.

  2. If Search with Limitations is selected there are two sub-choices: Limit by Extension/Query String and Limit by Protocols.

  3. Include Sites/Sub-Sites list:

    DEFAULT: (empty) (starting WebSite is implied)

    Additional WebSites or sub-WebSites that are to be searched can be added to this list.

    URLs that are on such WebSites will be treated as if they are on the starting WebSite, i.e. they will be visited AND mapped and their URLs will be added to the worklist.

    Example: If the starting page is on a sub-WebSite, the user could add the parent WebSite to the Include Sites/Sub-Sites list to ensure ALL pages on the WebSite are mapped, not just those on the sub-WebSite.

    As each page is mapped, the worklist is built up according to the above criteria.

    Whether a link is then visited (and in turn mapped to have its URLs added to the worklist) depends on the following criteria.

  4. "Excluded URLs"

    DEFAULT: (empty)

    This is a text file, specified by name in the SiteMap Preferences.

    Any string added to this file that is not a #comment is used to determine which URLs are to be excluded.

    Any URL which contains one or more of these strings is not visited and is marked on the worklist as [Excluded URL].

  5. "Do Not Visit off-WebSite links"

    If this is set to ON then URLs that are 'off-WebSite', according to the criteria above, are not visited. They are marked on the worklist as [Off-Site].
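
The way settings 3 to 5 combine can be sketched as a single classification step applied to each URL on the worklist. This is a hedged illustration, not eValid's code: the load_exclusions and classify functions and the example.com URLs are hypothetical, site membership is assumed to be a simple prefix match, comment lines are assumed to start with "#", and checking exclusions before the off-site test is also an assumption.

```python
def load_exclusions(text):
    """Return the exclusion strings from the contents of an Excluded URLs file.

    Assumption: blank lines and lines starting with '#' are ignored.
    """
    lines = (line.strip() for line in text.splitlines())
    return [line for line in lines if line and not line.startswith("#")]


def classify(url, start_site, include_sites=(), exclusions=(),
             skip_off_site=False):
    """Decide how a worklist URL is handled (hypothetical helper)."""
    # Setting 4: any URL containing an exclusion string is not visited.
    if any(fragment in url for fragment in exclusions):
        return "[Excluded URL]"
    # Setting 3: included sites are treated like the starting WebSite.
    on_site = any(url.startswith(prefix)
                  for prefix in (start_site, *include_sites))
    if on_site:
        return "visit and map"
    # Setting 5: off-site links are skipped only when the option is ON.
    if skip_off_site:
        return "[Off-Site]"
    return "visit only"


START = "http://example.com/docs/"
EXCLUSIONS = load_exclusions("# skip logout links\nlogout\n\n.pdf\n")

results = [
    classify("http://example.com/docs/a.html", START),
    classify("http://example.com/news/", START),
    classify("http://example.com/news/", START,
             include_sites=["http://example.com/"]),
    classify("http://example.com/docs/logout", START,
             exclusions=EXCLUSIONS),
    classify("http://other.example.org/", START, skip_off_site=True),
]
```

The third call shows the effect of the Include Sites/Sub-Sites list: a URL outside the starting sub-WebSite becomes mappable once its parent WebSite is included.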