|
General Introduction
eValid achieves site analysis through use of a built-in WebSite
"spider" that starts at any given page and systematically visits
all of the links reachable from there, applying various filters
to every page along the way.
(Read the Detailed Summary). eValid's
patented 3D-SiteMap makes it easy to understand how a web site is
organized and how all the pages relate to each other.
Root Selection, Blocking
Site Analysis of a WebSite always involves an initial URL -- the starting point of the mapping process.
From the starting URL the
eValid Site Analysis engine
downloads the initial page and then each page it points to, and each page each of them point to, etc.
This is a recursive descent of the sub-WebSite that starts at the initial URL.
Searches can take from several minutes to many hours, depending on the the complexity of the WebSite
and the
eValid Site Analysis engine
always produces a detailed and complete Site Analysis -- unless you interrupt the process or start it with constraints.
Protocols, Extensions, Domain Selection
The eValid Site Analysis engine handles all of the standard protocols and also
can have user-selected protocols added.
You can select the URL filename extensions that are to be searched.
And, you can specify non-Root domains to visit.
Constraining The Search
You can limit an eValid Site Analysis based on:
Maximum Search Depth. Analyze only the top two layers of a WebSite.
Search Time. Don't spend more than 15 minutes searching.
Search Page Count. Don't look at more than 1,000 pages.
Blocked URLs. Don't visit any URL that matches a pre-determined set of patterns.
Filters & Reports
The Site Analysis process can perform page by page analyses during the search process.
The filters you can apply include:
Slow Loading Pages. Find pages that load slower than a specified threshold.
Large Pages. Enumerate all pages that exceed a specified byte-count size.
Broken Links. List links that are broken or unavailable (error code 404 or higher)
Off-Site Pages. List pages that are "off-site" relative to the base URL.
Pages That Match A Pattern. Identify pages that satisfy a selected search criteria.