Initial Queue
This is considered a full site traversal.
Not including SSIs before parsing for links.
Not running CGI scripts.
Not checking user pages.
Not verifying remote URLs.
Ignoring These Local Docs if they are missing
/_self$|^$
Ignoring These Local Docs
/_self|^$
Ignoring These Remote URLs
www\.[Ee][Pp][Ll][Ee][Yy][Ss]\.com|^ftp:|^file:|^telnet:|^gopher:|^archie:
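As a rough illustration of how the remote-URL ignore pattern above could be applied, the following Python sketch tests a URL against it. The function name `is_ignored_remote` is hypothetical, and the use of an unanchored search (so that the `www\.…\.com` alternative can match anywhere in the URL) is an assumption about how the checker applies the pattern, not something this report states.

```python
import re

# Pattern copied verbatim from the report.
IGNORE_REMOTE = re.compile(
    r"www\.[Ee][Pp][Ll][Ee][Yy][Ss]\.com|^ftp:|^file:|^telnet:|^gopher:|^archie:"
)

def is_ignored_remote(url: str) -> bool:
    """Hypothetical helper: True if the URL matches the ignore pattern.

    Assumes unanchored (search) semantics; only the alternatives that
    start with ^ are tied to the beginning of the URL.
    """
    return IGNORE_REMOTE.search(url) is not None
```

Under these assumptions, any URL containing the site's own `www` host name, or starting with one of the listed non-HTTP schemes, would be skipped rather than verified.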
Critical Pages
The following documents are to be considered critical:
Red Flags
Flagging the absence of the following items:
| Regular Expression | File Types |
| <[Tt][Ii][Tt][Ll][Ee][\t >].+?</[Tt][Ii][Tt][Ll][Ee]> | \.s?html?$ |
| <[Bb][Oo][Dd][Yy][\t >].+?</[Bb][Oo][Dd][Yy]> | \.s?html?$ |
| <[Hh][Tt][Mm][Ll][\t >].+?</[Hh][Tt][Mm][Ll]> | \.s?html?$ |
| <[Hh][Ee][Aa][Dd][\t >].+?</[Hh][Ee][Aa][Dd]> | \.s?html?$ |
Not checking documents matching this pattern:
^$
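The red-flag settings above can be sketched in Python as follows: for each document whose name matches the file-type pattern, report every regular expression that finds no match in the document text. The names `RED_FLAGS`, `SKIP_PATTERN`, and `missing_items` are hypothetical. Letting `.` match newlines (`re.DOTALL`) is also an assumption, made so that an element spanning several lines still counts as present; the report does not say how the checker handles this.

```python
import re

# (item regex, file-type regex) pairs copied from the table above.
RED_FLAGS = [
    (r"<[Tt][Ii][Tt][Ll][Ee][\t >].+?</[Tt][Ii][Tt][Ll][Ee]>", r"\.s?html?$"),
    (r"<[Bb][Oo][Dd][Yy][\t >].+?</[Bb][Oo][Dd][Yy]>", r"\.s?html?$"),
    (r"<[Hh][Tt][Mm][Ll][\t >].+?</[Hh][Tt][Mm][Ll]>", r"\.s?html?$"),
    (r"<[Hh][Ee][Aa][Dd][\t >].+?</[Hh][Ee][Aa][Dd]>", r"\.s?html?$"),
]

# Documents matching this pattern are not checked; ^$ matches only an empty name.
SKIP_PATTERN = r"^$"

def missing_items(path: str, text: str) -> list:
    """Hypothetical helper: return the item regexes that are absent from text."""
    if re.search(SKIP_PATTERN, path):
        return []
    missing = []
    for item_re, file_re in RED_FLAGS:
        # Only apply a flag to documents of the listed file types.
        if not re.search(file_re, path):
            continue
        # Assumption: DOTALL, so elements spanning several lines are found.
        if not re.search(item_re, text, re.DOTALL):
            missing.append(item_re)
    return missing
```

Note that each pattern requires at least one character between the opening and closing tags (`.+?`), so an empty element such as `<title></title>` would be flagged as absent in this sketch.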