Explanation of LOCWWW
- The character sequence which followed the link of the homepage in arbitrary specific domains (host) one after another, and was specified is searched.
- The link to an external domain (host) is not followed.
- It is the real-time reference which does not use a database.
When there is much number of pages
Although processing time is taken, most is the time which communication with a server takes.
- It does not leak to a standard HTML tag and a link can be followed.
(Frame correspondence, proxy correspondence, Javascript, and page reload are un-corresponding)
- It is the original reference robot (Spider) made for [ of a strange server ] reference.
- Only a HTML document is set as the object of character sequence reference.
- The information on the link file of all kinds is simultaneously outputted to a log window.
When there is much information, depending on a browser, it may be unable to display on a log window.
In this case, it will be displayed if it rereads once it saves a page at a file.
- The dead link of a homepage can be checked.
- It corresponds to Shift JIS and the Japanese code system of JIS and EUC.
- It is processing in the single process.
- To a huge site, since load is large, please withhold reference.
Moreover, please abstain from reference which becomes a public trouble.
- 1 time of processing time is restricted in a maximum of 10 minutes.
- It can work by Windows, UNIX (Linux), Mac, etc.
- It can work also from the command line of a terminal.
- There is also a function which carries out all file downloads the whole directory class.
- There is also a function which downloads only URL specified by the list.
- Installation is also received in onerous or onerous (use in a public institution).
Please ask.
- I have this page linked to freedom, and it is splendid.
- Please send an opinion, comment, etc. freely.
- Work of software development is also invited.
|