How many bookmarks to the pages of a Web site? This script checks the popularity
of a site by finding the number of saved pages Del.icio.us, and calculates
the sum of all bookmarks.
It is therefore a means of assessing the popularity of a site providing it
is sufficiently important for have bookmarks on several pages.
This scripts depends upon the availability of the bookmarks site and takes a while to complete its tasks.
Update 2014: The script no long work because the site now blocks access to its data.
Popularity Checker operates in HTTP mode. It crawls the site starting from the home page and following internal links.
For each page of the site, it researches on Del.icio.sus the number of "saved". To do so it downloads the page of favorites and extract the number of saved with a regular expression.
The program detects internal broken links and displays an error in this
case. It knows how to interpret relative links, but it does not recognize
redirects. This can cause errors and false broken links which should not be
taken into account.
It is recommended to use absolute paths in internal links on its site, including
the http:// protocol and the domain.
The program also returns errors if it can not access the pages of bookmarks and that happens when the requests succeed too quickly, because access is then denied.
The command syntax is as follows:
php popche [options] url
The URL is the start page, normally the home page, in the form:
http://www.scriptol.com/index.php
Do not omit the slash in suffix or otherwise add the file name.
Options:
-r recursive, process linked pages. -v verbose, displays each page processed. -q quiet, no display.
A sub-directory can be reached in the form http://www.scriptol.com/design/ or http://www.scriptol.com/design/index.php. To avoid duplication, the program tries to reconstitute the full path by trying the usual names and extensions for a home page. This system can not take into account all possible cases.
Once the script is launched, you should start another task until it has finished. The program is not fast, and it should not be because the bookmarks site prohibits requests too close together.
The use of the script as a Web service is discouraged. The site Del.icio.us
temporarily denes access to its pages to any site that source them
too frequently, to avoid spam.
Even at command line, the program must not be used too often and its speed
must not be improved.
The archive contains the source code in Scriptol language and the PHP binary
code. It works on any computer where is installed the PHP 5 interpreter.
It is distributed under the GPL 2.0. You can use it without reserve and distribute
it provided you keep the copyright notice in the Scriptol code and the PHP
code.