Table of Contents

Portscout v0.8.1 - README

NAME

portscout - A tool to scan for new versions of FreeBSD ports.

SYNOPSIS

    portscout build
    while (lazy) {
        portscout rebuild
        portscout check
        portscout showupdates
    }

DESCRIPTION

Portscout is an automated system designed to search for new versions of software available in the FreeBSD ports tree. It is primarily designed for use by FreeBSD port maintainers, who can avoid trailing around dozens of websites looking for updates. However, I hope that others might find it useful too.

The current version of Portscout is also capable of checking OpenBSD's ports, NetBSD's pkgsrc, and also a generic list of software from an XML file.

SYSTEM REQUIREMENTS

The following software is required to run Portscout:

    - PostgreSQL or SQLite
    - POSIX-compatible system
    - The FreeBSD ports tree
    - Berkeley make
    - Perl 5.6+

Plus we need a few Perl modules:

    - URI
    - DBD::Pg or DBD::SQLite
    - Net::FTP
    - Proc::Queue
    - LWP::UserAgent
    - MIME::Lite
    - XML::XPath
    - XML::XPath::XMLParser

SQLite support is currently limited to non-forking mode only. That is, if you decide to use SQLite, Portscout will only check one port at a time; this will severely limit Portscout's speed/efficiency. It is therefore suggested that SQLite only be used for relatively light workloads.

INITIAL SET-UP

Initialise Database

The recommended database backend is PostgreSQL.

Option One: PostgreSQL

Create database:

    # createuser -U pgsql -P portscout
    # createdb -U pgsql portscout

Execute the included pgsql_init.sql script via psql:

    # psql portscout portscout < sql/pgsql_init.sql

This will create the database tables for you.

Option Two: SQLite

Create a database file with the included script:

    # sqlite3 /var/db/portscout.db < sql/sqlite_init.sql

Configure Portscout

Review portscout.conf, and check it suits your needs. The defaults should be reasonable for most people. You can reduce num_children and workqueue_size if you don't want Portscout sucking up all your resources.

Please note that Portscout's internal defaults differ from the defaults in portscout.conf - this is because without a config file, Portscout tries to be ``portable'' and use its own directory for storing things under, whereas if a config file is found, it assumes it is installed and being used ``system-wide''.

Any of the options in portscout.conf can also be set on the fly on the command line. For example:

    $ portscout --precious_data --num_children=8

Update Ports Tree (FreeBSD Only)

Ensure your ports tree is up to date.

Populate Database

We need now to populate the database with the software we want to check.

Option One: FreeBSD

If you're using Portscout to check FreeBSD ports, run:

    $ portscout build

This takes around 70 minutes for me. Basically, Portscout is extracting all the information it needs from the ports tree. Ports (by virtue of make) is slow; the database we're building is much faster. After this initial build, we will do incremental 'builds', only updating what has changed.

Option Two: Other Software Repositories

If you would like to check another software repository/source, Portscout has several options.

Firstly, NetBSD and OpenBSD's ports trees are supported by the standard ``Ports'' backend. See portscout.conf for details on how to configure this. Make sure you've got the correct make at hand for Portscout if you're checking either of these from another operating system (e.g. FreeBSD).

Caveat: neither of the above have been well-tested, and support should probably be considered experimental.

Secondly, you can use the ``XML'' backend for a finite list of software that you want to manage by hand. See Portscout::DataSrc::XML for more information.

REGULAR OPERATION

Update Ports Tree (FreeBSD Only)

Ensure your ports tree is up to date.

Incremental Database Update

If your ports tree / data source was updated since your last build/rebuild, ensure Portscout knows about the changes:

    $ portscout rebuild

Run Version Checks

    $ portscout check

This will instruct Portscout to search for new distfiles for each port in the database. It will take several hours for a complete ports tree scan.

View Results

Now you've got some results, you can view them.

Option One: HTML Reports

    $ portscout generate

This will put HTML pages inside html_data_dir - existing pages will be deleted.

Option Two: E-Mail Reports

    $ portscout mail

This will send out an e-mail message to the maintainers of ports with updates. The e-mail messages are opt-in; you will need to add addresses to the database before any e-mails are sent out.

Option Three: Console Summary

    $ portscout showupdates

This will output a summary of software with outstanding updates. It is recommended if you're checking a limited set of software/ports.

ADDING E-MAIL ADDRESSES

If you want to send e-mail reports to maintainers of updated of software, the e-mail addresses need to be registered with Portscout. This is a safeguard to ensure no one gets e-mails they don't want.

Use the following to manage these e-mail ``subscriptions'':

    $ portscout add-mail dave@example.net
    $ portscout remove-mail john@localhost
    $ portscout show-mail

UPGRADING

When upgrading, check the sql directory for any relevant database schema upgrade scripts. If there were multiple schema updates between the previous version of Portscout and the version to which you have upgraded, be sure to run each script in sequence to arrive at the latest database version.

CHECKING ALGORITHM

For anyone interested in how Portscout operates, here is a high-level summary of the checking algorithm in use:

 Test 1:
   1) Order master sites using previous reliability data.
   2) Attempt to get an FTP listing or web server index from each site.
   3) Extract version from files found; compare to current version.
   4) Skip other tests if new or current version is found.
 Test 2:
   1) Increment each part of the port's version string and attempt to
      download file, e.g. for 1.4.2, try 2.0.0, 1.5.0 and 1.4.3

The last test is not yet included in Portscout, but I may add it at some point, depending on the results of testing:

 Test 3:
   1) Locate port's WWW page and spider site in an attempt to find a
      page that looks like a "download area". Scan page for possible
      links to new files.

BUGS AND LIMITATIONS

Portscout tries to make a reasonable guess when it encounters version strings in a different format to the original distname (e.g. 3.2, 3.6-pre7), but this is difficult and error-prone since vendor version schemes vary wildly.

The only real problem at the moment is version strings which seem to to count backwards (e.g. 2.11 -> 2.2).

There's some difficulty in deciding what to do with trailing zeros in version guesses. Currently, they are left intact, but this is not always going to be the right course of action. In other words, from 4.3.9, will the next major version be 4.4.0 or 4.4?

The restrict_* variables don't affect generate/mail.

Portscout doesn't handle ports with multiple distfiles very well.

At least one port (archivers/zip, as of 2010-04-28) doesn't provide a version string in the vendor's format. portscout doesn't know what to do in this case, although the version could theoretically be ascertained from the distfile name.