Skip to main content
PyCon US is happening May 14th-22nd in Pittsburgh, PA USA.  Learn more

Liches a server for linkchecker results

Project description

Introduction
==================

In fantasy fiction, a lich is a type of undead creature. (wikipedia_)
Liches is used to find dead links in your website so I thought this name
was appropriate (and a nicer explanation than liches stands for LInk Checker
Server).

Liches wraps the linkchecker_ output into a web interface.
You can run Liches as a stand-alone service for use with your websites
regardless of the language they're written in. While Liches itself is
written in Python, it interacts with your website purely via HTTP and
JSON_. You can even integrate it with pure javascript without the need of
server side programming or use it as a stand alone service.
An example how to integrate it into a website can be found at
https://github.com/collective/collective.liches

Rational
--------

While linkchecker_ itself can produce html this results in a single
monolithic page with all results included. This can be intimidating
and is from a usability point of view suboptimal. Liches presents
the results agregated per page, so you get a quick overview of how
many pages in your site have broken links and for each page an overview
which links are broken. You can also filter the results for a specific
error message so you can narrow down and prioritize which errors to deal
with first.

You can configure your linkchecks and invoke linkchecker through the
webinterface.

Install
=======

Prerequisites
-------------

It is strongly recommended to install Liches in a virtualenv_

::

mkdirliches virtualenv --no-site-packages liches/
cdliches/Inthisvirtualenvyoucaninstalllichesforproductionordevelopment.InstallforproductionInthevirtualenvyoucreatedaboveexecutethesecommands:: wget http://github.com/downloads/wummel/linkchecker/LinkChecker-8.4.tar.xz
xzdLinkChecker8.4.tar.xz bin/pip install LinkChecker-8.4.tar
bin/pipinstallliches wget https://raw.github.com/cleder/liches/master/production.ini
bin/initializelichesdbproduction.iniUsername[admin]:Password[generate]:(passwordgenerated:,whmgfi6r)Fullname:AdministratorEmail:root@localhost bin/pserve production.ini

Username and password are written to the file 'password.txt'. Delete this
file after installation.

Install for development
------------------------

In the virtualenv you created above execute these commands:

::

wgethttps://raw.github.com/cleder/liches/master/buildout.cfg mkdir buildout-cache
mkdirbuildoutcache/eggs mkdir buildout-cache/downloads
bin/easyinstallusetuptools wget http://python-distribute.org/bootstrap.py
bin/pythonbootstrap.py bin/buildout
rmbuildout.cfg ln -s src/liches/buildout.cfg
lnssrc/liches/development.ini bin/initialize_liches_db development.ini
Username [admin]:
Password [generate]:
('password generated: ', 'whmgfi6r')
Fullname: Administrator
Email: root@localhost
Double subscripts: use braces to clarify bin/liches_linkchecker development.ini

This command will call linkchecker_ for all the checks you have enabled
and import the results into the database. For regular linkchecks you can
call this command as a cron job.


Manual Import
--------------
Check a site for bad links with e.g:

::

bin/linkcheckerfileoutput=csvpause=3nowarningshttp://localhost/index.htmlPleaserefertothelinkcheckermanualforusage.Importtheoutputproducedbylinkcheckerintoliches:: bin/import_liches_csv production.ini

View the results
-----------------

Open `http://localhost:6543/` in your browser to see the results. The
frontpage tells you how many pages with broken urls are in your site.
Click on the link *'You have XYZ pages with broken links'* to view the
pages at `http://localhost:6543/getpages`.

At `http://localhost:6543/getpages?format=json` you can access the data
in JSON_ format.

.. image:: https://raw.github.com/cleder/liches/master/docs/liches-brokenpages.png

The links will take you to a page with detailed results for this page e.g.
`http://localhost:6543/checkurl?url=http://localhost/index.html`
which can also be accessed as JSON_
`http://localhost:6543/checkurl?url=http://localhost/index.html&format=json`

.. image:: https://raw.github.com/cleder/liches/master/docs/liches-brokenlinks.png

.. _linkchecker: http://wummel.github.io/linkchecker/
.. _virtualenv: http://www.virtualenv.org/
.. _JSON: http://www.json.org/
.. _wikipedia: https://en.wikipedia.org/wiki/Lich


Changes
========

0.7 (2013/11/28)
----------------

- remove linkchecker dependency so you can `pip install -U liches`
- pyramid 1.5 compatible (if you are using pyramid < 1.5 use liches 0.6)

0.6 (2013/11/28)
----------------

- add ability to invoke linkchecker in the webinterface
- add a broken links count to each page in overview


0.5 (2013/08/29)
-----------------

- add ability to show invalid pages only
- inititialize_liches_db writes a file 'password.txt' with user:password

0.4 (2013/08/12)
----------------

- bug fixes
- add ability to filter pages by error


0.3 (2013/08/07)
-----------------

- add script to check all linkchecks
- add linkcheck contenttype
- add authentication
- add form to change password
- add user add form


0.2 (2013/07/31)
-----------------

- Add script to empty db
- fixes
- more documentation
- add license


0.1 (2013/07/31)
----------------

- Initial version

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page