skip to navigation
skip to content

warc 0.2.1

Python library to work with ARC and WARC files

WARC (Web ARChive) is a file format for storing web crawls.

This warc library makes it very easy to work with WARC files.:

import warc
f ="test.warc")
for record in f:
    print record['WARC-Target-URI'], record['Content-Length']


The documentation of the warc library is available at


This software is licensed under GPL v2. See LICENSE file for details.

File Type Py Version Uploaded on Size
warc-0.2.1.tar.gz (md5) Source 2012-05-15 17KB