Skip to main content

a python module to parse a Microsoft Word form in docx format, and extract all field values with their tags into a dictionary.

Project description

pywordform: a python module to parse a Microsoft Word form in docx format, and extract all field values with their tags into a dictionary.

Project website: http://www.decalage.info/python/pywordform

INSTALLATION:

  • on Windows, launch install.bat

  • on other systems, launch: setup.py install

HOW TO USE THIS MODULE:

Open sample_form.docx in MS Word, and edit field values.

From the shell, extract all fields with tags:

> python pywordform.py sample_form.docx field1 = “hello, world.” field2 = “hello,” field3 = “value B” field4 = “04-03-2012”

In a python script:

import pywordform fields = pywordform.parse_form(‘sample_form.docx’) print fields

=> this returns a dictionary of field values indexed by tags.

See http://www.decalage.info/python/pywordform See main program at the end of the module, and also docstrings.

LICENSE:

See LICENSE.txt.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pywordform-0.02.zip (23.3 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page