html2text 2014.4.5

Turn HTML into equivalent Markdown-structured text.

Latest Version: 2015.6.21

# html2text

html2text is a Python script that converts a page of HTML into clean, easy-to-read plain ASCII text. Better yet, that ASCII also happens to be valid Markdown (a text-to-HTML format).
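
To make the idea of "HTML in, Markdown out" concrete, here is a toy sketch built on the standard library's `html.parser` (Python 3). It is not html2text's implementation: the class `TinyMarkdownConverter` and the function `tiny_html_to_markdown` are hypothetical names invented for illustration, and only `<p>`, `<em>`, `<strong>`, and `<a href>` are handled.

```python
from html.parser import HTMLParser


class TinyMarkdownConverter(HTMLParser):
    """Toy converter (hypothetical): handles only <p>, <em>, <strong>, <a href>."""

    def __init__(self):
        super().__init__()
        self.parts = []   # output fragments, joined at the end
        self.hrefs = []   # stack of href values for currently open <a> tags

    def handle_starttag(self, tag, attrs):
        if tag == "em":
            self.parts.append("_")
        elif tag == "strong":
            self.parts.append("**")
        elif tag == "a":
            # Remember the target so the closing tag can emit "](target)".
            self.hrefs.append(dict(attrs).get("href") or "")
            self.parts.append("[")

    def handle_endtag(self, tag):
        if tag == "em":
            self.parts.append("_")
        elif tag == "strong":
            self.parts.append("**")
        elif tag == "a" and self.hrefs:
            self.parts.append("](%s)" % self.hrefs.pop())
        elif tag == "p":
            # A closed paragraph becomes a blank-line separator.
            self.parts.append("\n\n")

    def handle_data(self, data):
        self.parts.append(data)


def tiny_html_to_markdown(html):
    """Convert a small HTML fragment to Markdown-style text."""
    converter = TinyMarkdownConverter()
    converter.feed(html)
    converter.close()
    return "".join(converter.parts).strip() + "\n"


print(tiny_html_to_markdown(
    "<p>Hello, <strong>world</strong>. See <a href='http://example.com/'>this</a>.</p>"
))
```

The real script covers far more of HTML (lists, headings, images, wrapping, and so on); the sketch only shows why the resulting plain text can double as valid Markdown.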

Usage: [(filename|url) [encoding]]

<kbd>--version</kbd> show program’s version number and exit
<kbd>-h, --help</kbd> show this help message and exit
<kbd>--ignore-links</kbd> don’t include any formatting for links
 don’t include any formatting for images
<kbd>-g, --google-doc</kbd>
 convert an html-exported Google Document
<kbd>-d, --dash-unordered-list</kbd>
 use a dash rather than a star for unordered list items
<kbd>-b <var>BODY_WIDTH</var>, --body-width=<var>BODY_WIDTH</var></kbd>
 number of characters per output line, 0 for no wrap
<kbd>-i <var>LIST_INDENT</var>, --google-list-indent=<var>LIST_INDENT</var></kbd>
 number of pixels Google indents nested lists
<kbd>-s, --hide-strikethrough</kbd>
 hide strike-through text; only relevant when -g is specified as well

Or you can use it from within Python:

    import html2text

    print(html2text.html2text("<p>Hello, world.</p>"))

Or with some configuration options:

    import html2text

    h = html2text.HTML2Text()
    h.ignore_links = True  # omit link formatting from the output
    print(h.handle("<p>Hello, <a href=''>world</a>!"))

_Originally written by Aaron Swartz. This code is distributed under the GPLv3._

## How to install

html2text is available on PyPI:

` $ pip install html2text `

## How to do a release

  1. Update the version in
  2. Update the version in
  3. Run python sdist upload

## How to run unit tests

python test/ -v

| File | Type | Py Version | Uploaded on | Size |
| --- | --- | --- | --- | --- |
| html2text-2014.4.5.tar.gz (md5) | Source | | 2014-04-05 | 24KB |