pdfminer 20090330
Newer version available (20191125)
Released:
PDF parser and analyzer written entirely in Python.
Navigation
Unverified details
These details have not been verified by PyPIProject links
Meta
- License: MIT License (MIT/X)
- Author: Yusuke Shinyama
- Tags pdf, html, text, extraction, conversion, data mining
Classifiers
- Development Status
- Environment
- Intended Audience
- License
- Natural Language
- Topic
Project description
PDFMiner is a suite of programs that aims to help extracting or analyzing text data from PDF documents. Unlike other PDF-related tools, it allows to obtain the exact location of texts in a page, as well as other layout information such as font size or font name, which could be useful for analyzing the document. It can be also used as a basis for a full-fledged PDF interpreter.
Project details
Unverified details
These details have not been verified by PyPIProject links
Meta
- License: MIT License (MIT/X)
- Author: Yusuke Shinyama
- Tags pdf, html, text, extraction, conversion, data mining
Classifiers
- Development Status
- Environment
- Intended Audience
- License
- Natural Language
- Topic