No project description provided
Project description
PyLaia
PyLaia is a device agnostic, PyTorch based, deep learning toolkit for handwritten document analysis.
It is also a successor to Laia.
Get started by having a look at our Documentation!
Installation
To install PyLaia from PyPi:
pip install pylaia
Please note that the CUDA version of nnutils (nnutils-pytorch-cuda
) is installed by default. If you do not have a GPU, you should install the CPU version (nnutils-pytorch
).
The following Python scripts will be installed in your system:
pylaia-htr-create-model
: Create a VGG-like model with BLSTMs on top for handwriting text recognition. The script has different options to customize the model. The architecture is based on the paper "Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?" (2017) by J. Puigcerver.pylaia-htr-train-ctc
: Train a model using the CTC algorithm and a set of text-line images and their transcripts.pylaia-htr-decode-ctc
: Decode text line images using a trained model and the CTC algorithm. It can also output the char/word segmentation boundaries of the symbols recognized.pylaia-htr-netout
: Dump the output of the model for a set of text-line images in order to decode using an external language model.
Contributing
If you want to contribute new feature or found a text that is incorrectly segmented using pySBD, then please head to CONTRIBUTING.md to know more and follow these steps.
- Fork it ( https://gitlab.teklia.com/atr/pylaia/-/forks/new )
- Create your feature branch (
git checkout -b my-new-feature
) - Commit your changes (
git commit -am 'Add some feature'
) - Push to the branch (
git push origin my-new-feature
) - Create a new Merge Request ( https://gitlab.teklia.com/atr/pylaia/-/merge_requests/new )
Code of conduct
We are committed to providing a friendly, safe and welcoming environment for all. Please read and respect the PyLaia Code of Conduct.
Acknowledgments
Work in this toolkit was financially supported by the Pattern Recognition and Human Language Technology (PRHLT) Research Center.
BibTeX
@misc{puigcerver2018pylaia,
author = {Joan Puigcerver and Carlos Mocholí},
title = {PyLaia},
year = {2018},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/jpuigcerver/PyLaia}},
commit = {commit SHA}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.