Accessing and processing data from the DFG-funded SPP Computational Literary Studies
Project description
This repository contains pyhton code for working with the source data (SPP-CLS_AnnotationTables_Data)
Installation
Setup an virtual environment, if necessary:
python3 -m venv env
source env/bin/activate
Install dependencies:
pip install -r requirements.txt
python -m spacy download de_core_news_lg
cls.py
tokenise.py
TODO: fix character offset to be byte instead
check.py
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
sppcls-0.0.1.tar.gz
(2.9 kB
view hashes)
Built Distribution
sppcls-0.0.1-py3-none-any.whl
(3.9 kB
view hashes)