A library for Natural Language Processing style preprocessing for source code in different languages
Project description
sourCy
A library for NLP-like preprocessing of source code in different programming languages. The architecture follows that of spaCy.
Free software: GNU General Public License v3
Documentation: https://sourcy.readthedocs.io.
Features
Grammar-based parsing
Multi-language support
Usage
To use sourCy in a project:
import sourcy

code = """
# This function prints the factors of the argument passed
def print_factors(x):
    print("The factors of", x, "are:")
    for i in range(1, x + 1):
        if x % i == 0:
            print(i)

num = 320
print_factors(num)
"""

# Create a pipeline to process source code
scp = sourcy.load("python")

# Process the code and create a document with the tokens and their annotations
doc = scp(code)
for token in doc:
    print(token.token, token.annotation, token.block)
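To give a rough idea of what a stream of annotated tokens looks like without installing sourCy, the sketch below uses Python's standard-library tokenize module as a stand-in. It is not sourCy's implementation, and its output is not claimed to match what sourCy produces. The helper name tokenize_source and the (text, kind, line) tuple layout are assumptions made for illustration only.

```python
import io
import tokenize

# Token types that only carry layout information and are skipped here.
LAYOUT_TYPES = {
    tokenize.NL,
    tokenize.NEWLINE,
    tokenize.INDENT,
    tokenize.DEDENT,
    tokenize.ENDMARKER,
}


def tokenize_source(code):
    """Return (text, kind, line) triples for the tokens in Python source.

    Hypothetical helper for illustration; it is not part of sourCy.
    """
    result = []
    for tok in tokenize.generate_tokens(io.StringIO(code).readline):
        if tok.type in LAYOUT_TYPES:
            continue
        # tok_name maps the numeric token type to a readable label.
        result.append((tok.string, tokenize.tok_name[tok.type], tok.start[0]))
    return result


sample = "num = 320\nprint_factors(num)\n"
tokens = tokenize_source(sample)
for text, kind, line in tokens:
    print(text, kind, line)
```

Each printed row pairs a token with a coarse label and its line number, loosely analogous to iterating over a document and reading an attribute per token as in the usage example above.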
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.
History
0.1.0 (2020-08-11)
First release on PyPI.
Project details
Source Distribution: sourcy-0.2.2.tar.gz (91.3 kB)