grako

Grako takes a grammar in a variation of EBNF as input, and outputs a memoizing PEG/Packrat parser in Python.

Project description

At least for the people who send me mail about a new language that they’re designing, the general advice is: do it to learn about how to write a compiler. Don’t have any expectations that anyone will use it, unless you hook up with some sort of organization in a position to push it hard. It’s a lottery, and some can buy a lot of the tickets. There are plenty of beautiful languages (more beautiful than C) that didn’t catch on. But someone does win the lottery, and doing a language at least teaches you something.

Dennis Ritchie (1941-2011): Creator of the C programming language and of Unix

Grako

NOTE: DEVELOPMENT OF GRAKO HAS STOPPED

If you’re interested in the technology, or in future developments such as proper support for left recursion, then please follow the Tatsu project.

Grako (for grammar compiler) is a tool that takes grammars in a variation of EBNF as input, and outputs memoizing (Packrat) PEG parsers in Python.

Grako can also compile a grammar stored in a string into a Grammar object that can be used to parse any given input, much like the re module does with regular expressions.

Grako is different from other PEG parser generators:

Generated parsers use Python’s very efficient exception-handling system to backtrack. Grako-generated parsers simply assert what must be parsed. There are no complicated if-then-else sequences for decision making or backtracking. Memoization allows going over the same input sequence several times in linear time.
Positive and negative lookaheads, and the cut element (with its cleaning of the memoization cache) allow for additional, hand-crafted optimizations at the grammar level.
Delegation to Python’s re module for lexemes allows for (Perl-like) powerful and efficient lexical analysis.
The use of Python’s context managers considerably reduces the size of the generated parsers, which improves code clarity and CPU-cache hit rates.
Include files, rule inheritance, and rule inclusion give Grako grammars considerable expressive power.
Automatic generation of Abstract Syntax Trees and Object Models, along with Model Walkers and Code Generators, makes analysis and translation approachable.
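
As a rough illustration of the points above, here is a minimal sketch of how exceptions, context managers, the re module, and a memoization table can combine in a PEG parser. It is not Grako's actual run-time code: every name in it (MiniParser, rule, option, choice, FailedParse, OptionSucceeded, Calc, parse) is hypothetical and chosen only for this example, and the toy grammar is assumed to be `expr = term {"+" term}` with `term = number | "(" expr ")"`.

```python
import functools
import re
from contextlib import contextmanager


class FailedParse(Exception):
    """Raised when the input does not match what a rule asserts."""


class OptionSucceeded(Exception):
    """Raised to leave a choice block as soon as one option has matched."""


def rule(method):
    """Memoize a rule's outcome by (rule name, start position)."""
    @functools.wraps(method)
    def wrapper(self):
        key = (method.__name__, self.pos)
        if key not in self.memo:
            try:
                self.memo[key] = (method(self), self.pos)
            except FailedParse as failure:
                # Remember failures too, together with the start position.
                self.memo[key] = (failure, key[1])
        outcome, self.pos = self.memo[key]
        if isinstance(outcome, FailedParse):
            raise outcome
        return outcome
    return wrapper


class MiniParser:
    def __init__(self, text):
        self.text = text
        self.pos = 0
        self.memo = {}

    def token(self, pattern):
        # Lexemes are delegated to re, matched at the current position.
        match = re.compile(r"\s*(" + pattern + ")").match(self.text, self.pos)
        if match is None:
            raise FailedParse(f"expected {pattern!r} at position {self.pos}")
        self.pos = match.end()
        return match.group(1)

    @contextmanager
    def option(self):
        # Try one alternative; on failure, rewind so the next one can run.
        start = self.pos
        try:
            yield
        except FailedParse:
            self.pos = start
        else:
            raise OptionSucceeded()

    @contextmanager
    def choice(self):
        # Swallow the signal that one of the enclosed options matched.
        try:
            yield
        except OptionSucceeded:
            pass


class Calc(MiniParser):
    @rule
    def expr(self):
        value = self.term()
        while True:  # zero or more repetitions of: "+" term
            start = self.pos
            try:
                self.token(r"\+")
                value += self.term()
            except FailedParse:
                self.pos = start
                return value

    @rule
    def term(self):
        value = None
        with self.choice():
            with self.option():
                value = int(self.token(r"\d+"))
            with self.option():
                self.token(r"\(")
                value = self.expr()
                self.token(r"\)")
            raise FailedParse(f"expected a number or '(' at position {self.pos}")
        return value


def parse(text):
    parser = Calc(text)
    value = parser.expr()
    parser.token(r"$")  # assert that all input was consumed
    return value
```

In this sketch the rule bodies only state what must come next. Backtracking lives entirely in the `option` context manager, and calling a rule again at a position it has already visited returns the memoized outcome instead of re-parsing.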

The parser generator, the run-time support, and the generated parsers have measurably low cyclomatic complexity. At around 5 KLOC of Python, all of Grako’s source code can be studied in a single session.

Grako depends only on the Python standard library. The regex library is used if installed, and colorama is used for trace output if available. pygraphviz is required for generating diagrams.

Grako is feature-complete and currently being used with complex grammars to parse, analyze, and translate hundreds of thousands of lines of input text, including source code in several programming languages.

Rationale

Grako was created to address some recurring problems encountered over decades of working with parser generation tools:

Some programming languages allow the use of keywords as identifiers, or have different meanings for symbols depending on context (Ruby). A parser needs control of lexical analysis to be able to handle those languages.
LL and LR grammars become contaminated with myriads of lookahead statements to deal with ambiguous constructs in the source language. PEG parsers address ambiguity from the outset.
Separating the grammar from the code that implements the semantics, and using a variation of a well-known grammar syntax (EBNF) allows for full declarative power in language descriptions. General-purpose programming languages are not up to the task.
Semantic actions do not belong in a grammar. They create yet another programming language to deal with when doing parsing and translation: the source language, the grammar language, the semantics language, the generated parser’s language, and the translation’s target language. Most grammar parsers do not check the syntax of embedded semantic actions, so errors get reported at awkward moments, and against the generated code, not against the grammar.
Preprocessing (like dealing with includes, fixed column formats, or structure-through-indentation) belongs in well-designed program code, not in the grammar.
It is easy to recruit help with knowledge about a mainstream programming language like Python, but help is hard to find for working with complex grammar-description languages. Grako grammars are in the spirit of a Translators and Interpreters 101 course (if something is hard to explain to a college student, it’s probably too complicated, or not well understood).
Generated parsers should be easy to read and debug by humans. Looking at the generated source code is sometimes the only way to find problems in a grammar, the semantic actions, or in the parser generator itself. It’s inconvenient to trust generated code that one cannot understand.
Python is a great language for working with language parsing and translation.
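
The separation of semantics from grammar argued for above can be sketched as follows. This is only an illustration under stated assumptions, not Grako's API: the function `parse_sum`, the rule names `number` and `addition`, and both semantics classes are hypothetical. The recogniser knows only the shape of the input and reports each matched rule by name. What the input means is decided by a separate, ordinary Python object.

```python
import re


def parse_sum(text, semantics=None):
    """Recognise  addition = number {"+" number}  and report rules by name."""
    def act(rule_name, ast):
        # Look up a method named after the rule; default to the raw AST.
        handler = getattr(semantics, rule_name, None)
        return handler(ast) if handler is not None else ast

    if not re.fullmatch(r"\s*\d+(\s*\+\s*\d+)*\s*", text):
        raise ValueError(f"not an addition: {text!r}")
    numbers = [act("number", lexeme) for lexeme in re.findall(r"\d+", text)]
    return act("addition", numbers)


class EvalSemantics:
    """Interprets the input: computes the value of the addition."""

    def number(self, ast):
        return int(ast)

    def addition(self, ast):
        return sum(ast)


class LispSemantics:
    """Translates the same input into a prefix-notation string."""

    def addition(self, ast):
        return "(+ " + " ".join(ast) + ")"
```

The same recogniser serves both interpretation and translation, and the semantic code is plain Python that normal tooling can check and debug.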

Documentation

The complete documentation is available at Grako’s home page.

License

Copyright (C) 2017      by Juancarlo Añez
Copyright (C) 2012-2016 by Juancarlo Añez and Thomas Bragg

You may use Grako under the terms of the BSD-style license described in the enclosed LICENSE.txt file. If your project requires different licensing please email.

Changes

See the CHANGELOG for details.

Release history

Version	Released	Status
3.99.9	May 6, 2017	this version
3.22.0	Apr 19, 2017	
3.21.1	Apr 16, 2017	
3.21.0	Apr 14, 2017	yanked (deprecated)
3.20.1	Apr 13, 2017	
3.19.4	Apr 2, 2017	
3.18.2	Feb 4, 2017	
3.18.1	Dec 13, 2016	yanked (deprecated)
3.18.0	Dec 6, 2016	yanked (deprecated)
3.17.0	Dec 1, 2016	
3.16.5	Nov 4, 2016	
3.16.4	Nov 2, 2016	yanked (deprecated)
3.16.3	Oct 28, 2016	yanked (deprecated)
3.16.2	Oct 24, 2016	yanked (deprecated)
3.16.1	Oct 16, 2016	yanked (deprecated)
3.16.0	Oct 2, 2016	yanked (deprecated)
3.15.1	Sep 28, 2016	
3.15.0	Sep 23, 2016	yanked (deprecated)
3.14.0	Aug 29, 2016	
3.13.0	Aug 18, 2016	
3.12.1	Aug 6, 2016	
3.11.0	Aug 6, 2016	
3.10.1	Jul 18, 2016	
3.10.0	Jul 15, 2016	yanked (deprecated)
3.9.3	Jun 29, 2016	
3.9.2	Jun 23, 2016	yanked (deprecated)
3.9.1	May 26, 2016	yanked (deprecated)
3.9.0	Apr 25, 2016	yanked (deprecated)
3.8.2	Apr 23, 2016	
3.8.1	Mar 18, 2016	yanked (deprecated)
3.8.0	Mar 15, 2016	yanked (deprecated)
3.7.0	Mar 6, 2016	
3.6.8	Feb 18, 2016	
3.6.7	Jan 27, 2016	yanked (deprecated)
3.6.6	Dec 2, 2015	yanked (deprecated)
3.6.5	Sep 13, 2015	yanked (deprecated)
3.6.4	Sep 13, 2015	yanked (deprecated)
3.6.3	Aug 1, 2015	yanked (deprecated)
3.6.2	Jul 22, 2015	yanked (deprecated)
3.6.1	Jun 7, 2015	yanked (deprecated)
3.6.0	May 12, 2015	yanked (deprecated)
3.5.1	Mar 12, 2015	
3.4.3	Nov 22, 2014	
3.4.2	Nov 4, 2014	yanked (deprecated)
3.4.1	Aug 14, 2014	yanked (deprecated)
3.4.0	Aug 12, 2014	yanked (deprecated)
3.3.0	Jul 22, 2014	
3.2.1	Jul 21, 2014	
3.2.0	Jul 17, 2014	yanked (deprecated)
3.1.2	Jul 14, 2014	
3.1.1	Jul 13, 2014	yanked (deprecated)
3.1.0	Jul 5, 2014	yanked (deprecated)
3.0.4	Jul 1, 2014	
3.0.3	Jun 27, 2014	yanked (deprecated)
3.0.1	Jun 22, 2014	yanked (deprecated)
3.0.0	Jun 16, 2014	yanked (deprecated)
2.4.3	Jun 8, 2014	
2.4.2	May 26, 2014	yanked (deprecated)
2.4.1	Apr 19, 2014	yanked (deprecated)
2.4.0	Mar 8, 2014	
2.3.0	Nov 27, 2013	
2.2.2	Oct 31, 2013	
2.2.1	Oct 22, 2013	yanked (deprecated)
2.2.0	Oct 8, 2013	yanked (deprecated)
2.1.0	Sep 6, 2013	
2.0.4	Aug 15, 2013	
2.0.3	Jul 5, 2013	yanked (deprecated)
2.0.2	Jun 26, 2013	yanked (deprecated)

Download files

Download the file for your platform.

Source Distribution

grako-3.99.9.zip (159.3 kB)

Uploaded May 6, 2017 (Source)

Built Distribution

grako-3.99.9-py2.py3-none-any.whl (82.1 kB)

Uploaded May 6, 2017 (Python 2, Python 3)

Hashes for grako-3.99.9.zip

Algorithm	Hash digest
SHA256	`fcc37309eab7cd0cbbb26cfd6a54303fbb80a00a58ab295d1e665bc69189c364`
MD5	`2e94817e48be1cd2a7603ce92cc22b8e`
BLAKE2b-256	`330d6db911c7f6458974745c91c1e71841e347364798a5cc01e8149e84352c77`

Hashes for grako-3.99.9-py2.py3-none-any.whl

Algorithm	Hash digest
SHA256	`28813fd09e31edf26761e2f814ef65adc073d432f0f1e4311ae5e5b1519978c2`
MD5	`8bedeec19e0c76dc8326d3ddbc541b62`
BLAKE2b-256	`c91dc10ddc7ccf060695d1cefda9726a2923d45e7a4d39ac08ede3b014e9b33f`
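
A downloaded file can be checked against the SHA256 digests listed above with Python's standard hashlib module. The following is a minimal sketch. The helper names `sha256_of_stream` and `verify_download` are hypothetical, and the file is assumed to have been saved locally under its original name.

```python
import hashlib

# SHA256 digests copied from the tables above.
EXPECTED_SHA256 = {
    "grako-3.99.9.zip":
        "fcc37309eab7cd0cbbb26cfd6a54303fbb80a00a58ab295d1e665bc69189c364",
    "grako-3.99.9-py2.py3-none-any.whl":
        "28813fd09e31edf26761e2f814ef65adc073d432f0f1e4311ae5e5b1519978c2",
}


def sha256_of_stream(stream, chunk_size=65536):
    """Return the hex SHA256 digest of a binary stream, read in chunks."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, filename=None):
    """Return True if the file at `path` matches the listed digest.

    `filename` selects the expected digest; by default it is taken from
    the last component of `path`.
    """
    name = filename or path.replace("\\", "/").rsplit("/", 1)[-1]
    with open(path, "rb") as handle:
        return sha256_of_stream(handle) == EXPECTED_SHA256[name]
```

For example, `verify_download("grako-3.99.9.zip")` returns True only if the local file's digest equals the one published for the source distribution.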