Skip to main content

A library for mapping CIDOC-CRM classes to Python objects

Project description

[![Build Status](https://travis-ci.org/gri-is/crom.svg?branch=master)](https://travis-ci.org/gri-is/crom) [![Coverage Status](https://coveralls.io/repos/github/gri-is/crom/badge.svg?branch=master)](https://coveralls.io/github/gri-is/crom?branch=master)

# Cromulent

A Python library to make creation of CIDOC CRM easier by mapping classes/predicates to python objects/properties, thereby making the CRM “CRoMulent”, a Simpsons neologism for “acceptable” or “fine”.

## Status: Alpha

The core vocabulary loading functionality is reasonably stable. The vocabulary section is expanding as we find new, useful terms to include and will likely change to instead be loaded separately from configurations.

The code is actively being developed and compability breaking changes are thus to be expected as we use it in various projects across The J Paul Getty Trust, and beyond.

## How to Use It

### Basic Usage

Import the classes from the model module. As the classes are dynamically generated, they’re not in the code but will be there once the build_classes function has been called.

`python from cromulent.model import factory, Person p = Person("Mother") p2 = Person("Son") p3 = Person("Daughter") p.parent_of = p2 p.parent_of = p3 print factory.toString(p, compact=False) `

### Vocabulary

`python from cromulent.model import factory from cromulent.vocab import Height h = Height() h.value = 6 print factory.toString(h, compact=False) `

### Tricks and Gotchas

  • Assigning to the same property repeatedly does NOT overwrite the value, instead it appends. To overwrite a value, instead set it to a false value first.

### Factory settings

There are several settings for how the module works, which are managed by a factory object in model.

  • base_url The base url on to which to append any slug given when an object is created

  • base_dir The base directory into which to write files, via factory.toFile()

  • default_lang The code for the default language to use on text values

  • context_uri The URI to use for @context in the JSON-LD serialization

  • debug_level Settings for debugging errors and warnings, defaults to “warn”

  • log_stream An object implementing the stream API to write log messages to, defaults to sys.stderr

  • materialize_inverses Should the inverse relationships be set automatically, defaults to False

  • filename_extension The extension to use on files written via toFile(), defaults to “.json”

  • full_names Should the serialization use the full CRM names for classes and properties instead of the more readable ones defined in the mapping, defaults to False

  • validate_properties Should the model be validated at run time when setting properties, defaults to True (this allows you to save processing time once you’re certain your code does the right thing)

  • prefixes A dictionary of prefix to URI for URIs to compress down to prefix:slug format

  • prefixes_rev The reverse of the prefixes dictionary

  • context_json The parsed JSON object of the context from which the prefixes are derived

Note that factories are NOT thread safe during serialization. A property on the factory is used to maintain which objects have been serialized already, to avoid infinite recursion in a cyclic graph. Create a new factory object per thread if necessary.

## How it Works

At import time, the library parses the vocabulary data file (data/crm_vocab.tsv) and creates Python classes in the module’s global scope from each of the defined RDF classes. The names of the classes are intended to be easy to use and remember, not necessarily identical to the CRM ontology’s names. It also records the properties that can be used with that class, and at run time checks whether the property is defined and that the value fits the defined range.

## Hacking

You can change the mapping by tweaking utils/vocab_reader.py and rerunning it to build a new TSV input file. See also the experimental code for loading completely different ontologies.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cromulent-0.4.2.tar.gz (64.8 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page