skip to navigation
skip to content

Not Logged In

cubicweb-ner 0.1.0

Cube for named entities source and recognition (NER).

Summary
-------
Cube for named entities source and recognition (NER).


This cube provides:

- the notion of NerSource (i.e. Named Entities Source), e.g. dbpedia or dbpedia-en (for
Dbpedia in english).

- the notion of NerEntry, which is a token/word/entry that could be recognized.
Basically it requires a "label" and a "cwuri", but an "unormalize_label"
could be given for quicker match, a "weight" for disambiguation or
a "lang" for sorting. It should be related to a NerSource.

- the notion of NerProcess, which is an entity type that stores the parameters
for a Named Entities Recognition: a "name", an "host" (appid or url of a sparql endpoint),
a request (RQL or SPARQL, with the "token" key for substitution), a type ('rql' or 'sparql')
for now, and a lang (for sorting).


Basically a lexic could be defined (NerSource), that contains entries (NerEntry).
Thus processes (NerProcess) could be defined in other applications to retrieve these entries
in some content.



Installation
------------

Creation of the instance:

* Create an instance using: cubicweb-ctl create ner <name-of-instance>

* Create the instance's database using: cubicweb-ctl db-create <name-of-instance>


Creating entities
-----------------

For creating a NerSource (in a cw shell):

session.create_entity('NerSource', name=<name of="" the="" source="">)

E.g.:

session.create_entity('NerSource', name=u"dbpedia-38-en")


For creating a simple NerEntry (in a cw shell):

session.create_entity('NerEntry', label=  
File Type Py Version Uploaded on Size
cubicweb-ner-0.1.0.tar.gz (md5) Source 2014-03-14 19KB
  • Downloads (All Versions):
  • 1 downloads in the last day
  • 12 downloads in the last week
  • 57 downloads in the last month