Skip to main content

Archetypes storage for storing fields raw values on the file system.

Project description

By Ingeniweb.



1 About FileSystemStorage

FileSystemStorage (FSS) is an Archetypes storage for storing fields raw values on the file system. This storage is used to avoid unnecessary growth of the ZODB’s FileStorage (Data.fs) when using a lot of large files.

Please note that FSS is a Plone component for content types developers. Do not expect anything more in your Plone site if you don’t use Plone components that may use FSS, such as AttachmentField.

3 Requirements

4 Upgrading

4.1 From FSS 2.5.x and older

The way we configure the storage and backup paths as well as the storage strategies HAVE CHANGED IN DEPTH. There’s no automatic migration from this version. Please create a configuration file that clones your old configlet preferences as indicated in FSS main configuration.

4.2 From iw.fss 2.7.1

From this version and later version, iw.fss registers upgrade steps in GenericSetup. When upgrading to a newer version of iw.fss, just have a look at the “Upgrade” tab of the portal_setup tool in your Plone site.

Warning, due to the removal of the portal_fss tool upgrading from 2.7.1 does not retain the RDF options (activated and specific script). If you used RDF with 2.7.1, you need to redo the same RDF setting in the FileSystemStorage config panel.

5 Migrating between strategies

Use the bin/strategymigrator.py shell utility that ships with FSS. Get more info on this utility using the –help option.

Note that only flat -> directory and directory -> flat are available today. Sponsorship is welcome - as indicated in Support and feedback - if you need other strategy migrations.

6 Installation

6.1 With buildout

This example speaks of itself:

[buildout]
...
eggs =
  ...
  iw.fss
  ...

[instance]
...
zcml =
  ...
  iw.fss
  iw.fss-meta
  ...

You may also use the iw.recipe.fss recipe egg to configure fully an instance.

6.2 From SVN reposo

  • Inflate iw.fss package into your zope instance lib/python directory. I’m sure it’s already done.

  • Add $INSTANCE_HOME/etc/package-includes/iw.fss-meta.zcml file with this line:

    <include package="iw.fss" file="meta.zcml"/>
  • Add $INSTANCE_HOME/etc/package-includes/iw.fss-configure.zcml file with this line:

    <include package="iw.fss" />

6.3 Configuring your Plone sites

  • Read carefully Storage strategies and FSS main configuration below and configure your instance in plone-filesystemstorage.conf accordingly. You may use the iw.recipe.fss recipe in your buildout.cfg to do this.

  • Start your Zope instance

  • In ZMI, go into your plone site and install the product using portal_quickinstaller

  • Optionally, tweak some - self explained - preferences in the FileSystemStorage configuration panel of your Plone site.

7 Storage strategies

A storage strategy defines how field values will be stored in your files system.

7.1 Strategies overview

FSS comes with 4 storage strategies. Each strategy requires two directories:

  • The storage directory stores the values of relevant fields in files according to the selected strategy.

  • The backup directory that keeps the values of the fields of deleted contents for “Undo” purpose.

  • Only one storage strategy can be selected on a given Plone site, but different Plone site in the same Zope instance may have distinct storage strategies.

  • You need the best performances: choose the Flat storage strategy or Directory storage strategy.

  • You need a storage that looks like your Plone site (in example to publish the storage directory in a read only samba or NFS share): choose the Site storage strategy 1 or the Site storage strategy 2.

  • If you care about CMFEdition support (means that one of the content types using FSS is versioned with CMFEditions), you should not use the Site storage strategy 1 or the Site storage strategy 2.

See FSS main configuration to configure the best suited strategy on your Zope instance and Plone sites.

7.2 Flat storage strategy

All field values are stored in a flat structure. This strategy is the default one.

  • Filename of these stored values: <uid of content>_<field name>

  • Filename of rdf file: <uid of content>_<field name>.rdf

  • Filename of backup values: <uid of content>_<field name>.bak

Rdf files are not backed up. They are automatically generated.

Example of storage:

fssDirectory
|
|- f42ad00adb7d4580f929d734bf1ed3ab_image
|
|- f42ad00adb7d4580f929d734bf1ed3ab_image.rdf
|
|- f42ad00adb7d4580f929d734bf1ed3ab_file
|
|- f42ad00adb7d4580f929d734bf1ed3ab_file.rdf

fssBackupDirectory
|
|- 9efeb7638fb35f5f9b9955da3e2dbfec_file.bak

7.3 Directory storage strategy

All field values are stored in a directory structure. Sub directories are defined on two level. First level of directory uses the 2 characters of content uid. Second level of directory uses the 4 characters of content uid. Backup files are stored in a flat structure.

  • Filename of these stored values: <uid of content>_<field name>

  • Filename of rdf file: <uid of content>_<field name>.rdf

  • Filename of backup values: <uid of content>_<field name>.bak

RDF files are not backed up. They are automatically generated

Example of storage:

fssDirectory
|
|- f42
   |
   |- f42ad
      |
      |- f42ad00adb7d4580f929d734bf1ed3ab_image
      |
      |- f42ad00adb7d4580f929d734bf1ed3ab_image.rdf
      |
      |- f42ad00adb7d4580f929d734bf1ed3ab_file
      |
      |- f42ad00adb7d4580f929d734bf1ed3ab_file.rdf

fssBackupDirectory
|
|- 9efeb7638fb35f5f9b9955da3e2dbfec_file.bak

Depending in one hand on your file system performances facing a huge amount of files in the same directory, and in the other hand on the number of contents relying on FSS, you might choose this strategy or the Flat storage strategy.

7.4 Site storage strategy 1

All field values are stored in a directory structure mirroring structure of PloneSite. Backup files are stored in a flat structure.

  • Filename of these stored values: Filename of field value or field name if not defined

  • Filename of rdf file: <field name>.rdf

  • Filename of backup values: <uid of content>_<field name>.bak

Rdf files are not backed up. They are automatically generated

Example of storage:

fssDirectory
|
|- members
   |
   |- john
      |
      |- dummy-document
         |
         |- image
         |  |
         |  |- moutain.jpg
         |  |
         |  |- image.rdf
         |
         |- file
            |
            |- diary.odt
            |
            |- file.rdf

fssBackupDirectory
|
|- 9efeb7638fb35f5f9b9955da3e2dbfec_file.bak

7.5 Site storage strategy 2

All field values are stored in a directory structure mirroring structure of PloneSite. Backup files are stored in a flat structure.

  • Filename of these stored values: Filename of field value or field name if not defined

  • Filename of rdf file: <field filename>.rdf

  • Filename of backup values: <uid of content>_<field name>.bak

Rdf files are not backed up. They are automatically generated

Example of storage:

fssDirectory
|
|- members
   |
   |- john
      |
      |- dummy-document
         |
         |- fss.cfg
         |
         |- moutain.jpg
         |
         |- mountain.jpg.rdf
         |
         |- diary.odt
         |
         |- diary.odt.rdf

fssBackupDirectory
|
|- 9efeb7638fb35f5f9b9955da3e2dbfec_file.bak

8 FSS main configuration

FSS is mainly configured the ZConfig way. At startup, the configuration file will be searched (in that order) in:

  • $INSTANCE_HOME/etc/plone-filesystemstorage.conf

  • /path/to/iw/fss/etc/plone-filesystemstorage.conf

  • /path/to/iw/fss/etc/plone-filesystemstorage.conf.in

A sample working configuration in provided in this last file. It assumes you have $INSTANCE_HOME/var/fss_storage and $INSTANCE_HOME/var/fss_backup directories, both being read/write enabled to the user that runs the Zope process, unless Zope will raise an error at startup.

All configuration doc you need is in plone-filesystemstorage.conf.in comments.

Inconsistent configuration features raise explicit error messages at Zope startup.

Note that we didn’t include this in zope.conf in order to keep Zope 2.7 and Zope 2.8 compatibility.

9 Configuration panel

9.1 FSS and RDF

As mentioned above in Storage strategies, FSS can optionally store RDF files in the back-end storage. These RDF files conform the DCMES-XML standard on XML expression of DublinCore elements.

If you select the Flat storage strategy, the RDF files may be used to build a files tree as close as possible of the Plone tree structure from the storage back-end, with the build_fs_tree.py utility:

$ cd /path/to/iw/fss/bin
$ python build_fs_tree.py --help

9.2 FSS maintenance

The “Maintenance” tab of FSS configuration panel shows some statistics about the amount of files managed by FSS.

In addition 2 additional buttons are provided:

  • Update FSS files : Cleans up the back-end storage directories. Files not referenced from a content object are removed.

  • Update RDF files : Builds all RDF files, in the event you chose to select the generetion of RDF files after having inserted relevant contents.

10 Zope backups

Of course, you need to synch your Data.fs backups with the storage paths backups, unless you may restore corrupted/incomplete sites.

11 Caveats and pittfalls

Note that whatever’s the selected strategies, DO NEVER CHANGE ANY FILE IN THE STORAGE PATHS UNLESS YOU KNOW WHAT YOU DO, otherwise you’ll loose your contents, your job, your money, your friends, your wife/husband and children.

DO NEVER CHANGE THE STRATEGY FOR A PLONE SITE ONCE THIS SITE HAS FIELDS STORED IN ITS FSS STORAGE PATH unless you ran the appropriate strategy migrator - see Migrating between strategies otherwise you’re loose your contents, your job, …

Thought it is possible to share the same storage path and backup path within various Plone sites, since you don’t mind of your data in development sites, you should really configure FSS such each Plone site has its own private storage path and backup path. In cas you ignore this warning, DO NEVER CLICK ANY BUTTON IN THE MAINTENANCE TAB OF THE FSS CONFIGURATION PANEL otherwise you’ll loose your contents, your job, …

Zexp exports don’t embed the associated FSS storage. So don’t move your Plone site within Zope instances using zexp exports unless you move the storage and backup directories along with the zexp file.

After changing your configuration (see FSS main configuration), always restart your Zope instance in foreground since configuration errors are not reported when Zope is started as daemon or Windows service.

12 Developer’s corner

12.1 Using FSS in your content types

We assume that creating AT based content types is a familiar activity.

When creating the schema of a content type, you usually don’t mention the storage of your fields since AT provides a default ZODB storage. You can choose another storage, either another one that ships in the Archetypes bundle, or a third party storage like FSS.

Here is a small example that doesn’t need much comments:

# Usual Zope/CMF/Plone/Archetypes imports
...
from iw.fss.FileSystemStorage import FileSystemStorage
...
my_schema = Schema((
    FileField('file',
              ...
              storage=FileSystemStorage(),
              widget=FileWidget(...)
              ),
    ...
    ))
 ...

You may have a look at examples/FSSItem.py to see a demo content type using FSS. If you want to play with this content type:

$ $INSTANCE_HOME/bin/zopectl stop
$ export FSS_INSTALL_EXAMPLE_TYPES=1
$ $INSTANCE_HOME/bin/zopectl start

12.2 Monkey patching a third party content type

12.2.1 The Python way

This works with any Zope version listed in the Requirements. If you run a Zope 2.9 or later instance, you should prefer The ZCML way.

This simple example shows how to plug the ATFile standard type from any custom product. Let’s say we’re in the __init__.py of your custom product:

...
from iw.fss import zcml as fss_utils
from Products.ATContentTypes import atct
...
fss_utils.patchATType(atct.ATFile, ['file'])
...

12.2.2 The ZCML way

This works only with Zope 2.9 and up, and is the recommanded way to wire your AT based content types with FSS since this way hides the API details that may be deprecated in the future.

You just need to add the “fss” namespace support and some directives in your “configure.zcml” like this to have the same effect as in the example of The Python way above:

<configure
  ...
  xmlns:zcml="http://namespaces.zope.org/zcml"
  xmlns:fss="http://namespaces.ingeniweb.com/filesystemstorage"
  ...>
  ...
  <fss:typeWithFSS
    zcml:condition="installed iw.fss"
    class="Products.ATContentTypes.atct.ATFile"
    fields="file" />
  ...
</configure>

The (required) attributes of this directive are:

  • class: the dotted name of the AT based content type class.

  • fields: one or more (space separated) field names to wire with FSS.

12.3 Storing Plone standard content types with FSS

Include this ZCML element either in your instance package-includes directory or in any personal configure.zcml in your instance packages:

<include package="iw.fss" file="atct.zcml" />

This ZCML file wires FSS with the relevant content types that ship with Plone bundle:

  • File

  • Image

  • News Item

12.4 Customize RDF info set

You can add data to the RDF info set via a FSS dedicated - and optional - hook.

Open the FSS configuration panel and give the name of your personal RDF data script, let’s call it fss_custom_rdf for this example. This script will add or change the RDF default data set.

12.4.1 RDF script interface

The script may be in any Plone layer like custom or any other in the skins path.

Such script is expected to have 4 parameters:

  • name: the name of the field

  • instance: the content object being processed

  • properties: FSS field properties mapping, depending on the type of field associated to FSS storage.

  • default_rdf: a mapping structure of the RDF data, including the namespaces, and the elements.

Such script is expected to return an updated default_rdf mapping structure.

Please read the source of classes FSSInfo, FSSFileInfo and FSSImageInfo from module FileSystemStorage.py, RDFWriter from module rdf.py for detailed information on formats of expected data.

12.4.2 Simple example

We need to add the date of RDF file generation in an element like this:

<fss:rdf_date>2007-05-13 13:21:00</fss:rdf_date>

Here’s the script:

## Script (Python) "fss_custom_rdf"
##bind container=container
##bind context=context
##bind namespace=
##bind script=script
##bind subpath=traverse_subpath
##parameters=name, instance, properties, default_rdf
##title=
##
"""We add the date of RDF file generation"""

rdf_args = default_rdf.copy()

# Add the date of RDF generation
new_props = []
new_props.extend(rdf_args['field_props'])
new_props.append({'id': 'fss:rdf_date', 'value': DateTime().ISO()})
rdf_args['field_props'] = new_props

return rdf_args

12.4.3 More complex example

In this more complex (and stupid - don’t do this ;) example, we remove the elements from fss namespace and simulate the standard behaviour for other elements (dc and rdf elements).

Here’s the code:

## Script (Python) "fss_custom_rdf"
##bind container=container
##bind context=context
##bind namespace=
##bind script=script
##bind subpath=traverse_subpath
##parameters=name, instance, properties, default_rdf
##title=
##
from Products.CMFCore.utils import getToolByName

rdf_args = {}

# Set RDF charset
ptool = getToolByName(instance, 'portal_properties')
rdf_args['charset'] = ptool.site_properties.default_charset

# Set RDF namespaces
rdf_args['namespaces'] = (
    {'id': 'xmlns:rdf', 'value': 'http://www.w3.org/1999/02/22-rdf-syntax-ns#'},
    {'id': 'xmlns:dc', 'value': 'http://purl.org/dc/elements/1.1/'},
    {'id': 'xmlns:fss', 'value': 'http://namespace.ingeniweb.com/fss'},)

# Set field url
utool = getToolByName(instance, 'portal_url')
portal_path = utool.getPortalObject().getPhysicalPath()
portal_path_len = len(portal_path)
rel_path = '/'.join(instance.getPhysicalPath()[portal_path_len:])
fss_path = '%s/%s' % (rel_path, name)
rdf_args['field_url'] = fss_path

# Set RDF properties
props = (
   {'id': 'dc:title', 'value': instance.title_or_id()},
   {'id': 'dc:description', 'value': instance.Description()},
   {'id': 'dc:language', 'value': instance.Language()},
   {'id': 'dc:creator', 'value': instance.Creator()},
   {'id': 'dc:date', 'value': instance.modified()},
   {'id': 'dc:format', 'value': properties.get('mimetype', 'text/plain')},
   )
rdf_args['field_props'] = props

# Returns the RDF structure
return rdf_args

12.4.4 Recommanded practices on custom RDF

Do not change the existing elements under fss namespace unless the bin/build_fs_tree.py utility won’t work. In other terms, do not use the script of the More complex example.

Adding or changing elements to the dc and rdf namespaces may screw up external utilities based on canonical DCMES-XML standard.

Consider adding your own namespace for your custom extra elements.

13 Testing

Please read ./tests/README.txt.

14 Other documentation

See ./doc/* and ./locales/README.txt

15 Downloads

You may find newer stable versions of FSS and pointers to related informations (tracker, doc, …) from http://pypi.python.org/pypi/iw.fss

16 Subversion repository

Stay in tune with the freshest (maybe unstable) versions or participate to the FileSystemStorage evolutions:

https://svn.plone.org/svn/collective/iw.fss

17 Support and feedback

Please read all the documentation that comes with this product before asking for support, unless you might get a RTFM reply ;)

Localisation issues - other than french - should be reported to the relevant translators (see Credits below).

Report bugs using the tracker (the Tracker link from http://plone.org/iw.fss). Please provide in your bug report:

  • Your configuration (Operating system+Zope+Plone+Products/versions).

  • The storage strategy in use.

  • The full traceback if available.

  • One or more scenario that triggers the bug.

Note that we do not support bug reports on Subversion trunk or branches checkouts.

Mail to Ingeniweb support in English or French to ask for specific support.

Donations are welcome for new features requests

18 Credits

See locales/README.txt about localization credits.


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iw.fss-2.7.6.tar.gz (125.2 kB view hashes)

Uploaded Source

Built Distribution

iw.fss-2.7.6-py2.4.egg (236.4 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page