You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »

What

ImportAmalgamate

Required

no

Description

Amalgamates external historical data

schema location

http://fews.wldelft.nl/schemas/version1.0/importAmalgamate.xsd

Description

Workflows may produce inefficient blobs that only span a few time steps. These blobs will be amalgamated. After the amalgamate is finished, the original import runs with all its imported external data is scheduled for deletion. Large grids, external forecasts and samples are not handled through this module.

Configuration

workflowId

One or more work flow ids that import external historical time series over a short time span (scheduled frequently).

expiry time.

The original (import) runs are deleted after successful amalgamation. To make sure the run is successful amalgamated and not removed by the rolling barrel it is best to configure a large expiry time for the amalgamated workflows in the workflowDescriptors.xml. Configure an explicit expiry time for the time series sets in the amalgamated workflows. The expiry time of the individual time series is preserved while amalgamated.

importRunMinimalAge

Import runs younger than the specified age are skipped. After the amalgamate has run it is no longer possible to create an archive with the exact original data available during the run.

amalgamateOrphans

For systems that did not have an amalgamate running it maybe required to amalgamate the complete database. Of course this should only be done once.
It is possible do this with the amalgamateOrphans=true. As this task may take too long (regular FSS tasks have a time out of 3 hours) this should be handled with care.
In that case it maybe useful to have the task run at an FSS that has no localdatastore, but has direct access to the central database. Only in that particular case the amalgamate runs (hard coded in the software) for only 1 hour. Then then task is aborted and the remaining imports are not handled in that taskrun anymore. This is done on purpose, as it is now possible to schedule this task with an interval of e.g. 3 hours (making it possible to the MC to do all the required cleaning stuff like rollingBarrel and markedRecordManager). In a few hours (to even days or in some rare situations evens weeks) all the taksruns are handled and the complete database has been amalgamated. Then the scheduled task can be stopped.

Example

ModuleDescriptors.xml
<moduleDescriptor id="ImportAmalgamate">
  <className>nl.wldelft.fews.system.plugin.amalgamate.ImportAmalgamate</className>
</moduleDescriptor>
ModuleInstanceDescriptors.xml
<moduleInstanceDescriptor id="Amalgamate">
  <moduleId>ImportAmalgamate</moduleId>
</moduleInstanceDescriptor>
Amalgamate.xml
<?xml version="1.0" encoding="UTF-8"?>
<importAmalgamate xmlns="http://www.wldelft.nl/fews" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.wldelft.nl/fews http://fews.wldelft.nl/schemas/version1.0/importAmalgamate.xsd">
  <workflowId>Import_Data</workflowId>
  <workflowId>Procesoverzicht_Update</workflowId>
  <importRunMinimalAge unit="hour"/>
</importAmalgamate>
  • No labels