Collectors are a powerful feature that we introduced in hale studio 3.1.0 and have since then expanded on. So, for what use cases should you be looking at collectors?
Network, that needs to reference many or all objects of a different type, e.g.
AdministrativeUnits, with their
With a collector, you can collect values in one place in your transformation project and then use these values in another place in the transformation process. Let’s look at a recent project we’ve worked on to see how collectors work in practice.
Please note that this article assumes you have working knowledge of hale studio and know the terminology.
To use a collector, there are two to three steps:
We always collect values in the context of another transformation function. As of hale studio 3.2.0, these are the transformation functions that support the definition of collectors:
In all of these functions plus the following ones, you can apply the collected values:
With the upcoming 3.3.0 release, you’ll see more widespread support for the feature. Basically, most existing functions will allow collecting values, and we’ll add more functions to assign them.
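The collect-then-apply idea described above can be pictured with a small, tool-independent sketch. The Python below is not hale studio code and does not use its API; `CollectorStore`, `transform_river` and `create_network` are hypothetical names made up for illustration, and the data is invented. It only shows the pattern: values are recorded under a collector name while one kind of object is transformed, and read back when another object is built.

```python
from collections import defaultdict


class CollectorStore:
    """Toy stand-in for a transformation context holding named collectors."""

    def __init__(self):
        self._values = defaultdict(list)

    def collect(self, name, value):
        # Append a value to the collector with the given name.
        self._values[name].append(value)

    def values(self, name):
        # Return a copy of everything collected under that name so far.
        return list(self._values[name])


def transform_river(river, store):
    # Step 1: while mapping a source object, also record its id.
    store.collect("linkIDs", river["fid"])
    return {"type": "WatercourseLink", "id": river["fid"]}


def create_network(store):
    # Step 2: a single target object uses all collected values.
    return {"type": "Network", "links": store.values("linkIDs")}


rivers = [{"fid": "r1"}, {"fid": "r2"}, {"fid": "r3"}]
store = CollectorStore()
links = [transform_river(r, store) for r in rivers]
network = create_network(store)
```

Note that `create_network` only sees all three ids because it runs after every river has been processed, which is why the order of execution matters for collectors.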
This is the use case we are going to work on for this tutorial:
We need to create an INSPIRE Hydrographical Network dataset from UK Meridian 2 data encoded in a specific schema, using a GML 2.1 encoding. Each river segment from the source will be transformed to a WatercourseLink, and in addition, we’ll create a Network object that references all created WatercourseLink objects.
You can take a look at the hale transformation project for this tutorial and download it, including source data, here at haleconnect.com.
On the source side, select River; on the target side, select WatercourseLink. Click on the double arrow icon and select Retype. Use the default values for the function.
Select fid on the source and id on the target feature type. Click on the double arrow icon and select Groovy Script. Leave the parameters on the first page as they are and click Next to proceed to the actual script editor. After you’ve entered the script, click Finish to let the transformation execute.
This is the actual script to use:
Assign collected values function
The easiest way to use values from a collector is to use the Assign collected values function. Follow these steps to use it:
On the target side, select the Network feature type. Click on the arrow icon and select Create. This function will create one or more objects of the target type from thin air. Create exactly one object.
Click on Network and then on the arrow icon. Choose Assign collected values and click Next. Enter the name of the collector we’ve defined in the script above (linkIDs) so that it can be accessed.
The Assign collected values function has some special behaviour to automatically identify and create local references. If you inspect the created network, you’ll see it now has 982 references that all look like this:
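For orientation: in XLink terms, a local reference is a fragment identifier, that is, a `#` followed by the id of the referenced object in the same document. The Python sketch below only illustrates that shape. The ids are made up, `to_local_reference` is a hypothetical helper, and the assumption that each collected id maps one-to-one to a `#id` string belongs to this sketch rather than to hale studio's documented behaviour.

```python
def to_local_reference(object_id):
    """Build an XLink-style local reference (fragment) from an object id."""
    return "#" + object_id


# Hypothetical ids standing in for the values held by the linkIDs collector.
link_ids = ["link.1", "link.2"]
hrefs = [to_local_reference(i) for i in link_ids]
```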
hale studio, in principle, automatically determines the execution order of all cells. In some cases, this may not have the desired effect, so you need to give the transformation engine hints about what should happen first and what should happen last. For collectors, it’s important that the engine finishes collecting values before it tries to apply them in a different place. We do plan to recognize these cases automatically, but for now, you’ll have to assign cell execution priorities to make sure everything always works as expected.
To ensure that the described steps are executed in the correct sequence, the execution priority has to be defined accordingly. The second mapping cell (Network) therefore has to be set to a lower priority than the first mapping cell. This can be done via the context menu in hale studio.
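Why the lower priority matters can be shown with a rough sketch of priority-based ordering. This is plain Python, not hale studio's scheduler: the `Cell` class, the priority labels and their numeric ranks are assumptions made for illustration only.

```python
from dataclasses import dataclass

# Hypothetical priority labels; a higher rank runs earlier in this sketch.
PRIORITY_RANK = {"high": 3, "normal": 2, "low": 1}


@dataclass
class Cell:
    name: str
    priority: str = "normal"


def execution_order(cells):
    # sorted() is stable, so cells with equal priority keep their given order.
    return sorted(cells, key=lambda c: PRIORITY_RANK[c.priority], reverse=True)


cells = [
    Cell("Network: Assign collected values", priority="low"),
    Cell("River to WatercourseLink: Retype"),
    Cell("fid to id: Groovy Script"),
]
order = [c.name for c in execution_order(cells)]
```

With the Network cell set to the lower priority, it ends up last in `order`, so every id has been collected before the collected values are assigned.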
With these three steps, you have learned how to use the collector feature in hale studio. Let us know what you think of this feature and what we can do to improve its usability!