Workflow Management with OPALS

This tutorial introduces the concept of creating arbitrary workflows using either shell scripts, python scripts or C++ programs.

Introduction

Due to the high amount of data provided by modern sensors, ALS data processing requires a high degree of automation. In general, similar calculation tasks are carried out for multiple processing units. Typically, a processing unit is either a single ALS strip (strip DSMs, relative strip differences ...) or a window (tile) of limited extension containing the combined ALS data of all ALS strips within the window (ALS filtering, DTM calculation ...).

Therefore, this tutorial focuses on the automation of data processing using scripts. A general understanding of how to use OPALS is a precondition. If necessary, please refer to Getting Started for basic information.

First examples

In the following subsections example scripts aiming at the derivation of relative strip differences are provided for each OPALS module implementation (command line executable, Python module, C++ class). For the sake of clarity, the scripts are kept very simple. Only a minimal set of input parameters is used and error handling is limited to a rudimentary level. So, please don't take them as examples for good programming style. Instead, real-world scripts should rather be more flexible concerning the parametrization and should provide more thorough parameter checks. The sample data used in the scripts can be found in directory $OPALS_ROOT/demo/.

A first shell script

For running the following script, it is necessary to generate a text file containing all strip filenames. E.g. for the demo data:

strip11.laz
strip21.laz
strip31.laz
@echo off
rem == Shell script for the derivation of z-colored strip differences ===========
IF "%1"=="" goto usage
IF "%2"=="" goto usage
SET StripFiles=%1
SET GridWidth=%2
SET OdmFiles=
SET BoundFiles=
SET DiffFiles=
SET StripPairs=strippairs.txt
rem == write screen log level to global cfg file =============================
echo Starting shell script strip diff example
rem == Import and DSM computation =============================================
setlocal enabledelayedexpansion
FOR /F "tokens=1" %%I IN (%StripFiles%) DO (
echo Importing strip %%I
opalsImport -inFile %%I
if !ERRORLEVEL! NEQ 0 goto error_occured
call :change_ext %%I odm
set OdmFiles=!OdmFiles! !result!
set ODM=!result!
call :change_ext %%I tif
set DSM=!result!
echo Computing boundary of strip %%I
opalsBounds -inFile !ODM! -outFile !ODM!.wnp -boundsType convexHull
if !ERRORLEVEL! NEQ 0 goto error_occured
echo Computing DSM
opalsGrid -inFile !ODM! -outFile !DSM! -gridSize %GridWidth% -interp movingPlanes
if !ERRORLEVEL! NEQ 0 goto error_occured
call :change_ext %%I odm.wnp
SET BoundFiles=!BoundFiles! !result!
)
echo Determining overlapping strip pairs
echo %BoundFiles%
opalsOverlap -inFile !BoundFiles! -outFile %StripPairs%
if !ERRORLEVEL! NEQ 0 goto error_occured
rem == Computing strip differences =================================
echo Computing strip differences
FOR /F "tokens=1,2,3" %%I IN (%StripPairs%) DO (
call :remove_ext %%I
call :change_ext !result! tif
set grid1=!result!
call :remove_ext %%K
call :change_ext !result! tif
set grid2=!result!
echo Computing strip differences between !grid1! and !grid2!
opalsDiff -inFile !grid1! !grid2!
if !ERRORLEVEL! NEQ 0 goto error_occured
call :remove_ext !grid1!
set DiffFiles=!DiffFiles! diff_!result!_!grid2!
)
echo Computing strip difference mosaic
opalsAlgebra -inFile !DiffFiles! -outFile diffMosaic.tif -limit union -formula first(r)
if !ERRORLEVEL! NEQ 0 goto error_occured
echo Deriving color coded visualization
opalsZColor -inFile diffMosaic.tif -palFile $OPALS_ROOT\addons\pal\differencePal.xml -scalePal 0.06
if !ERRORLEVEL! NEQ 0 goto error_occured
rem remove temp file
del %StripPairs% /q
del opals.cfg /q
echo Shell script strip diff example finished
goto end:
:change_ext
set result=%~n1.%2
goto :eof
:remove_ext
set result=%~n1
goto :eof
:error_occured
echo An error occurred. Script execution stopped
goto end
:usage
echo Usage:
echo %~n0 striplistfile gridwidth
goto end
:end
@echo on

If we store the strip file list as striplist.txt and the example script as computeStripDiff.bat , we can simply run the script by typing:

computeStripDiff striplist.txt 1

The script reads the names of the strip files from striplist.txt file, and for each strip it imports the data into the ODM (opalsImport), derives the strip boundary (opalsBounds), and calculates the surface model (opalsGrid). Furthermore the list of overlapping strip pairs is determined (opalsOverlap) and strip difference models are calculated for each overlapping strip pair. Finally, all strip difference models are merged into a single mosaic and a color coded visualization using the standard difference color palette is derived. The resulting color coded mosaic is shown in Fig. 1.

Fig. 1: Color coded strip differences of strips 11, 21, and 31 (mosaic)

A first Python script

The above example reveals a series of organizational and syntactical drawbacks. Formulating loops, defining local variables, and passing parameters is not very elegant. The more, access to results is only possible via file interface and error handling relies on numerical return codes of the command line executables. Python, in contrary, is a full featured scripting and programming language offering a wide range of (built in) features. The very same example as above implemented as a Python script, would look like this (in python 3.x syntax):

from argparse import ArgumentParser
import itertools
import os
import opals
from opals.Import import *
from opals.Bounds import *
from opals.Overlap import *
from opals.Grid import *
from opals.Diff import *
from opals.Algebra import *
from opals.ZColor import *
imp = Import()
grd = Grid()
dif = Diff()
bnd = Bounds()
ovl = Overlap()
alg = Algebra()
zco = ZColor()
print("Starting Python strip diff example")
parser = ArgumentParser()
parser.add_argument("-s", "--striplist", type=str, nargs='*', help="list of strip files seperated by blanks")
parser.add_argument("-g", "--gridwidth", type=float, default="1", help="the grid width to use")
options = parser.parse_args()
stripfiles = []
strippairs = []
stripfiles = options.striplist
if len(stripfiles) < 2:
raise NameError('At least 2 strip files must be specified')
dsm_files = []
odm_files = []
dif_files = []
scrLogLevel = opals.Types.LogLevel.error
for stripfile in stripfiles:
print("Importing strip {}".format(os.path.basename(stripfile)))
imp.reset()
imp.inFile = stripfile
imp.commons.screenLogLevel = scrLogLevel
imp.run()
odm = imp.outFile
odm_files.append( odm )
print("Computing strip boundary")
bnd.reset()
bnd.inFile = odm
bnd.boundsType = opals.Types.BoundaryType.convexHull
bnd.commons.screenLogLevel = scrLogLevel
bnd.run()
print("Computing DSM")
grd.reset()
grd.inFile = odm
grd.gridSize = options.gridwidth
grd.interpolation = opals.Types.GridInterpolator.movingPlanes
grd.commons.screenLogLevel = scrLogLevel
grd.run()
dsm_files.append( grd.outFile[0] )
print("Determining overlapping strip pairs")
ovl.inFile = odm_files
ovl.commons.screenLogLevel = scrLogLevel
ovl.run()
pairs=ovl.pairList
for pair in pairs:
grid0 = pair[0].stem() + "_z.tif"
grid1 = pair[1].stem() + "_z.tif"
print("Computing strip differences between {} and {}".format(grid0,grid1))
try:
dif.reset()
dif.inFile = [ grid0, grid1 ]
dif.commons.screenLogLevel = scrLogLevel
dif.run()
dif_files.append(dif.outFile)
except Exception as e:
print("Exception occured: {}".format(e))
pass
print("Computing strip difference mosaic")
alg.inFile = dif_files
alg.outFile = "diffMosaic.tif"
alg.formula = "first(r)"
alg.limit = "union"
alg.commons.screenLogLevel = scrLogLevel
alg.run()
print("Deriving color coded visualization")
zco.inFile = alg.outFile
zco.palFile = r"$OPALS_ROOT"+os.sep+"addons"+os.sep+"pal"+os.sep+"differencePal.xml"
zco.scalePal = "0.06"
zco.commons.screenLogLevel = scrLogLevel
zco.run()
print("Python strip diff example finished")

Given the script is stored as stripDiff.py and the PATH , PYTHONPATH, and PATHEXT environment variables are set as described in the intallation manual, the same results as in the example above are obtained by typing:

stripDiff.py -s strip11.laz strip21.laz strip31.laz -g 1

From the pre-configured OPALS shell use the following command to start the script

python stripDiff.py -s strip11.laz strip21.laz strip31.laz -g 1

Some remarks:

The script starts with the import of some general built-in python libraries, followed by the import and instantiation of the required OPALS modules

6 from opals.Import import *
7 from opals.Bounds import *
8 from opals.Overlap import *
9 from opals.Grid import *
10 from opals.Diff import *
11 from opals.Algebra import *
12 from opals.ZColor import *
13 
14 imp = Import()
15 grd = Grid()

In the next step the built-in argument parser is initialized, the script options are parsed, and some local variables are defined.

24 parser = ArgumentParser()
25 parser.add_argument("-s", "--striplist", type=str, nargs='*', help="list of strip files seperated by blanks")
26 parser.add_argument("-g", "--gridwidth", type=float, default="1", help="the grid width to use")
27 
28 options = parser.parse_args()
29 
30 stripfiles = []
31 strippairs = []

In the next step, the actual strip files (command line argument -s) are assigned to the stripfiles variable and within a loop the strip data are imported, the strip boundaries are derived and DSMs are calculated.

43 for stripfile in stripfiles:
44  print("Importing strip {}".format(os.path.basename(stripfile)))
45  imp.reset()
46  imp.inFile = stripfile
47  imp.commons.screenLogLevel = scrLogLevel
48  imp.run()
49  odm = imp.outFile
50  odm_files.append( odm )

The above code snippet shows an interesting and very useful feature. The output (=ODM) file of the data import, which has not been set explicitly but was automatically created by Module Import, is queried via the outFile property and stored in a list (for later use in the determination of the overlapping strip pairs). This provides a good way of chaining modules (output of 1st module = input of 2nd module, output of 2nd module = input of 3rd module...) without the need of setting the output files explicitly. This is even more useful, if the results of a module are not passed via files, but as custom object. This is, for instance, the case for Module Overlap :

68 print("Determining overlapping strip pairs")
69 ovl.inFile = odm_files
70 ovl.commons.screenLogLevel = scrLogLevel
71 ovl.run()
72 pairs=ovl.pairList

Here, Module Overlap calculates all overlapping strip pairs according to the input list of strip boundary files (i.e. the underlying odm_files in this case). The resulting pair list can directly be queried after run() has been executed via the pairList property. In the subsequent lines, the strip pair list is traversed and all relevant difference models are calculated.

74 for pair in pairs:
75  grid0 = pair[0].stem() + "_z.tif"
76  grid1 = pair[1].stem() + "_z.tif"
77  print("Computing strip differences between {} and {}".format(grid0,grid1))
78  try:
79  dif.reset()
80  dif.inFile = [ grid0, grid1 ]
81  dif.commons.screenLogLevel = scrLogLevel
82  dif.run()
83  dif_files.append(dif.outFile)
84  except Exception as e:
85  print("Exception occured: {}".format(e))
86  pass

The above code also demonstrates the error handling capabilities. The OPALS error system mainly bases on exceptions which are fully integrated into the Python implementation. For the sake of clarity, the given example is very simple and shows only the basic usage of exceptions. The parameter setting and the run() function (comprising parameter checking and the actual module code) is executed within a try-except block. If an exception is thrown by the Module Diff module, it is caught by except. In this simple case, only the error message of the exception is output. However, in real world processing, more thorough examination of the error conditions would be necessary, eventually leading to re-running the module with different parameters.

Finally, the resulting difference models are merged together using Module Algebra and a color coded raster map of the resulting mosaic is derived with Module ZColor. The latter highlights the specific ways of setting different parameter types.

97 zco.inFile = alg.outFile
98 zco.palFile = r"$OPALS_ROOT"+os.sep+"addons"+os.sep+"pal"+os.sep+"differencePal.xml"
99 zco.scalePal = "0.06"
100 zco.commons.screenLogLevel = scrLogLevel

The parameters inFile and palFile are of type 'string'. Whereas the input file is set using the output of a previous module (as mentioned before), the palette file is passed as a literal constant. Please note the preceding r character before the actual string value. This is necessary to allow back slash directory delimiters which would, by default, be interpreted as escape characters. For more details, please refer to the reference manual. In the next statement the scale factor for the palette is set again as a string constant ( scalePal = "1"). This might appear strange at a first glance. However, since the underlying data type (opals::ZColorScalePal) is defined as a complex class allowing automatic and z-range scaling, we can not simply provide a (scalar) scale factor as we probably would expect. All of the following examples are valid statements for setting the palette scale:

zco.scalePal = "auto"
zco.scalePal = "(-1.0 1.0)"
zco.scalePal = "1.0"

... but ...

zco.scalePal = 1.0

... would produce the following Python error:

Traceback (most recent call last):
File "<input>", line 1, in <module>
z.scalePal = 1.0
Boost.Python.ArgumentError: Python argument types in
None.None(ZColor, float)
did not match C++ signature:
None(class ModuleZColor {lvalue}, class opals::ZColorScalePal)

Please note, that the same rule applies for all complex OPALS types (i.e. GridLimit, MultigridLimit, CellFeature...). Finally, setting the screen log level as zco.screenLogLevel = scrLogLevel is an example, how enumerators can be handled. The variable scrLogLevel has previously been defined as:

0 scrLogLevel = opals.Types.LogLevel.error

The data type LogLevel is an enumerator defined by OPALS containing the named constants error, warning, info, verbose and debug). All enumerators can be accessed by the type.value syntax. For more information about using the Python interface, please refer to the reference manual. For instructions on how to debug Python script, see Debugging OPALS Python scripts.

Example: C++ program

Small C++ programs for each Module can be found in the directory $OPALS_ROOT/c++_api/demo/ . demoImport.cpp, for example, is an exemplary program written in C++ that instantiates Module Import and imports a data set into an ODM:

#include "opals/Exception.hpp"
#include "DM/StatFeature.hpp"
#include "opals/ParamList.hpp"
#include "opals/ModuleDeleter.hpp"
#include "opals/IImport.hpp"
#include <iostream>
#include <memory>
struct Control : public opals::IControlObject
{
public:
Control() {}
/// sets the descriptions of the stages (implicilty sets the number of stages)
virtual void setStages(const opals::Vector<opals::String> &stageDescriptions) {
std::cout << "setStages:" << std::endl;
for(int i=0; i<stageDescriptions.size(); i++)
{
std::cout << "\tStage " << i+1 << ": " << stageDescriptions[i].c_str() << std::endl;
}
std::cout << std::endl;
};
/// sets the current stage
virtual void setCurrStage(unsigned currentStage) {
std::cout << "Start of stage " << currentStage+1 << ":";
}
/// sets the number of steps for the current stage (will be called after the stage was set)
virtual void setSteps(long long stepCount) {
std::cout << " running.";
OutputInterval = (long long)stepCount/30.;
MaxStep = stepCount;
LastInterval = 0;
}
/// sets the current step
virtual void setCurrStep(long long currStep) {
unsigned intval;
if (currStep >= MaxStep)
std::cout << "finished" << std::endl;
else if ( (intval = currStep/OutputInterval) > LastInterval)
{
LastInterval = intval;
std::cout << ".";
}
//if (currStep >= 11000)
// throw opals::Exceptions::UserInterrupt(); //throw the opals::Exceptions::UserInterrupt() to interrupt the module run
}
virtual void log(opals::LogLevel level, unsigned threadId, const char* message) {
if (level <= opals::LogLevel::warning) //only output warning and error messages
{
switch( level )
{
std::cout << "debug: ";
break;
std::cout << "verbose:";
break;
std::cout << "info :";
break;
std::cout << "warning:";
break;
std::cout << "error :";
break;
}
std::cout << " " << message << std::endl;
}
}
protected:
long long MaxStep;
long long OutputInterval;
unsigned LastInterval;
};
int main(int argc, char** argv)
{
int errorCode = 0;
try {
/// Create an opals Import object.
/** opals::IImport is an abstract base class that derives from opals::IModuleBase ,
like all other opals Module classes.
opals::IImport::New() returns a pointer to an instance of a complete type
that derives from opals::IImport, which is newly allocated on the heap.
This instance should hence be destroyed after use in order to avoid memory leaks,
using opals::IModuleBase::Delete()
Using std::shared_ptr< opals::IImport > together with opals::ModuleDeleter ,
this is done automatically when the std::shared_ptr goes out of scope. */
Control control;
//std::shared_ptr<opals::IImport> module( opals::IImport::New(), opals::ModuleDeleter() );
std::shared_ptr<opals::IImport> module( opals::IImport::New(control), opals::ModuleDeleter() );
//limit screen output to log level to error (because we do screen logging our selfs...)
module->commons().screenLogLevel().set(opals::LogLevel::error);
/// Set input parameter
/** opals::Vector< opals::Path > is a complete type, like all other opals parameter types/classes.
Hence, the instance is allocated on the stack and is deleted automatically
when going out of scope, without special precautions. */
inFile.push_back("..\\..\\demo\\G111.las");
module->opts().inFile().set(inFile);
//module->set_controlObject(control); //already set at module construction
/// run the module
module->run();
}
{
//in this case an error occurred, output error message
errorCode = e.errorCode();
std::cout << e.errorMessage() << std::endl;
}
return errorCode;
}

Kindly note that the compiler's search path for C++ header files must contain the directory $OPALS_ROOT/c++_api/inc/opals/ . Furthermore, the linker must be set to link to opals_base.lib and the import libraries of any used modules (opalsImport.lib in the above example), found in $OPALS_ROOT/c++_api/lib/. For the Microsoft C/C++ Compiler is only necessary to append $OPALS_ROOT/c++_api/lib/ as additional library directory. All necessary libraries are then automatically linked. For debug configurations, it is necessary to the set the preprocessor macro /c OPALS_NO_DEBUG, forcing release libraries being used (debug libraries of OPALS are not included within the distribution). To run such a program, the operating system must be told where to find the shared libraries, which are located in $OPALS_ROOT/opals/opals/ .

OPALS package scripts in Python

OPALS is organized in packages each of which covering a specific topic (quality documentation, georeferencing, filtering, etc.). The packages consist of multiple modules and arbitrary work flows can be constructed by combining the respective modules using scripts. As a basic principle, the OPALS distribution provides one main Python script for each package. Package scripts, generally, take a list of data sets and carry out one or more specified tasks using certain calculation parameters. In this section, the overall organization and application of the package scripts is explained in detail.

Preconditions

The package scripts are provided as Python modules. The recommended way to execute the OPALS package scripts is from the OPALS shell, which sets up all necessary environment variables. Furthermore, the shell provides the OPALS script execution command opals.bat that allows to call the provided package scripts from any working directory simply by name. Note that opals.bat may as well be called when using a correctly configured external Python installation.

Without opals.bat, like for any Python script, one generally needs to call python followed by the absolute or relative script file path, including its file extension.

To invoke an OPALS package script simply by name as above (without parent directory and file extension), without prefixing the call with python, and without prefixing the call with opals.bat ,

  • include the path to the Python package scripts ($OPALS_ROOT/opals/workflows) in the PATH environment variable,
  • include .PY in the PATHEXT environment variable,
  • associate the .py file extension with the proper file type using assoc .py=Python.File
  • redirect invocations of this file type as arguments to the proper Python executable, which may or may not be the one that comes with OPALS, e.g. ftype Python.File=C:\Path\to\python.exe "%1" %*

For further details please refer to the Installation manual.

General structure

As mentioned above, OPALS comes along with one main workflow script per package. This script can be seen as a central access point of execution. The workflow scripts are stored in the directory $OPALS_ROOT/opals/workflows and are named as the corresponding package (e.g. opalsQuality.py). The scripts are designed being called from the command line but also providing a convenient pythonic interfaces (for further details see Executing scripts within python).

Each main package script may consist (and invoke) sub-scripts (related to the package), which themselves can rely on basic scripts (available and useful for all scripts). Thus, the general script hierarchy is:

  • Package script: one per package serving as central access point for overall processing steps related to the respective package. Naming convention: opals<Package>.py
  • Package sub-scripts: covering a subdomain of the entire package scope. This is especially advantageous for large packages with different aspects (like quality control). Naming convention: <pkg><sub-script>.py, where <pkg> is a package acronym (e.g. qlt for opalsQuality)
  • Common scripts: scripts with a confined scope. Common scripts are not intended to be called by the end-user but rather as helper functions for the package (sub-)scripts.

The package opalsQuality, for instance, consists of the following scripts:

opalsQuality.py // overall package script
qltDSM.py // sub-script for derivation and visualization of DSMs
qltDensity.py // sub-script for derivation of point density maps
qltStripDiff.py // sub-script for derivation of strip difference maps
qltLSM.py // sub-script for derivation of strip difference vectors via LSM
_import.py // helper script for data import
_bounds.py // helper script for derivation of strip outline
_overlap.py // helper script for identification of overlapping strip pairs
_grid.py // helper script for derivation of surface models

Executing package scripts

Executing scripts from the command line

If the preconditions mentioned above are met, Python scripts can be executed from within a command prompt window just like shell scripts (bat files) or executables. Each package (sub-)script provides a help screen informing about the options offered by the respective script. E.g., to get general help about the opalsQuality script, type:

opals Quality -help

... if running the pre-configured OPALS shell, or else:

opalsQuality -help

The script responds with the following usage screen:

usage: opalsQuality.py [options]
Package script providing a complete processing chain for quality assessment of ALS flight strips.
optional arguments:
-mosaic 1|0|true|false|yes|no
create mosaics
-mask 1|0|true|false|yes|no
create grid masks (sigma0, eccentricity)
-gridMask Path raster file containing user-defined mask areas (e.g.
water)
-qgisProject Path path to qgis project file, if one should be written. A
QGIS installation is required.
Common Package/Script Options:
Settings concerning the general options for (package) scripts.
-infile [INFILE [INFILE ...]]
input text file, directory or strip file
-outfile OUTFILE output directory or settings file (.txt)
-temp TEMPDIR temporary directory
-cfg CFGFILE configuration file
-projectDir PROJECTDIR
project directory
-skipIfExists 1|0|true|false|yes|no
skip processing if result already exists
-gearman GEARMAN address and port of gearman job server
-subScripts SUBSCRIPTS
list of sub-scripts to be executed separated by commas
Type -help foo to get extended help on parameter 'foo' (long option name
without prefix '-', e.g. '-help input')

Since there are reasonable default values available for all options (except the input data), we can trigger a complete quality assessment for the data in the current working directory (f.i. $OPALS_ROOT/demo/) by typing:

opals Quality -i "strip??.laz"

This imports all LAS files (strip11[21][31].laz) into seperate ODMs, calculates surface models for each strip, derives relief maps (shaded and color coded), determines the strip boundaries and identifies the overlapping strip pairs. For the latter, strip difference models are calculated and color coded visualizations are derived (for each strip pair and as a mosaic). The precision of the relative strip orientation is analyzed via a Least-Squares-Matching (LSM) approach and finally, the ALS point density is checked and visualized. For details, please refer to the opalsQuality documentation.

Please note that if you are not running the OPALS shell and the PATHEXT environment variable does not contain .PY , it is nevertheless possible to run the package scripts from the command prompt. In this case, the Python interpreter must be invoked explicitly, with the package script as input parameter. From a command prompt within the demo directory, we can write:

python ..\opals\workflows\opalsQuality.py -i "strip??.laz"

However, it is strongly recommended to use the OPALS shell, as it is the safest way to start, both, OPALS command line modules and package scripts from within any working directory.

Executing scripts within python

When you want to integrate a package scripts into our own python script, it is recommended to use the corresponding python interfaces. In a correctly set up OPALS python environment you can directly import the package script as shown below ($OPALS_ROOT/demo/opalsQualityDemo.py):

# example on how to run an opals workflow package script from inside python
# import opalsQuality package script
from opals.workflows import opalsQuality
def test_run():
# instantiate package and set parameters
pkgQlt = opalsQuality.opalsQuality(infile = 'strip??.laz')
# it is also possible to set parameters explicitly
# pkgQlt = opalsQuality.opalsQuality()
# pkgQlt.infile = 'strip??.laz'
# run package script
pkgQlt.run()
if __name__ == "__main__":
# since the workflow scripts may use the python multiprocessing library, we need to
# place the actual run command in such an if statement. see the python multiprocessing
# docu (https://docs.python.org/3/library/multiprocessing.html) for further details.
test_run()

As for OPALS modules all script parameters can be set while instantiation or each parameter is explicitly set to the corresponding attribute of the object.

Standard parameters

Usually, package scripts take a list of data files (i.e. ALS strip data or data tiles), perform a series of processing tasks using a set of calculation parameters, produce some intermediate products, and finally create the desired results. Since this general processing sequence can be applied to all package scripts, there is a list of standard parameters common to all scripts (as it is for opals modules abbreviation for parameters can be used when calling form the command line). These are:

  • Input files (-infile): The list of files for which a specific processing task should be carried out is the basic input for package scripts. Such file (or file pair...) lists can be passed either as an ASCII text file (with extension .txt ) containing the (absolute or relative) file paths (one file per line). A hash character ('#') at the beginning of a line indicates that the file should be ignored. This allows full control over the data sets to use and can, e.g., be used to continue the processing after a fatal error has occurred or for splitting up data processing in multiple parts. However, sometimes it might be advantageous to process all files (of a certain type). Thus, the use of wild cards is also supported to select an entire set of data. Finally, the input parameter (-i) may also be specified repeatedly in order to define a specific list of data files (c.f. examples for details). If the input parameter is not specified, all strip files in the current directory are processed.
  • Output directory (-outfile): Usually, it is desirable to store all the final results in a specific directory. To a achieve this, a parameter for specifying the storage location on disc (-o) is provided. Thus, the results of different scripts can easily be condensed in on specific place. Default: current working directory + name_of_package sub-folder (e.g., .\opalsQuality)
  • Temporary directory (-temp): Whereas OPALS modules can be regarded as atomic units with no (or little) intermediate results, OPALS scripts, in contrast, are intended for sophisticated workflows, potentially with a series of intermediate results. The package scripts provide the parameter -t to give control to the user about, where the temporary results should be stored. Default: current working directory.
  • Configuration files (-cfg): On the one hand, scripts rely on data sets (input/output/intermediate), and on the other hand, on calculation parameters for processing the data. As a basic principle, calculation parameters are strictly provided via configuration files. Details about using cfg files can be found here.
  • Project directory (-projectDir): All of the parameters mentioned above, if not specified as absolute paths, are relative to the project directory. The default project directory is the current working directory, but can be changed via the -p parameter.
  • Sub-scripts (-subScripts): Top-level package scripts in general feature more than one sub-script (c.f. above). If only specific sub-scripts are to be performed, the names of the respective scripts seperated by commas can be specified. Default: all sub-scripts related to a package.
  • SkipIfExists (-skipIfExists): If set to True the processing of (intermediate) results is supressed, in case the requested output already exists. Please note, that this feature is strictly restricted to the existence of certain files and does not consider, e.g., a change of calculation prameters.

Additional commonly used script parameters are:

  • Mosaic (-mosaic): In general, package scripts provide workflows for a series of seperate datasets (strips, strip pairs, tiles...) and, optionally, for the entire block by calculating image mosaics. Mosaicking can be activated (1=True, default) or deactivated (0=False).
  • Mask (-mask): Some processing steps require a restriction to vaild data areas via masking (e.g. relative strip differences should be restricted to smooth and bare ground). The calculation of grid masks can be activated (1=True) or deactivated (0=False). Default: Activated (1=True)

Some packages (e.g. opalsQuality) provide a main package script which intensely uses sub-scripts. Above all, this is because the packages often feature many different aspects. In addition, it has always been good programming practice to split up larger tasks into smaller pieces. One good reason for doing so, is the clarity and readability of the code. Now, we are often facing the situation that what is an intermediate result for one script may be the basic input for the next. As an example, qltDSM generates surfaces models and its visualizations, and qltStripDiff relies on the surface models (but not on their visualizations). In short, the main package script may use and interact with multiple sub-scripts. Since each sub-script itself can be executed stand alone and, thus, has its own set of parameters (input files, output/temporary directory, cfg files, etc.) it is necessary for the main package script to provide a set of output directories (-o) and cfg files (-c). For this purpose, these parameters may be specified in the following way for the top-level package scripts:

  • Output directory (-outfile):
    • not specified: default output directory = <current workdir>\Package
    • specific directory: results of all sub-scripts are stored in that directory
    • settings file name: ASCII file, Syntax per line: sub-script=output dir for sub-script
  • Configuration files (-cfg):
    • not specified: default cfg files are used (c.f. next section)
    • specific directory: cfg files with standard names (e.g. qltDSM.cfg...) are expected in this directory
    • settings file name: ASCII file, Syntax per line: sub-script=cfg file for sub-script

Use of configuration files

For each script, a default configuration file is available with the same name as the script itself and the extension .cfg . These default configuration files are located in the directory $OPALS_ROOT/cfg/. However, since each project may require different parameters, user defined configuration files are supported via the script parameter -c. Thus, it is good practice to copy the default configuration files, store them either on a global location if you always want to use the same user-defined cfg files or on project related location in case you want to adapt the parameters for each individual project.

The configuration files should contain all relevant calculation parameters for all incorporated OPALS modules. As a representative example the contents of the file qltStripDiff.cfg are listed below:

screenLogLevel=info

[opalsGrid]
gridSize=1
interpolation=movingPlanes
neighbours=8
searchRadius=5
selMode=nearest
filter=Echo[last]

[opalsBounds]
boundsType=convexHull

[opalsOverlap]
minArea=5%

[opalsAlgebra]
limit=union

[opalsZColor]
oFormat=GTiff
scalePal=0.04
palFile=$OPALS_ROOT/addons/pal/differencePal.xml

Package scripts, however, may call the same module via more than 1 subscript. If module calls via different subscripts shall use different parameters, then the corresponding module name in the configuration file must be prefixed with the corresponding subscript names, separated by a dot, as e.g. in:

[qltDSM.opalsZColor]
oFormat=GTiff
palFile=$OPALS_ROOT\addons\pal\standardPal.xml
[qltDensity.opalsZColor]
palFile=$OPALS_ROOT\addons\pal\densityPal.xml

Please note, that input (and output) files should strictly be passed as script parameters and not via the cfg files in order to avoid confusion. In case of doubt, the parameters passed as script arguments or those which are hard-coded in the scripts take precedence over the configuration file parameters.

Setting script parameters using the cfg file

Since version 2.6.0. script parameter can be set in the cfg file using sections. This can be useful when a parameter value is long and it is desired to "declutter" the command line call. Another use case for this would be if multiple calls of the same script are performed, changing parameters while others stay the same for all calls. Those can be set in the cfg files. An example of this can be seen in the documentation of the opalsQuality package script and is also shown below, using the example of the mask and slopeLimits parameter.

#############################################################################
########################### common parameters ###############################
#############################################################################
[script.opalsQuality]
mask = sigmaz:r<0.1 sigma0:r<0.8 gridMask:!r
slopeLimits = 10 20 30

Debugging OPALS Python scripts

Although Python natively contain a simple graphically GUI providing rudimentary debugging possibilities, it is recommended to use one of the free Python IDEs described in the section below. Those are well suited for debugging and developing new scripts.

IDEs for Python and OPALS

In general any Python Integrated Development Environments (IDE) can be used for script development within OPALS. Basically, its only necessary to start the IDE from the opalsShell (when using the internal python interpreter), to secure that the correct environment is set. Additionally, the OPALS Python interpreter has to be configured and selected within the IDE. To simplify matter, OPALS provides two IDE Packages that are fully preconfigured to collaborate with OPALS without installation. PyScripter is already integerated in the standard OPALS Windows packages, because of its small size. This IDE is the ideal choice for simple script analysis and debugging tasks. For serious script development (and for Linux users) PyCharm is recommended since it's a full featured IDE which contains auto-completing, code inspection and advanced debugging features. Because of its size, it's not included in the standard packages, but provided as separate download package on the download page.

PyScripter (Windows only)

The Windows version of OPALS comes with PyScripter, a feature-rich but lightweight Python IDE (Alternatively, PyCharm can be used as high-end IDE available for Windows and Linux). Assuming that you have not configured your OPALS installation to use an external Python interpreter (see Installation Guide), you can start PyScripter by double-clicking on $OPALS_ROOT/addons/PyScripter/startPyScripter.bat. As mentioned in OPALS modules as Python modules, PyScripter offers an enhanced interactive Python shell. Among other things, PyScripter additionally allows for debugging Python scripts, and for organizing Python scripts in projects (file extension *.psproj). For each OPALS package script, there exists a PyScripter project file in $OPALS_ROOT/opals/workflows. Having started PyScripter, such a project file may be loaded via the menu Project, entry Open Project.... The Project Explorer on the left of the GUI then shows all Python scripts that belong to that project (node Files), which may be loaded into the editor by double-clicking on them. Furthermore, Project Explorer shows all configurations to execute those scripts (node Run Configurations). Right-clicking on one of those configurations and selecting Run executes the respective script with the according options.

To debug an OPALS package script, load the according PyScripter project file into PyScripter, and load the package script into the editor (by double-clicking on it in Project Explorer). At the position where you want the execution of the script to halt set a break point by clicking on one of the small, blue dots on the left side of the editor window, turning that blue dot into a red disk. Then, right-click on the configuration you want to run in Project Explorer, and select Debug. Now the according script is executed until it encounters the first break point.

Debugging package opalsQuality using PyScripter

You may proceed with execution step-by-step, using options found in menu Run:

  • Resume execution to next break point
  • Run to Cursor resumes execution up to the current position of the cursor
  • Step Into function calls
  • Step Over function calls
  • Step Out of function calls

Whenever execution is halted, PyScripter allows for inspecting the currently defined variable names, their types and values:

Debugging with PyScripter: Variables inspection

PyCharm (Windows and Linux)

Whereas Windows users only need to select the PyCharm installation option during setup, Linux users can get a pre-configured package of the free PyCharm Community Edition from the download page that needs to extracted into the $OPALS_ROOT directory by the following command

tar -xzf opals_2.6.0-pycharm_2018.1.3.tar.gz

Compared to the original installation package the provided package

  • has full auto-completing support for OPALS modules (relevant skeleton generator sources have been adapted)
  • contains a clickable start scripts ($OPALS_ROOT/addons/PyCharm/startPyCharm[.bat|.sh]) that runs PyCharm in the opals environment
  • incorporates with the OPALS directory structure

Details on PyCharm and its feature can be found here

First time start of PyCharm

Under Linux double-clicking on $OPALS_ROOT/addons/PyCharm/setupPyCharm.sh to once perform necessary setup steps (not needed under Windows since this is done by the setup). Then double-clicking on $OPALS_ROOT/addons/PyCharm/startPyCharm[.bat|.sh] to actually start PyCharm. The script does a few precondition checks and initializes the OPALS PyCharm environment. If an error occurs, the script stops with a corresponding error message (re-running $OPALS_ROOT/addons/PyCharm/setupPyCharm[.bat|.sh] with Administrator/root privileges may resolve the problem). Otherwise PyCharm is automatically started.

After the PyCharm splash screen you will need to accept the PyCharm Community Edition Privacy Policy Agreement to get to an initial configuration dialog.

Initial Configuration Dialog of PyCharm

Select your preferred setting (can be changed later as well) to get to the actual start-up window. Select 'Open' and choose the $OPALS_ROOT directory. The directory should show up with a slightly different directory icon, indicating that it contains a PyCharm project (The actual project files are stored in the $OPALS_ROOT/.idea/ directory. After the project is fully loaded PyCharm updates its indices and builds skeleton files in a background process which may take a minute or two (see status bar of the PyCharm window).

PyCharm builds indices and skeleton files

Although you can fully interact with PyCharm while the background process is running, the auto-completing feature will not fully work before the skeleton creation is finished.

Working with PyCharm

PyCharm stores all settings in the user directory. Hence, it is possible to configure PyCharm differently for different users on the same computer. You will also recognise that PyCharm automatically restores the last project at start-up.

Once the opals project is loaded you should see a similar window as shown below. Left to the Editor Window, the Project Tool Window is displayed. There, if unfolded, the OPALS directory tree and all python files are visible. PyCharm supports multiple configurations for debugging and running python scripts. On the right side of the Navigation Bar (between Menu and Editor) existing configurations can be selected and administrated (by choosing Edit Configurations...). As shown below, activate the opalsQuality: OPALS quality package script configuration to testwise debug the opalsQuality script.

Selection Configuration for Debugging in PyCharm

Click on the bug icon next to the configuration selection for debugging the opalsQuality script. The script is executed in the $OPALS_ROOT/demo/ directory with the parameters -i G*.las set (Menu Run/Edit Configurations... also allows checking the configuration setting). PyCharm should stop the execution at the pre-defined break point as shown below. Break points can easily be set/unset by clicking in the grey bar left of the Editor Window, using the Menu Run/Toggle Line Breakpoint or the corresponding shortcut key (visible right of the corresponding menu entry).

You may proceed with execution step-by-step, using options found in menu Run:

  • Resume Program execution to next break point
  • Run to Cursor resumes execution up to the current position of the cursor
  • Step Into function calls
  • Step Over function calls
  • Step Out of function calls

Whenever execution is halted, PyCharm shows currently defined variable names, their types and values in the Variables Window (centre window of the debug docker tool window). Additionally, the Watches Window allows further inspecting specific elements, attributes and functions of variables and objects. Left of the Variables Window, PyCharm displays the call stack in the Frames Window.

Debugging with PyCharm: Variables inspection

Auto-completing within PyCharm

Auto-completing increases the development efficiency specially when programming with unknown (or not well-known) tools or libraries. There, the code editor suggests possible inputs based on the current context. For testing this feature, create a new python file by right clicking on the opals/demo directory and selecting the corresponding context menu entries as shown below. Call the new file e.g. test.py.

Create a new python file in PyCharm

While typing from opals import I... PyCharm automatically shows a context menus with possible input values. If e.g. an Import module is instantiated (imp = Import.Import()), PyCharm suggest possible member functions if the variable imp is used. PyCharm not only displays member function of the current class (Import) but also of it base classes (Base) as it can be seen below.

Auto-completing example 1 in PyCharm
Auto-completing example 2 in PyCharm

PyCharm uses static code analysis for error detection and auto-completing. For completeness it is mentioned that python is a type-unsafe programming language, PyCharm does not always know the type of variable and hence, auto-completing doesn't work correctly in all cases.

Another useful feature is to directly jump to the definition of a selected code. Test this feature, by typing imp.inFile, marking inFile and pressing <F12> (or <Ctrl+B>). PyCharm will jump (Menu Navigate) to the declaration in the corresponding skeleton file of Module Import, as shown below. There, you will find the function documentation and the expect parameters and types. You can also <Ctrl+Left Click> on the word inFile

Auto-completing example 2 in PyCharm

This was just an excerpt of useful PyCharm features. Please refer to the comprehensive PyCharm help for further details.

Author
GM, WK, JO
Date
01.07.2016
@ nbThreads
number of concurrent threads
opalsZColor is the executable file of Module ZColor
Definition: ModuleExecutables.hpp:253
@ debug
Anything that may help 'debugging' by means of a release-build.
@ gridMask
grid mask file
opalsBounds is the executable file of Module Bounds
Definition: ModuleExecutables.hpp:18
const char * errorMessage() const
returns the actual error message
IGroup< Names::_, IValue< Names::inFile, false, Path >, IValue< Names::attribute, false, String >, IValue< Names::gridFile, false, Vector< RasterFile > >, IValue< Names::tilePointCount, false, unsigned >, IValue< Names::neighbours, false, int >, IValue< Names::searchRadius, false, Vector< float > >, IValue< Names::selMode, false, SelectionMode >, IValue< Names::searchMode, false, SearchMode >, IValue< Names::filter, false, Vector< String > >, IValue< Names::resampling, false, ResamplingMethod > > Options
Options of Module AddInfo.
Definition: c++_api/inc/opals/IAddInfo.hpp:40
@ file
generic file parameter (opalsInfo)
ErrorCode::Enum errorCode() const
returns the error code
virtual void log(LogLevel level, unsigned threadId, const char *message)=0
retrieve log message
@ sigma0
sigma 0 of grid post adjustment (i.e. std.dev. of the unit weight observation)
Definition: GridFeature.hpp:12
A functor that may be used for memory management of modules allocated on the heap usage: e....
Definition: ModuleDeleter.hpp:15
virtual void setSteps(long long stepCount)=0
sets the number of steps for the current stage (will be called after the stage was set) This function...
LogLevel
Enumerator defining different importance levels of log records.
Definition: LogLevel.hpp:8
opalsDiff is the executable file of Module Diff
Definition: ModuleExecutables.hpp:53
opalsAlgebra is the executable file of Module Algebra
Definition: ModuleExecutables.hpp:13
opalsImport is the executable file of Module Import
Definition: ModuleExecutables.hpp:113
@ error
Some failure that cannot be handled. Program execution is aborted.
virtual void setStages(const Vector< String > &stageDescriptions)=0
sets the descriptions of the stages (implicitly sets the number of stages. it will be set within the ...
@ warning
Some weird program state which can still be handled (e.g. 'poor matrix condition',...
@ options
query the options of this module; optionally, finalize input before
@ strip
strip ID of an image (opalsStripAdjust)
@ parameter
a general parameter error (usually comes from boost::program_options)
Definition: c++_api/inc/opals/Exception.hpp:60
@ temp
File path to replace $OPALS_TEMP with.
@ scalePal
scale factor to be applied to the given palette (opalsZcolor)
virtual void setCurrStage(unsigned currentStage)=0
sets the current stage (zero based index) In rare situations no step information will be available fo...
This is the fake namespace of all opals Python scripts.
Definition: __init__.py:1
Contains the public interface of OPALS.
Definition: AbsValueOrQuantile.hpp:8
@ verbose
Something that may help in understanding normal program behaviour.
opalsOverlap is the executable file of Module Overlap
Definition: ModuleExecutables.hpp:158
opalsGrid is the executable file of Module Grid
Definition: ModuleExecutables.hpp:93
@ directory
Temporary / intermediate data directory path (opalsStripAdjust)
@ odm
OPALS Datamanager file.
Interface for retrieving status and progress information from a module run.
Definition: c++_api/inc/opals/IControlObject.hpp:30
@ nClasses
number of different classes (e.g. opalsZColor)
@ sigmaz
sigma z of grid post adjustment (i.e. std.dev. of the interpolated height)
Definition: GridFeature.hpp:11
@ info
Some progress that may be interesting in everyday-use.
Mimics std::vector<T>
Definition: fwd.hpp:18
virtual void setCurrStep(long long currStep)=0
sets the current step It is internally secured that for the last setCurrStep call currStep >= stepCou...
@ grid
Grid data file.
@ screenLogLevel
verbosity level of screen output
@ strips
strip group (opalsStripAdjust)
@ help
print this usage screen: help on parameters specific to this module
@ palFile
palette file (opalsZcolor)
The base class of all exceptions thrown by opals.
Definition: c++_api/inc/opals/Exception.hpp:170
@ global
LSM for entire overlap area.