Difference between revisions of "Seon Directory Scanner"
|  (→Configuration of scanning tasks) |  (→external links) | ||
| (38 intermediate revisions by the same user not shown) | |||
| Line 1: | Line 1: | ||
| − | |||
| − | |||
| == What is the Seon Directory Scanner? == | == What is the Seon Directory Scanner? == | ||
| The goal of the directory scanner is to scan configured directories (without recursion) for new files (older than 60 seconds) and apply a matching pattern on them. If the pattern matches, the file will be moved to the [[Seon_Core_configuration#data_outgoing_directory|configured outgoing directory]] and an executable will be started with parameters defined for this directory scanner entry, based on either fix values or dynamic ones. | The goal of the directory scanner is to scan configured directories (without recursion) for new files (older than 60 seconds) and apply a matching pattern on them. If the pattern matches, the file will be moved to the [[Seon_Core_configuration#data_outgoing_directory|configured outgoing directory]] and an executable will be started with parameters defined for this directory scanner entry, based on either fix values or dynamic ones. | ||
| Line 22: | Line 20: | ||
| You can click on "New" or the empty paper icon to create a new entry. In order to edit an entry, click on the edit icon. | You can click on "New" or the empty paper icon to create a new entry. In order to edit an entry, click on the edit icon. | ||
| + | The following screenshot shows the edit page of an existing directory scanner entry: | ||
| + | |||
| + | [[Image:Dirscanner scantest.png]] | ||
| === Name === | === Name === | ||
| + | The name of the directory scanner entry can be a human-interpretable textual string which will only occur in the logs.  | ||
| + | === Directory === | ||
| + | The directory on which the directory scanner works on. Remember that only that directory without subdirectories will be scanned. The configured outgoing directory cannot be configured since the files will be moved into that directory before executing the command for a found file. | ||
| − | |||
| === Regular expression === | === Regular expression === | ||
| − | === Configuration values === | + | The file name found in the configured directory must match this [http://en.wikipedia.org/wiki/Regular_Expression regular expression]. Regular expressions are quite complex but very powerful. The name of the found file must result into a true value (which means that any output of the regular expression is valid but not the empty). The engine compiling these regular expression values is always PCRE, which implements Perl-style regular expressions, which are widely used across different systems. | 
| − | ==== fix values ==== | + | |
| − | ==== variable values ==== | + | If the regular expression is not correct, the directory scanner will identify this situation, add a log entry to the system log and disable this configured directory scanner configuration. | 
| − | === Execution === | + | |
| + | === Age of entry === | ||
| + | With the given age, only entities (files or directories; depending on your configuration) are taken which are older than this amount of seconds. The minimum age of entries must be at least the value of the [[Seon_Core_configuration#time_slice_for_send_queue_daemon|send queue daemon timeslice value]]. | ||
| + | |||
| + | === Type selection === | ||
| + | If you configured and licensed to [[Seon_Core_configuration#is_Seon_Enterprise_installed.3F|use Seon Enterprise]], you have the option to | ||
| + | *Scan for files, handled for Seon Core enqueueing | ||
| + | *Scan for directories or files to be used for Seon Enterprise job creation | ||
| + | |||
| + | Depending on your choice, you're getting a different configuration view: | ||
| + | |||
| + | ==== Seon Core ==== | ||
| + | |||
| + | ===== Configuration values types ===== | ||
| + | A found file matching the configured regular expression leads to a number of paraeters which are then passed to the executable for later using them. There are two types of configuration values you can use for every single configuration parameter: | ||
| + | |||
| + | ===== fix values ===== | ||
| + | The easiest way to use a configuration value is to pre-set it with a fix value. This is mostly a good decision if i.e. the directory is partner-based and the configuration of the communication partner is fix (due to its nature of residence in that configured directory). | ||
| + | |||
| + | ===== variable values ===== | ||
| + | Another way to extract a configuration value is based on the found file. The found filename (without path) will be passed to the configured regular expression, where the '''first''' variable definition, which are normally enclosed by round brackets: '<code>(</code>' and '<code>)</code>'. Subsequent variable extractions will be ignored. If no variable value is extractable by the configured regular expression on the given file, an empty string is used as parameter value. | ||
| + | |||
| + | ===== "matching pattern activates functionality" configuration values ===== | ||
| + | There exist parameters which are being activated if the returned value is non-empty. So even a zero ("<code>0</code>") activates the functionality. Be sure to enable a functionality only by configuring values, ignoring their interpretation. | ||
| + | |||
| + | ===== Configuration values ===== | ||
| + | These fixed parameters are available which are then passed to the configured executable below: | ||
| + | |||
| + | ====== Partner selection ====== | ||
| + | This parameter defines normally a partner shortname. Used by the enqueueing script. | ||
| + | |||
| + | ====== Virtual filename selection ====== | ||
| + | Since the file has a separate name on the filesystem and during transport (and lateron at partner's receive side), you have to define a virtual filename. | ||
| + | |||
| + | ====== Comment selection ====== | ||
| + | This comment will be put into the comment field of the enqueued file when using the standard enqueueing process. | ||
| + | |||
| + | ====== Originator SFID / Destination SFID ====== | ||
| + | For a separate sender's and receiver's SFID extraction, this value defines with which SFID the file will be sent. Leave empty if you want to use the partner's default configuration. | ||
| + | |||
| + | ====== Passive switch selection ====== | ||
| + | If the found file should be enqueued passively, the value of this configuration parameter should be not-empty. ([[Seon_Core_binaries#input_parameters|see "<code>seoneq</code>", parameter "<code>-P</code>"]]). | ||
| + | |||
| + | ====== Binary transfer mode selection ====== | ||
| + | If the default transfer mode of "binary" should be used instead of fixed or variable record length, this parameter activates this functionality if an non-empty value is returned. | ||
| + | |||
| + | ====== Fixed record length transfer mode selection ====== | ||
| + | If the default transfer mode should be overridden and "fixed record length" files should be transfered, a non-empty return value activates this functionality. | ||
| + | |||
| + | ====== Variable record length transfer mode selection ====== | ||
| + | If the default transfer mode should be overridden and "variable record length" files should be transfered, a non-empty return value activates this functionality. | ||
| + | |||
| + | ====== record length selection ====== | ||
| + | If a non-binary transfer mode is used for the found file, you have to define which record length is being used (max.: 2048). This value is ignored in binary transfer mode. | ||
| + | |||
| + | ====== Execution ====== | ||
| + | As stated before, a found file matching the regular expression pattern has got a number of configuration values. These parameters are passed to an executable, which has the task to handle these input parameters. You can insert ''any'' executable you want, you may want to script your own ones or use a preset included in the standard installation. | ||
| + | |||
| + | The preset is: | ||
| + | *"Seon Core enqueueing" (<code>dirscanner_seoneq.sh</code>): This script parses all parameters correctly to enqueue the found file to the Seon send queue with the given parameters. | ||
| + | |||
| + | The presets are not fix, you may insert any executable you want. | ||
| + | |||
| + | ==== Seon Enterprise send job ==== | ||
| + | If you want to create Seon send jobs automatically via the directory scanner, you can switch to "Seon Enterprise send job" mode. | ||
| + | |||
| + | For every single send job created by the directory scanner, a new directory will be created in the [[Seon_Core_configuration#data_outgoing_directory|configured outgoing directory]] with a [[Seon_Core_configuration#Path_for_jobs_of_directory_scanner|configured name prefix]], appended by the dynamic job number. | ||
| + | |||
| + | ===== Scan for ===== | ||
| + | You have the possibility to create Seon send jobs from single files (which match your regular expression configured above) or directories (which are scanned for files within and in subdirectories). Every single file will be moved into the created outgoing send job directory, the original directory will be removed. | ||
| + | |||
| + | ===== Sender selection ===== | ||
| + | The sender of the job will be defined here. Only jobs with a valid (non-deleted) sender are created. If the sender is not valid, the directory scanner entry will be deactivated dynamically by the directory scanner. | ||
| + | |||
| + | ===== Recipient selection ===== | ||
| + | The recipient of the job will be defined here. Only jobs with a valid (non-deleted) recipient are created. If the recipient is not valid, the directory scanner entry will be deactivated dynamically by the directory scanner. | ||
| + | |||
| + | Remember that the plugin group for send jobs of the recipient of the job will be executed, which can be configured at user, department, location or company level. | ||
| + | |||
| + | ===== Job comment ===== | ||
| + | An optional job comment can be added to the send job. | ||
| + | |||
| + | ==== Seon Enterprise receive job ==== | ||
| + | If you want to create Seon receive jobs automatically via the directory scanner, you can switch to "Seon Enterprise receive job" mode. | ||
| + | |||
| + | For every single receive job created by the directory scanner, a new directory will be created in the [[Seon_Core_configuration#data_outgoing_directory|configured outgoing directory]] with a [[Seon_Core_configuration#Path_for_jobs_of_directory_scanner|configured name prefix]], appended by the dynamic job number. | ||
| + | |||
| + | ===== Scan for ===== | ||
| + | You have the possibility to create Seon receive jobs from single files (which match your regular expression configured above) or directories (which are scanned for files within and in subdirectories). Every single file will be moved into the created outgoing receive job directory, the original directory will be removed. | ||
| + | |||
| + | ===== Sender selection ===== | ||
| + | The sender of the job will be defined here. Only jobs with a valid (non-deleted) sender are created. If the sender is not valid, the directory scanner entry will be deactivated dynamically by the directory scanner. | ||
| + | |||
| + | ===== Recipient selection ===== | ||
| + | The recipient of the job will be defined here. Only jobs with a valid (non-deleted) recipient are created. If the recipient is not valid, the directory scanner entry will be deactivated dynamically by the directory scanner. | ||
| + | |||
| + | Remember that the plugin group for receive jobs of the recipient of the job will be executed, which can be configured at user, department, location or company level. | ||
| + | |||
| + | ===== Job comment ===== | ||
| + | An optional job comment can be added to the send job. | ||
| === Sorting order === | === Sorting order === | ||
| + | The Seon Directory Scanner functionality scans directories in the order of the configuration, so you have to keep in mind that top-level entries will be scanned first. You may want to configure the same directory with different regular expressions for file name mathing in order to minimize the amount of scanned directories but increase the complexity of file names. Since regular expressions may match in both cases, you can keep your entries in order by clicking the icons on the right-hand of the directory list (up and down). | ||
| + | |||
| + | [[Image:Dirscanner movedown.png]] | ||
| + | |||
| === Enable / disable entry === | === Enable / disable entry === | ||
| + | In order to test entries (or if anything is not configured correctly Seon disables entries, too), you have the possibility to en- and disable directory scanner entries. Disabled entries are not used by the directory scanner, they are displayed as a grey line. To disable, click on the third icon on the left hand entitled as "<code>deactivate directory entry '...'</code>": | ||
| + | |||
| + | [[Image:Dirscanner disable.png]] | ||
| + | |||
| + | To enable an entry, click on the icon "<code>activate directory entry '...'</code>": | ||
| + | |||
| + | [[Image:Dirscanner enable.png]] | ||
| + | |||
| === Preview / Dry-run === | === Preview / Dry-run === | ||
| + | A preview shows you via web interface what would happen if the directory scanner whould start to work on the selected entry. Click on the icon "<code>dry-run directory entry '...' for verification</code>" to start the process: | ||
| + | |||
| + | [[Image:Dirscanner start dryrun.png]] | ||
| + | |||
| + | The new opening windows shows you what would happen: | ||
| + | |||
| + | [[Image:Dirscanner dryrun.png]] | ||
| + | |||
| + | === Clone entry === | ||
| + | For faster configuration, a directory scanner entry can be cloned. Every configuration parameter is being copied into the cloned entry, a new name has to be given to the entry. Click on the icon "<code>clone directory entry '...'</code>" to clone an entry: | ||
| + | |||
| + | [[Image:Dirscanner clone.png]]  | ||
| + | |||
| + | Then configure a new name for the cloned entry: | ||
| + | |||
| + | [[Image:Dirscanner clone2.png]] | ||
| + | |||
| === Delete entry === | === Delete entry === | ||
| + | In order to delete a directory scanner entry, click on the trash icon entitled with "<code>delete directory entry '...'</code>": | ||
| + | |||
| + | [[Image:Dirscanner delete.png]] | ||
| + | |||
| + | Then confirm the deletion: | ||
| + | |||
| + | [[Image:Dirscanner delete2.png]] | ||
| == Logging == | == Logging == | ||
| + | Logging will be done in general for the following items: | ||
| + | *a regular expression is not valid | ||
| + | *a directory is not accessable | ||
| + | *a file which should be moved by the directory scanner is not movable (which includes a inner-filesystem and outer-filesystem file movement) | ||
| + | |||
| + | In addition, logging is [[Seon_Core_configuration#Enable_directory_scanner_logging.3F|globally configurable]] for every found file and execution of the command via a configuration parameter ("Configuration" -> "Logging" -> "[[Seon_Core_configuration#Enable_directory_scanner_logging.3F|Enable directory scanner logging?]]"). If this configuration parameter is enabled, every single file which has been found by the directory scanner and which succeeds the configured regular expression will be logged, including the time and date, the script, all parameters, returncode of the script and the its output. Log vault functionality is given here, too. | ||
| + | |||
| + | == often used regular expressions == | ||
| + | *Everything (any file): | ||
| + |  .* | ||
| == external links == | == external links == | ||
| + | Since Regular Expressions are not everybody's best friend, some handy tools are available online for testing and verifying regular expressions. Some are listed here, but they may be offline from time to time. Use your favorite search engine to look for tools helping with regular expressions. | ||
| + | |||
| + | *[https://regex101.com https://regex101.com] | ||
| + | *[http://regexpal.com/ http://regexpal.com/] | ||
| + | *[http://www.fileformat.info/tool/regex.htm http://www.fileformat.info/tool/regex.htm] | ||
| + | *RegExecution iOS App: [http://itunes.apple.com/de/app/regexecution/id380985639?mt=8 http://itunes.apple.com/de/app/regexecution/id380985639?mt=8] | ||
Latest revision as of 18:46, 29 August 2015
Contents
- 1 What is the Seon Directory Scanner?
- 2 Configuration of scanning tasks
- 2.1 Menu entry
- 2.2 Name
- 2.3 Directory
- 2.4 Regular expression
- 2.5 Age of entry
- 2.6 Type selection
- 2.6.1 Seon Core
- 2.6.1.1 Configuration values types
- 2.6.1.2 fix values
- 2.6.1.3 variable values
- 2.6.1.4 "matching pattern activates functionality" configuration values
- 2.6.1.5 Configuration values
- 2.6.1.5.1 Partner selection
- 2.6.1.5.2 Virtual filename selection
- 2.6.1.5.3 Comment selection
- 2.6.1.5.4 Originator SFID / Destination SFID
- 2.6.1.5.5 Passive switch selection
- 2.6.1.5.6 Binary transfer mode selection
- 2.6.1.5.7 Fixed record length transfer mode selection
- 2.6.1.5.8 Variable record length transfer mode selection
- 2.6.1.5.9 record length selection
- 2.6.1.5.10 Execution
 
 
- 2.6.2 Seon Enterprise send job
- 2.6.3 Seon Enterprise receive job
 
- 2.6.1 Seon Core
- 2.7 Sorting order
- 2.8 Enable / disable entry
- 2.9 Preview / Dry-run
- 2.10 Clone entry
- 2.11 Delete entry
 
- 3 Logging
- 4 often used regular expressions
- 5 external links
What is the Seon Directory Scanner?
The goal of the directory scanner is to scan configured directories (without recursion) for new files (older than 60 seconds) and apply a matching pattern on them. If the pattern matches, the file will be moved to the configured outgoing directory and an executable will be started with parameters defined for this directory scanner entry, based on either fix values or dynamic ones.
The Seon Directory Scanner is available since Seon 3 in Seon 3 Core.
Configuration of scanning tasks
Using the directory scanner needs some configuration via web interface and optionally in addition on the filesystem (if you really want to modify the behaviour more deeply).
Menu entry
The menu entry "Dir.scanner" in the administrative web interface exists if the binary
seon_ds_dryrun
exists in the installation directory for binaries of Seon.
Clicking on that links shows you the actually configured directory scanner entries, with an empty view in the default installation.
You can click on "New" or the empty paper icon to create a new entry. In order to edit an entry, click on the edit icon.
The following screenshot shows the edit page of an existing directory scanner entry:
Name
The name of the directory scanner entry can be a human-interpretable textual string which will only occur in the logs.
Directory
The directory on which the directory scanner works on. Remember that only that directory without subdirectories will be scanned. The configured outgoing directory cannot be configured since the files will be moved into that directory before executing the command for a found file.
Regular expression
The file name found in the configured directory must match this regular expression. Regular expressions are quite complex but very powerful. The name of the found file must result into a true value (which means that any output of the regular expression is valid but not the empty). The engine compiling these regular expression values is always PCRE, which implements Perl-style regular expressions, which are widely used across different systems.
If the regular expression is not correct, the directory scanner will identify this situation, add a log entry to the system log and disable this configured directory scanner configuration.
Age of entry
With the given age, only entities (files or directories; depending on your configuration) are taken which are older than this amount of seconds. The minimum age of entries must be at least the value of the send queue daemon timeslice value.
Type selection
If you configured and licensed to use Seon Enterprise, you have the option to
- Scan for files, handled for Seon Core enqueueing
- Scan for directories or files to be used for Seon Enterprise job creation
Depending on your choice, you're getting a different configuration view:
Seon Core
Configuration values types
A found file matching the configured regular expression leads to a number of paraeters which are then passed to the executable for later using them. There are two types of configuration values you can use for every single configuration parameter:
fix values
The easiest way to use a configuration value is to pre-set it with a fix value. This is mostly a good decision if i.e. the directory is partner-based and the configuration of the communication partner is fix (due to its nature of residence in that configured directory).
variable values
Another way to extract a configuration value is based on the found file. The found filename (without path) will be passed to the configured regular expression, where the first variable definition, which are normally enclosed by round brackets: '(' and ')'. Subsequent variable extractions will be ignored. If no variable value is extractable by the configured regular expression on the given file, an empty string is used as parameter value.
"matching pattern activates functionality" configuration values
There exist parameters which are being activated if the returned value is non-empty. So even a zero ("0") activates the functionality. Be sure to enable a functionality only by configuring values, ignoring their interpretation.
Configuration values
These fixed parameters are available which are then passed to the configured executable below:
Partner selection
This parameter defines normally a partner shortname. Used by the enqueueing script.
Virtual filename selection
Since the file has a separate name on the filesystem and during transport (and lateron at partner's receive side), you have to define a virtual filename.
Comment selection
This comment will be put into the comment field of the enqueued file when using the standard enqueueing process.
Originator SFID / Destination SFID
For a separate sender's and receiver's SFID extraction, this value defines with which SFID the file will be sent. Leave empty if you want to use the partner's default configuration.
Passive switch selection
If the found file should be enqueued passively, the value of this configuration parameter should be not-empty. (see "seoneq", parameter "-P").
Binary transfer mode selection
If the default transfer mode of "binary" should be used instead of fixed or variable record length, this parameter activates this functionality if an non-empty value is returned.
Fixed record length transfer mode selection
If the default transfer mode should be overridden and "fixed record length" files should be transfered, a non-empty return value activates this functionality.
Variable record length transfer mode selection
If the default transfer mode should be overridden and "variable record length" files should be transfered, a non-empty return value activates this functionality.
record length selection
If a non-binary transfer mode is used for the found file, you have to define which record length is being used (max.: 2048). This value is ignored in binary transfer mode.
Execution
As stated before, a found file matching the regular expression pattern has got a number of configuration values. These parameters are passed to an executable, which has the task to handle these input parameters. You can insert any executable you want, you may want to script your own ones or use a preset included in the standard installation.
The preset is:
- "Seon Core enqueueing" (dirscanner_seoneq.sh): This script parses all parameters correctly to enqueue the found file to the Seon send queue with the given parameters.
The presets are not fix, you may insert any executable you want.
Seon Enterprise send job
If you want to create Seon send jobs automatically via the directory scanner, you can switch to "Seon Enterprise send job" mode.
For every single send job created by the directory scanner, a new directory will be created in the configured outgoing directory with a configured name prefix, appended by the dynamic job number.
Scan for
You have the possibility to create Seon send jobs from single files (which match your regular expression configured above) or directories (which are scanned for files within and in subdirectories). Every single file will be moved into the created outgoing send job directory, the original directory will be removed.
Sender selection
The sender of the job will be defined here. Only jobs with a valid (non-deleted) sender are created. If the sender is not valid, the directory scanner entry will be deactivated dynamically by the directory scanner.
Recipient selection
The recipient of the job will be defined here. Only jobs with a valid (non-deleted) recipient are created. If the recipient is not valid, the directory scanner entry will be deactivated dynamically by the directory scanner.
Remember that the plugin group for send jobs of the recipient of the job will be executed, which can be configured at user, department, location or company level.
Job comment
An optional job comment can be added to the send job.
Seon Enterprise receive job
If you want to create Seon receive jobs automatically via the directory scanner, you can switch to "Seon Enterprise receive job" mode.
For every single receive job created by the directory scanner, a new directory will be created in the configured outgoing directory with a configured name prefix, appended by the dynamic job number.
Scan for
You have the possibility to create Seon receive jobs from single files (which match your regular expression configured above) or directories (which are scanned for files within and in subdirectories). Every single file will be moved into the created outgoing receive job directory, the original directory will be removed.
Sender selection
The sender of the job will be defined here. Only jobs with a valid (non-deleted) sender are created. If the sender is not valid, the directory scanner entry will be deactivated dynamically by the directory scanner.
Recipient selection
The recipient of the job will be defined here. Only jobs with a valid (non-deleted) recipient are created. If the recipient is not valid, the directory scanner entry will be deactivated dynamically by the directory scanner.
Remember that the plugin group for receive jobs of the recipient of the job will be executed, which can be configured at user, department, location or company level.
Job comment
An optional job comment can be added to the send job.
Sorting order
The Seon Directory Scanner functionality scans directories in the order of the configuration, so you have to keep in mind that top-level entries will be scanned first. You may want to configure the same directory with different regular expressions for file name mathing in order to minimize the amount of scanned directories but increase the complexity of file names. Since regular expressions may match in both cases, you can keep your entries in order by clicking the icons on the right-hand of the directory list (up and down).
Enable / disable entry
In order to test entries (or if anything is not configured correctly Seon disables entries, too), you have the possibility to en- and disable directory scanner entries. Disabled entries are not used by the directory scanner, they are displayed as a grey line. To disable, click on the third icon on the left hand entitled as "deactivate directory entry '...'":
To enable an entry, click on the icon "activate directory entry '...'":
Preview / Dry-run
A preview shows you via web interface what would happen if the directory scanner whould start to work on the selected entry. Click on the icon "dry-run directory entry '...' for verification" to start the process:
The new opening windows shows you what would happen:
Clone entry
For faster configuration, a directory scanner entry can be cloned. Every configuration parameter is being copied into the cloned entry, a new name has to be given to the entry. Click on the icon "clone directory entry '...'" to clone an entry:
Then configure a new name for the cloned entry:
Delete entry
In order to delete a directory scanner entry, click on the trash icon entitled with "delete directory entry '...'":
Then confirm the deletion:
Logging
Logging will be done in general for the following items:
- a regular expression is not valid
- a directory is not accessable
- a file which should be moved by the directory scanner is not movable (which includes a inner-filesystem and outer-filesystem file movement)
In addition, logging is globally configurable for every found file and execution of the command via a configuration parameter ("Configuration" -> "Logging" -> "Enable directory scanner logging?"). If this configuration parameter is enabled, every single file which has been found by the directory scanner and which succeeds the configured regular expression will be logged, including the time and date, the script, all parameters, returncode of the script and the its output. Log vault functionality is given here, too.
often used regular expressions
- Everything (any file):
.*
external links
Since Regular Expressions are not everybody's best friend, some handy tools are available online for testing and verifying regular expressions. Some are listed here, but they may be offline from time to time. Use your favorite search engine to look for tools helping with regular expressions.













