Merge Data Streams |
The Merge Data Streams processor allows you to merge a number of input data streams into a single stream, by mapping each input data stream to a target structure.
Merge Data Streams does not perform any transformation, matching, or merging of records. All input records are output, mapped to the target structure.
Use Merge Data Streams where you have a number of sources of data that all represent the same type of entity, and where all the sources have similar attribute structures that can be easily mapped to a target structure. Once the data streams have been merged, you can define your processing to act on all the records from all the sources.
Any attributes from the data streams you want to merge.
The Merge Data Streams configuration screen is designed to allow you to map any number of data streams through to a target, working with each input data stream in turn. Use the following instructions to map each of your input data streams to a target output data stream:
. Mappings will be created for all the selected attributes which have the same name and type as an existing output attribute. The name matching is not case sensitive.As Merge Data Streams outputs a completely new data stream from the streams input to it, it is not possible to connect processors before Merge Data Streams directly to processors after Merge Data Streams. For example, in the below screenshot, the connection the user is attempting to draw to the Writer is not possible:
As a new data stream is output (but not necessarily completely written out), it is also not possible to link back to the snapshot or staged data used in a reader when drilling down to see results. This means that when drilling down on the results of processors downstream of a Merge Data Streams processor, you will only be able to see the attributes that were actively processed, rather than all attributes in the data set.
The data attributes output by Merge Data Streams are user-defined using the configuration screen.
None
|
Execution Mode |
Supported |
|
Batch |
Yes |
|
Real time Monitoring |
Yes |
|
Real time Response |
Yes |
The Merge Data Streams processor takes all of the input records and outputs a completely new data stream. This means that when running a process that contains a Merge Data Streams processor, you may see a higher record count in the progress bar than you expect. This is because OEDQ counts all of the input records separately from the output records (in the new data stream). The same is true when running Match processors, as these also output new data streams.
Merge Data Streams presents a view of the target data set only. The input data streams are not shown.
Merge Data Streams outputs a single Merged output filter, with all input records mapped to the target structure.
In this example, a number of sources of records representing business contacts are merged into a single data stream.
Records from Source A
Records from Source B
Merge Data Streams Configuration
Source A Mappings:
Source B Mappings:
Output Data Stream
Oracle ® Enterprise Data Quality Help version 9.0
Copyright ©
2006,2012 Oracle and/or its affiliates. All rights reserved.