You are here: Processor Library > Transformation > Denoise

Denoise

The Denoise processor removes user-defined 'noise' characters from text attributes, and returns the denoised value in a new output attribute.

The list of noise characters can be entered as a list on–screen, or a reference list may be used, or both.

Use

Inconsistent formatting, punctuation and spurious control characters etc. can mask otherwise consistent values in data.

Use the Denoise processor to remove these 'noise' characters from text attributes, prior to other processing, such as before performing a List Check on a text attribute.

Configuration

Inputs

Any String or String Array type attributes that you wish to denoise. Number and Date attributes are not valid inputs.

Note that if you input an Array attribute, the transformation will apply to all array elements, and an Array attribute will be output.

Options

Option

Type

Purpose

Default Value

Noise characters Reference Data

Reference Data

List of noise characters

*Noise Characters

Noise characters

Free text

Additional noise characters

None

Outputs

Data attributes

Data attribute

Type

Purpose

Value

[Attribute Name].Denoise

Derived

The denoised version of the attribute value.

This may be a String or an Array, depending on the input attributes.

The original attribute value, denoised.

Flags

None

Execution

Execution Mode

Supported

Batch

Yes

Real time Monitoring

Yes

Real time Response

Yes

Results Browsing

The Denoise transformer presents no summary statistics on its processing.

In the Data view, each input attribute is shown with its new derived denoised attribute to the right.

Output Filters

None

Example

In this example the Denoise processor is used to remove all hash characters (#) from a NAME attribute:

Oracle ® Enterprise Data Quality Help version 9.0
Copyright © 2006,2011 Oracle and/or its affiliates. All rights reserved.