The aggregation of data from potentially multiple systems, or even a closer look to just one system, may bring up data quality issues. Case sensitivity, typos, misunderstandings or just simply different procedures can lead to multiple almost identical entries of the same information. This session teaches you how to de-duplicate your almost identical data using Warehouse Builder.
Level:
Difficult
Coverage:
Versions 9.2 and 10g
Publication:
Online (and hands-on in the related materials)
Intended audience:
Users who are facing the issue of near duplicates in their target Business Intelligence environment.
Users who want to get started on using the advanced de-duplication features in Warehouse Builder.
Objectives
Know the Warehouse Builder terminology for advanced de-duplication.
Know the difference between match and merge, and how to define those rules in Warehouse Builder.
Be able to design an ETL process with the match - merge operator.
Pre-requisites
None.
Content
The advanced de-duplication feature was introduced in the 9.2 release. Users running version 9.0.4 cannot benefit from the advanced de-duplication.
Component
Version
Estimated Duration (h)
Delivery
Availability
Read the Data Quality Integration whitepaper.
9.2 / 10g
1:00
March 2004
Watch the Introduction to advanced de-duplication features with OWB viewlet
9.2 / 10g
0:30
March 2004
Read the second section of the Data Quality Whitepaper, as of Overview of the Match / Merge Process on page 11.
9.2 / 10g
0:45
March 2004
Review the match - merge section of the Data Quality FAQ
9.2 / 10g
0:20
March 2004
Read the section Match-Merge Operator in chapter 8 - Using Mapping Operators - of the Warehouse Builder User's Guide.
9.2 / 10g
1:00
March 2004
Go through Demo 10 - Data Quality - in the hands-on demo*
9.2 / 10g
1:00**
April 2004
Enroll in the Oracle Warehouse Builder: Cleansing Data eClass and go through the relevant section about de-duplication (match-merge) * **
9.2 / 10g
3:00
March 2004
* Note: you may want to go through the OWB self-service
education - installation session on how to setup Warehouse Builder.
** Note: excluding setup, included in the demo document. *** Note: to enroll in this class requires
access to the iLearning Online Library
may come at a cost. Legend: = recorded, = online, = paper, = hands-on