Configuring Delimited Text File Formats

When you define a new data store that connects to a text file with delimited fields, the New Data Store wizard will prompt you to supply the information necessary to parse the file correctly.

Data in a delimited text file is arranged in rows and columns, with one entry per row.  Each column is separated from the next by a field separator character.  The following file snippet illustrates characteristics common to many delimited files.  It contains information about cars and their owners, with each field separated from the next by a comma:

Notice that:

 

In order to parse the data in a delimited text file correctly, OEDQ needs to be informed of the special characters used in that file.  This is done in the New Data Store wizard, and can be edited as part of the data store settings later, if required:

As well as the name and location of the file, this wizard requires you to specify the field separator, used to separate data in one column from data in the next, and the quote character, used to enclose data in any field which contains the separator character.

The wizard also offers the option of discarding any non-data rows at the beginning of the file, and specifying that the first data row should be treated as a header row, containing column names. You can also specify how many columns are present in the data. If no column number is specified, the number of columns will be assumed to be the same as the number of fields found in the header row. If neither the number of columns nor the header row is specified, a default of 20 columns will be assumed.

Configuring a directory of delimited text files

If you have a set of delimited text files which share the same file layout and special characters, you can configure the whole directory as a data store. To do this, the files should have the following attributes in common:

In addition, if you are specifying the number of columns in the file (rather than deriving the number from a header row), the files should contain the same number of columns.

Oracle ® Enterprise Data Quality Help version 9.0
Copyright © 2006,2011 Oracle and/or its affiliates. All rights reserved.