Oracle Magazine Issue Archive
2008
January 2008
FEATUREInformation Comes HomeBy David BaumOracle Database 11g manages all your enterprise data. Every business depends on the ability to capture, analyze, and share information efficiently. The need to establish unified, enterprisewide policies for managing this information is leading IT departments to Oracle Database 11g, which can manage all types of datafrom traditional business information to office documents, XML, and both 2-D and 3-D spatial information.
These capabilities are making Oracle Database 11g a popular choice to power transaction processing, data warehousing, and content management applications. "Our objective with Oracle Database 11g is to provide a single platform to store, manage, back up, and secure all your data," says Vishu Krishnamurthy, Oracle's senior director of SQL, text, and XML development. "We're delivering a single repository for both data and metadata, much of which is currently stored in file systems and content management systems and on unmanaged PCs." An Evolving ArchitectureIn addition to improvements in performance, scalability, availability, security, and management efficiency, Oracle Database 11g includes many new features that support all your enterprise data. For example, Oracle Database 11g's Spatial option provides native semantic technology, which identifies and creates connections between disparate pieces of datasuch as a company's customers, orders, and productscreating new, actionable insights. These capabilities are of particular importance to Metatomix, a leading provider of semantic solutions in the government, financial services, manufacturing, and life sciences sectors. "I'm not just looking for solutions to relational data storage but for a reliable repository for all types of information," says Colin Britton, chief technology officer at Metatomix. "With an Oracle database, we can manage our data, our configuration information, and all of our code in a single repository. We get all the benefits of backup, management, failover, clustering, and so forth, without supporting different strategies for different parts of the application." Metatomix also favors Oracle Database 11g for its native Resource Description Framework (RDF) storage capabilities. "Oracle supplies a semantic layer within the database, and Oracle is the only database vendor that offers native support for RDF datatypes and OWL [Web ontology language] ontologies," adds Britton. "This makes it a good technical option for us and our customers." For example, a U.K.-based financial services organization hired Metatomix to speed up data-intensive tasks related to the processing of mutual fund trades. The bank needed a business intelligence application that could understand the meaning of the information it was gathering, which led the company to semantic technology. Britton says that the Oracle-based solution helps this financial services organization verify fund prices and identify data anomalies. "Financial services firms have massive amounts of data, governed by complex business processes and analytics," he says. "Semantic technology offers insight whenever exceptions occur, and Oracle has the right database platform for maintaining the information." Reducing Performance OverheadOrganizations see the wisdom in consolidating information into a professionally managed database; however, there have been concerns about the performance overhead required to store and retrieve large objects (LOBs). To address the issue of LOB performance, Oracle Database 11g includes Oracle SecureFiles, a new and improved storage infrastructure for LOB datatypes. Oracle partner TUSC has been working with a large government client testing Oracle SecureFiles. Based in Chicago, TUSC is a consulting firm that helps companies optimize their investments in Oracle technology and applications. "Oracle SecureFiles enables the customer to manage unstructured data in the same way as structured data," says Richard J. Niemiec, CEO of TUSC and former president of the International Oracle User Group. "It gives the same security, backup, recovery, encryption, flashback technology, locking, clustering technology, partitioning. . . . All these mature database features can now be brought to bear on all types of data." While some companies have had reservations about putting unstructured information into a relational database for performance reasons, Niemiec believes that the new functionality of Oracle Database 11g with Oracle SecureFiles can resolve these concerns. "Oracle SecureFiles uses entirely different technology for locking, space management, and disk storage than former versions of the database," he says. "Oracle SecureFiles is not only much faster than traditional LOBs, it's faster than a file system. This is one of the most compelling reasons to move to Oracle SecureFiles and Oracle Database 11g." During their testing, TUSC found out that Oracle SecureFiles was 22 times faster than LOBs when reusing deleted space. Similarly, TUSC saw PL/SQL reads as 6 times faster, and SQL*Loader operation as nearly 3 times faster. "All of the performance issues that might have prevented people from storing unstructured content in the database have been addressed," Niemiec says. A Remodeled Storehouse for XMLAs a markup language capable of describing all kinds of informationincluding structured, unstructured, and semi-structured dataXML appeals to many types of organizations. Oracle was the first commercial relational database vendor to offer native XML capabilities within a relational database. "Oracle made XML a fundamental datatype with Oracle9i Database Release 2," says Oracle's Krishnamurthy. Oracle Database's XML capabilities caught the ear of Don Janik, senior technical manager at Warner Music Group, home to well-known record labels such as Asylum, Atlantic, and Elektra. Last year, Warner Music Group implemented the XML DB feature in Oracle Database 10g to collect, validate, and track critical sales information from its business partners. The music company has also tested the new XML capabilities in Oracle Database 11g. Warner Music Group looked at various solutions for handling XML data. They chose the Oracle solution because they like the way Oracle implements structured storage. "Based on the XML schema, it is not necessary for Oracle XML DB to store XML tag names when storing the contents of XML documents," Janik says. "This can significantly reduce the storage space required."
Oracle XML DB is commonly used for storing, retrieving, and managing massive volumes of XML data. XML documents can be stored either as character large objects for unstructured and semistructured documents such as Microsoft Word documents, or as object-relational datatypes for structured documents such as purchase orders. This functionality enables XML documents to be accessed using industry standard SQL, XML, and file/folder interfaces. Moreover, says Oracle's Krishnamurthy, Oracle Database 11g introduces a new storage representation and indexing method for XML data: Binary XML datatype and XMLIndex. Binary XML improves performance and storage efficiency. In combination with XMLIndex, it can provide up to a 15-fold performance improvement when accessing XML documents. Warner Music Group uses Oracle XML DB to expedite the loading and validation of digitally formatted sales information. In one test, Janik's team processed 50,000 sales transactions in 15 minutes. This is partly due to the unique architecture of Oracle Database. When an XML schema is registered, Oracle XML DB generates a set of SQL objects that correspond to complex types defined in that schema. XPath expressions, sent to Oracle XML DB functions, are translated to SQL access methods that operate directly against the underlying objects. "By abstracting the storage model through the use of the XMLType datatype, and providing a set of operators that use XPath to perform operations against XML documents, Oracle XML DB lets us switch between structured and unstructured storage and to experiment with different forms of structured storage without affecting the application," Janik says. This is particularly important as the music industry prepares for compliance with Recording Industry Association of America (RIAA) standards for electronic messages. The RIAA is proposing use of the Digital Data Exchange XML-based message formats for processing payments and royalties among music service providers, artists, publishers, and composers. Because of the innovative work being done by Janik and his team, Warner Music Group will be well positioned to take advantage of these new standards. Integrating Spatial DataScientists working for the U.S. Army Corps of Engineers rely on Oracle to store all kinds of data, including geospatial and multimedia information. "Oracle Database 11g helps us pull together and process very large sets of both unstructured and traditional structured data," says Michael Smith, a physical scientist who works at the Cold Regions Research and Engineering Lab of the U.S. Army Corps of Engineer's Engineer Research and Development Center. Smith says that one application in particular that can benefit from the new features of Oracle Database 11g is the Operation and Maintenance Business Information Link Regulatory Module (ORM2), a Web-based system for issuing and tracking development projects in and around public wetlands. Anyone who wants to develop on waters of the United States is required to apply for a wetland permit. The Corps uses ORM2 to track the development and its associated impact on the landscape. According to Smith, this permit data used to be stored in database columns as x-y coordinate pairs. The Corps recently moved it to Oracle Spatial to gain point, line, and polygon storage capabilities. "Spatial storage maintains the spatial indexes, including the type of geometry, the projections, the tolerances, and the whole series of coordinate pairs," he says. "This lets us do a lot more work inside the database."
The Corps makes this data available through Web Feature Services (WFS), making it easier to enter data and share it with other government agencies. Oracle Database 11g can dispatch those Web services directly from the database, thus simplifying the infrastructure. "Being able to store the spatial indexes directly in the database, at the transactional level, enables us to edit the polygons through the WFS transactions," says Smith. "It's a standardized format, so we don't have to develop and maintain as much code." The center is also developing database applications to work with 3-D point cloud storage and indexing functions, which can now be stored directly in Oracle Database 11g. (A point cloud is a set of three-dimensional points describing the outlines or surface features of an object.) The laboratory uses this type of data structure for recording measurements that come from its light detection and ranging instruments. "Up until now, such data had to be stored in flat text files, because databases had no way of tying together the data sets so that we could run calculations against them as a single entity," says Smith. "Oracle Spatial 11g can store a point cloud as a single object. This means that we can write simple queries to do line-of-sight, data-point-intensity, or nearest-neighbor calculations without even moving the data out of the database." Moving forward, the agency plans to use Oracle Database 11g to merge business and technical data into a common repository. Like thousands of other Oracle customers, Smith and his team have learned an important lesson: To simplify management and gain meaningful insight into all your information, you need to be able to store, manage, and analyze all types of data in a cohesive way.
David Baum (david@dbaumcomm.com) is a freelance business writer based in Santa Barbara, California. |