Oracle Data Platform for Retail

Live data analysis


Apply best practices to improve analysis of your critical retail data

The ability to perform live data analysis using high-quality data is crucial for organizations in many industries, but it’s particularly important for retailers. Timely insights drawn from accurate data can help improve customer satisfaction by offering in-the-moment product recommendations and ensuring stock is in the right place at the right time; optimize merchandising, marketing, and sales efforts with real-time assessments of how well promotions are performing; lower costs and risk via more-precise inventory forecasts; and more. In short, effective live data analysis has the potential to positively impact retail operations across your organization.

To get the greatest value from live data analysis, you need to implement a single optimized approach to data lifecycle management across your most critical datasets. This approach helps you

  • Reduce data complexity and duplication.
  • Minimize the risk and costs associated with poor-quality data.
  • Create a single, consistent view of your data.
  • Deliver data in a consistent form across the organization.
  • Make self-service business intelligence (BI) for reporting and advanced data analytics available, regardless of the tools used by domain teams.
  • Build flexibility into your data landscape, keeping the cost of change in the future as low as possible.

Modernize your analytics with an optimized analytics solution

The following architecture demonstrates how Oracle Data Platform is built to provide retailers with a cohesive, comprehensive framework to manage the entire data analytics lifecycle. At its center are two critical components: the operational data store (ODS)—used to store operational data that is ingested and persisted in raw form with no transformations applied—and a data warehouse, where the data is stored in optimized form for query performance and advanced analysis.

When combined, the ODS and data warehouse create a data platform capable of more efficient and advanced analytics. The combination enables the effective application of advanced analytics and visualization tools while retaining the ability to investigate the data in its raw form to identify anomalies or insights without impacting the performance of the underlying transactional application. This approach is beneficial for retailers because it prevents contradictory and inaccurate duplication of the same source data, which, if used to inform an organization’s decisions, can cause delays, errors, and ultimately lost sales.

Let’s take a closer look at how Oracle Data Platform incorporates an ODS, data warehouse, and other key components to help retailers effectively use live data analysis.

Live data analysis diagram, description below

This image shows how Oracle Data Platform for retail can be used to support the analysis of live and historical data in optimized form. The platform includes the following five pillars:

  1. 1. Data Sources, Discovery
  2. 2. Connect, Ingest, Transform
  3. 3. Persist, Curate, Create
  4. 4. Analyze, Learn, Predict
  5. 5. Measure, Act

The Data Sources, Discovery pillar includes two categories of data.

  1. 1. Applications comprises data from ERP, SCM, CX, and WMS applications, Fusion SaaS, NetSuite, E-Business Suite, PeopleSoft, JD Edwards, Salesforce, SAP, and Workday.
  2. 2. Business records comprises sales transactions, customer data, product data, returns transactions, supplier data, inventory data, data from point-of-sale systems, and revenue and margin data.

The Connect, Ingest, Transform pillar comprises two capabilities.

  1. 1. Batch ingestion uses OCI Data Integration, Oracle Data Integrator, and DB tools. Change data capture uses OCI GoldenGate and Oracle Data Integrator.
  2. 2. Both capabilities connect unidirectionally into the operational data store capability within the Persist, Curate, Create pillar.

The Persist, Curate, Create pillar comprises three capabilities.

  1. 1. The operational data store uses Oracle Autonomous Transaction Processing.
  2. 2. The serving data store uses Oracle Autonomous Data Warehouse and database tools.
  3. 3. Governance uses OCI Data Catalog.

These capabilities are connected within the pillar. The operational data store is unidirectionally connected to the serving data store.

One capability connects into the Analyze, Learn, Predict pillar: The serving data store connects unidirectionally to the analytics and visualization capability.

The Analyze, Learn, Predict pillar comprises one capability.

  1. 1. Analytics and visualization uses Oracle Analytics Cloud, GraphStudio, and ISVs.

The Measure, Act pillar comprises a single category of consumer: dashboards and reports.

The three central pillars—Ingest, Transform; Persist, Curate, Create; and Analyze, Learn, Predict—are supported by infrastructure, network, security, and IAM.

There are two (or optionally three) main methods of injecting data into an architecture to enable retailers to better analyze their data.

  • To start our process, we need to gain visibility into up-to-date data from our business records and applications (for example, inventory levels across retail locations). To do so, we use OCI GoldenGate to enable change data capture (CDC) ingestion of near real-time data from operational databases (transactional processing). This will include all records or discrete record sets related to retail transactions, including point-of-sale and web transactions (both sales and returns), and inventory, logistics, and supply chain data. In addition to triggering data ingestion using time stamps or flag filters, data can be ingested through a CDC mechanism that detects changes as they happen. OCI GoldenGate provides a CDC mechanism that can process source changes noninvasively by processing log files of completed transactions and storing these captured changes in external trail files, independent of the database. Changes are then reliably transferred to a staging database or operational data store.
  • We can now add datasets relevant to core retail transactions, including inventory and product data, customer records, and offers and prices. These datasets often comprise large volumes of often on-premises data, and in most cases, batch ingestion is typically most efficient.

    That said, there are some things to consider when deciding how to collect transactional data from operational sources to populate operational data stores. The techniques available vary mostly in terms of the latency of data integration, ranging from scheduled daily batches to continuous real-time integration. Data is captured from sources via incremental queries that filter either based on a time stamp or flag. The techniques also vary in whether they use a pull or push operation; a pull operation pulls in new data at fixed intervals, while a push operation loads data into the target once a change appears. A daily batch ingestion is most suitable if intraday freshness isn’t required for the data—for example, data on longer-term trends or data that’s only calculated once daily, such as financial close information. Batch loads might be performed in a downtime window if the business model doesn’t require 24-hour data warehouse availability. Different techniques, such as real-time partitioning or trickle and flip exist to minimize the impact of a load to a live data warehouse when no downtime window is available.
  • Optionally, we can also use streaming ingestion to ingest data read from beacons at store locations through IoT, machine-to-machine communication, and other means. Video imaging can also be consumed this way. Additionally, in this use case, we intend to analyze and rapidly respond to consumer sentiment by analyzing social media messages, responses to first-party posts, and trending messages. Social media (application) messages/events will be ingested with the option to perform some basic transformation/aggregation before the data is stored in cloud storage. Additional stream analytics can be used to identify correlating consumer events and behavior, and identified patterns can be fed back (manually) for OCI Data Science to examine the raw data.

Data persistence and processing is built on two components.

  • The operational data store is used for operational reporting on raw data and as a source of data for an enterprise or domain-level service data store or enterprise data warehouse (EDW). It’s a complementary element to an EDW in a decision support environment. An ODS is typically a relational database designed to integrate and persist data from multiple sources to be used for additional operations, reporting, controls, and operational decision support, whereas the EDW is used for tactical and strategic decision support. Usually the ODS’s data model is very close to the OLTP source application’s data model. Any source data should be accepted by the ODS and almost no data quality rules should be implemented, ensuring you have a store representing all the data of the day from operational systems. Unlike a production master data store, the data is not passed back to the operational system. Data warehouses are typically read-only and batch updated on a specific schedule, while operational data stores are maintained in closer to real time and trickle fed constantly.
  • We have now created processed datasets ready to be persisted in optimized relational form for curation and query performance in the serving data store. In this use case, the serving data store is a data warehouse, a type of persistence platform that is designed to support business intelligence activities and increasingly advanced analytics. The main goal of a data warehouse is to consolidate and deliver accurate indicators to business users to help them make informed decisions in their day-to-day work as well as larger strategic business decisions. To do this, data warehouses are highly specialized, often contain large amounts of historical data, and are solely intended to perform queries and analysis. A data warehouse centralizes and consolidates large amounts of data from multiple sources, such as application log files and transaction applications, and then delivers it in optimal form for analysis. Its analytical capabilities allow organizations to derive valuable business insights from their data to improve decision-making. Over time, it builds a historical record that can be invaluable to data scientists and business analysts. Because of these capabilities, a data warehouse can be considered an organization’s “source of truth.” There has been a tendency to view data warehouses purely as technology assets, but they actually provide a unique environment to bring business users and IT together to develop and deliver a shared understanding of a retailer’s operating environment and to complete tasks such as
    • Defining business needs (key indicators); identifying source data that concerns key indicators; and specifying business rules to transform source information into key indicators
    • Modelling the data structure of the target warehouse to store the key indicators
    • Populating the indicators by implementing business rules
    • Measuring the overall accuracy of the data by setting up data quality rules
    • Developing reports on key indicators
    • Making key indicators and metadata available to business users through ad hoc query tools or predefined reports
    • Measuring business users’ satisfaction and adding or modifying key indicators

The ability to analyze, learn, and predict is built on two technologies.

  • Analytics and visualization services deliver descriptive analytics (describes current trends with histograms and charts), predictive analytics (predicts future events, identifies trends, and determines the probability of uncertain outcomes), and prescriptive analytics (proposes suitable actions, leading to optimal decision-making), enabling retailers to answer questions such as
    • How do actual sales this period compare to the current plan?
    • What is the retail value of inventory on hand, and how does it compare to the same period last year?
    • What are the best-selling items in a division or department?
    • How effective was the last promotion?

    Alongside the use of advanced analytics and visualizations, machine learning models can be developed, trained, and deployed.

    Governance is a critical factor to consider when building a solution such as this. Business users rely on the accuracy of key indicators from the data warehouse to make decisions. If these indicators are wrong, the decisions are also likely to be wrong. Depending on the data quality strategy you have defined, business users will likely need to actively participate in the monitoring of data discrepancies. They will have to help the IT team refine how the indicators are calculated and assist with the qualification and identification of erroneous data. This generally leads to the modification and testable improvement of the business rules.

  • Our curated, tested, and high-quality data and models can have your governance rules and policies applied and can be exposed as a data product (API) within a data mesh architecture for distribution across the retail organization. This can be critical to addressing data quality issues. Poor data quality impacts almost every retail organization. Inconsistent, inaccurate, incomplete, and out-of-date data is often the root cause of expensive business problems such as operational inefficiencies, faulty analysis, unrealized economies of scale, and dissatisfied customers. These data quality issues and the business-level problems associated with them can be solved by committing to a comprehensive data quality effort across the enterprise, exploiting the capabilities of the architecture described above.

Make better decisions with better data

Oracle Data Platform is built to ensure you have organizationwide access to consistent, high-quality data when and where you need it so you can do the following:

  • Make better-informed decisions.
  • Minimize the cost of future changes with a consistent, but flexible, data landscape.
  • Reflect process and data changes many times with fewer silos and no impact on data availability and quality.
  • Reduce the risk of errors in critical financial and regulatory reporting by eliminating siloed copies of the same data with differing transformation logic across the enterprise data landscape.
  • Provide self-serve advanced analytics and data discovery for reporting with far better data availability—access to data is no longer tied to the specific tools used by domain teams.
  • Reduce storage costs for growth of the ODS and other data stores.
  • Spend more time looking at the insight the data provides and less time identifying the discrepancies caused by multiple copies of data across disconnected silos.
  • Reduce risk by no longer having multiple copies of data, which increase the surface attack area.

Related resources

Начало работы с современной платформой данных Oracle

Более 20 бесплатных облачных служб Always Free в 30-дневной пробной версии

Oracle предлагает бесплатную пробную версию без ограничений по времени для более чем 20 сервисов, таких как Autonomous Database и Arm Compute и Storage, а также бонусы на 300 долларов США для пробного использования дополнительных облачных сервисов. Узнайте подробности и зарегистрируйтесь бесплатно уже сегодня.

  • Что предлагается в рамках Oracle Cloud Free Tier?

    • 2 автономные базы данных, объемом 20 ГБ каждая
    • Виртуальные машины AMD и Arm Compute
    • Общее блочное хранилище объемом 200 ГБ
    • Объектное хранилище на 10 ГБ
    • 10 ТБ исходящих данных в месяц
    • Более 10 бесплатных сервисов Always Free
    • Бонус в 300 долларов США сроком на месяц и даже больше

Учитесь с помощью пошаговых инструкций

Ознакомьтесь с широким спектром сервисов OCI с помощью учебных пособий и тренингов. Независимо от того, являетесь Вы разработчиком, администратором или аналитиком, мы поможем Вам понять, как работает OCI. Многие практические занятия проходят на уровне Oracle Cloud Free Tier или на бесплатной платформе для практических занятий Oracle.

  • Начало работы с базовыми сервисами OCI

    Практические занятия этого семинара охватывают введение в основные сервисы Oracle Cloud Infrastructure (OCI), включая виртуальные облачные сети (VCN), а также сервисы вычислительных ресурсов и хранения.

    Начать практическое занятие «Базовые сервисы OCI»
  • Быстрый запуск Autonomous Database

    На этом семинаре Вы ознакомитесь с пошаговыми инструкциями по началу работы с Oracle Autonomous Database.

    Начать практическое занятие Autonomous Database
  • Создание приложения из электронной таблицы

    В рамках этого практического занятия Вы загрузите электронную таблицу в таблицу базы данных Oracle, а затем создадите приложение на основе этой новой таблицы.

    Начать этот тренинг
  • Развертывание приложения HA в OCI

    На этом практическом занятии Вы развернете веб-серверы на двух вычислительных экземплярах в Oracle Cloud Infrastructure (OCI), настроенных в режиме высокой доступности с помощью балансировщика нагрузки.

    Начать практическое занятие «Приложение с высокой доступностью»

Изучите более 150 лучших практик

Посмотрите, как архитекторы и заказчики Oracle развертывают различные нагрузки: от корпоративных приложений до высокопроизводительных вычислений и от микросервисов до озер данных. Ознакомьтесь с лучшими практиками, узнайте много нового от архитекторов-заказчиков в нашей серии видео Built & Deployed, а также разверните множество нагрузок либо благодаря возможности «нажмите, чтобы развернуть», либо самостоятельно с помощью репозитория Oracle GitHub.

Популярные архитектуры

  • Apache Tomcat и сервис MySQL Database
  • Oracle Weblogic в Kubernetes и Jenkins
  • Среды машинного обучения и искусственного интеллекта
  • Tomcat on Arm и Oracle Autonomous Database
  • Анализ журналов со стеком ELK
  • HPC и OpenFOAM

Узнайте, сколько можно сэкономить благодаря возможностям OCI

Ценообразование Oracle Cloud построено на принципах простоты и постоянства с поддержкой широкого спектра сценариев использования. Чтобы оценить низкую ставку, откройте калькулятор затрат и настройте сервисы в соответствии с Вашими потребностями.

Почувствуйте разницу:

  • 1/4 исходящих затрат на пропускную способность
  • 3-кратное соотношение «цена-производительность» для вычислений
  • Одинаковая низкая цена в каждом регионе
  • Низкие цены без долгосрочных обязательств

Связаться с отделом продаж

Хотите узнать больше об Oracle Cloud Infrastructure? Позвольте одному из экспертов Oracle помочь.

  • Они могут ответить на такие вопросы, как:

    • Какие нагрузки лучше всего выполняются в OCI?
    • Как получить максимальную отдачу от инвестиций в Oracle?
    • Чем OCI отличается от облачных вычислений других поставщиков?
    • Как может OCI помочь Вам в достижении целей по IaaS и PaaS?