A data lake is a repository for structured, semistructured, and unstructured data in any format and size and at any scale that can be analyzed easily. With Oracle Cloud Infrastructure (OCI), you can build a secure, cost-effective, and easy-to-manage data lake. A data lake on OCI is tightly integrated with your preferred data warehouses and analytics as well as with other OCI services, such as data catalog, security, and observability services.
Move your data in batches or streams seamlessly to an OCI data lake where it can be analyzed. Leverage OCI Data Integration, OCI GoldenGate, or OCI Streaming to ingest your data and store it in OCI Object Storage.
A central data lake on OCI integrates with your preferred tools, including databases such as Oracle Autonomous Data Warehouse, MySQL HeatWave, analytics and machine learning (ML) tools such as Oracle Analytics Cloud, and open source projects such as Apache Spark.
A comprehensive set of AI and ML services lets you gain new insights from your data, make predictions, lower your operational overhead, and improve customer experience.
Catalog your data and gather insights about your data lake with OCI Data Catalog. Enable query tools and databases to discover and query your data in the object store.
Oracle Cloud Infrastructure is launching a fully managed data lake service called OCI Data Lake this year. You can sign up for early access to explore its features and capabilities before it's released to the public.
A data lake makes it possible to work with more kinds of data, but the time and effort needed to manage it can be disadvantageous. By offering fully managed open source data lake services, OCI provides both lower costs and less management, so you can expect reduced operational costs, improved scalability and security, and the ability to incorporate all of your current data in one place.
Data warehouses and data marts are crucial to successful businesses. Integrating them with a data lake will increase their value even more. Integration among databases, data warehouses, and a data lake with Oracle means that data can be accessed from multiple locations with a single SQL query. Current applications and tools get transparent access to all data, with no changes and no need to learn new skills.
Data generated by enterprise applications is highly valuable, but it’s rarely fully utilized. A data lake on OCI simplifies access to data from multiple applications and enables sophisticated analysis that can mean the difference between a good quarter or a bad quarter.
Centralize your data with an embedded OCI Data Integration experience.
Query any data from any source without replication.
Preintegrated applications for instantaneous time to value.
Catalog and govern with an embedded OCI Data Catalog experience.
Secure data with fine-grained, role-based access control policies.
Oracle Autonomous Database supports integration with data lakes—not just on Oracle Cloud Infrastructure, but also on Amazon Web Services (AWS), Microsoft Azure, Google Cloud, and more. You have the option of loading data into the database or querying the data directly in the source object store. Both approaches use the same tools and APIs to access the data.
This architecture is sometimes referred to as a lakehouse architecture.
One MySQL cloud database service for transactions, real-time analytics across data warehouses and data lakes, and machine learning—without the complexity, latency, risks, and cost of ETL duplication.
Quickly create Hadoop-based or Spark-based data lakes to extend your data warehouses and ensure all data is both easily accessible and managed cost-effectively.
Connect and extend analytical applications with real-time consistent transactional data, efficient batch loads, and streaming data.
Build a data lake using fully managed data services with lower costs and less effort.
Leverage OCI integration of your data lakes with your preferred data warehouses and uncover new insights.
Gain insights from data with prebuilt AI models, or create your own.
Oracle partner solutions leverage and augment data lakehouses on OCI.
Oracle offers a Free Tier with no time limits on a selection of services, including Autonomous Data Warehouse, OCI Compute, and Oracle Storage products, as well as US$300 in free credits to try additional cloud services. Get the details and sign up for your free account today.
The best way to learn is to try it yourself. Try this free data lake workshop, which demonstrates a typical usage scenario and highlights some of the tools you can use to build a data lake.
The labs in this workshop walk you through the steps you need to access a data lake created with Oracle Object Storage buckets by using Oracle Autonomous Database and OCI Data Catalog.
Start data lake access labLearn how to create and monitor a highly available Hadoop cluster using Big Data Service and OCI. You’ll also add Oracle Cloud SQL to the cluster and access the utility and master node, and learn how to use Cloudera Manager and Hue to access the cluster directly in a web browser.
Start the data lake labUse analytics and machine learning to analyze 70 years of racing data. Find out what makes some races so exciting you can’t look away while others are more predictable.
Start the data analytics labDiscover how to use OCI Anomaly Detection to create customized machine learning models. You’ll take data uploaded by users, use a specialized algorithm to train a model, and deploy the model into the cloud environment to detect anomalies.
Start the anomaly detection lab nowInterested in learning more about a data lake? Let one of our experts help.