Oracle by Example branding Downloading Citi Bike Data and Storing into Object Store

section 0Before You Begin

In this 10-minute tutorial, you will use notebook to download the Citi Bike data and to store it into Object Store.

This is the first tutorial in the Storing and Analyzing Data with Big Data Cloud series. Perform the tutorials sequentially.

Background

This tutorial uses bike ride data available from the New York City bike share program known as Citi Bike NYC. Citi Bike consists of a fleet of bikes and a network of docking stations. Bikes can be unlocked from one station and returned to any other.

What Do You Need?

  • A running BDC cluster.
  • BDC account credentials or Big Data Cloud Console direct URL (for example: https://xxx.xxx.xxx.xxx:1080/).
  • The URL to the CSV file that contains the Citi Bikes dataset.
  •  code_snippet-1.txt

section 1Navigate to Big Data Cloud Console - Notebook Page

  1. Login to your BDC account.
    Note:
    If you have the direct URL to access the Big Data Cloud Console, you can navigate to the link directly and continue from step 3.
  2. In the Services page, click the Manage this Service  This is manage this service icon. icon of the cluster and then click Big Data Cloud Console.
    bdcsce services page
    Description of the illustration a2.jpg
  3. A window titled Authentication Required appears. Enter your BDC cluster user name and password and click OK.
    authentication window
    Description of the illustration a3.jpg
  4. In the Big Data Cloud Console, click Notebook.
    bdcsce console  page
    Description of the illustration a4.jpg

section 2Create a Note

  1. In the Big Data Cloud Console Notebook page, click New Note.
    Enter a Note Name and click OK. In this example, Citi Bike Trip is entered for Note Name.
    New Note page
    Description of the illustration b1.jpg
  2. The Note opens after it is created successfully. The Note currently contains an empty paragraph.
    Citi Bike Trip note
    Description of the illustration b2.jpg

section 2Create and Run Paragraph

Perform these steps to create a paragraph.

  1. Copy code-snippet-1.txt and paste it in the empty paragraph.
  2. Click the Settings icon This is Settings icon. and choose Show title.
    Settings context menu
    Description of the illustration c2.jpg
  3. Click the paragraph title and set it to Download Data and Copy into Object Storage.
    new paragraph
    Description of the illustration c3.jpg
  4. Note: This scripts shows how to move files into the Object Storage. You store the original data file into a directory within the container called “citibike/raw”. Then, you create a modified version of the data file (that has the header row removed) and store that into a directory called “citibike/modified”.
  5. Run the paragraph by clicking the Run this paragraph icon This is Run this paragraph icon..
    Run the first paragraph
    Description of the illustration c4.jpg
    paragraph output
    Description of the illustration c4_a.jpg

next stepNext Tutorial

Working with Hive


more informationWant to Learn More?