Manipulating Data from Oracle Object Storage to ADW with Oracle Data Integrator (ODI)
0. Introduction and Prerequisites
This article presents an overview on how to use Oracle Data Integrator in order to manipulate data from Oracle Cloud Infrastructure Object Storage. The scenarios here present loading the data from an object storage in Oracle Cloud Infrastructure and then move the data to Oracle Autonomous Data Warehouse (ADW[SR1] ).
This document could be a reference for customer have data storage in different regions and want to do the data integration& feed into Data warehouse.
Main steps are listed here:
1. Install ODI 12.2.1.3.0.
2. Patch p26669648_122130_Generic to upgrade ODI to version 12.2.1.3.1.
3. Set up Source Data Server/Physical Schema/Model in Object Storage.
4. Set up Target Data Server/Physical Schema/Model in ADW.
5. Creating a Mapping and test it.
You should have Object storage and ADW instance provisioned.
1. Install ODI 12.2.1.3.0
You can refer below link for reference.
https://docs.oracle.com/en/middleware/lifecycle/12.2.1.3/oding/installing-and-configuring-oracle-data-integrator.pdf
2. Patch p26669648_122130_Generic to upgrade to 12.2.1.3.1
You need to patch ODI to version 12.2.1.3.1 firstly.
3. Now you get ODI 12.2.1.3.1
4. New a Data Server
Let’s setup the topology. Right click Oracle Object Storage
An overview:
Let me explain the items above.
a. Region:
Oracle Object Storage region. A region is a localized geographic area, and an availability domain is one or more data centers located within a region. A region is composed of several availability domains. Most Oracle Cloud Infrastructure resources are either region-specific, such as a virtual cloud network, or availability domain-specific, such as a compute instance.
b. Tenant OCID:
Tenant’s Oracle Cloud ID. Every Oracle Cloud Infrastructure resource has an Oracle-assigned unique ID called an Oracle Cloud Identifier (OCID). It's included as part of the resource's information in both the Console and API. To find your tenancy's OCID. Go to Administration-> Tenancy Details.
c. User OCID:
Oracle Cloud ID of the user logging into Oracle Object Storage.
In the Console on the page showing the user's details. To get to that page:
- If you're signed in as the user, click the user icon present in the top-right corner of the Console, and then click User Settings.
- If you're an administrator doing this for another user, instead click Identity, click Users, and then select the user from the list.
User OCID: api.user
Security
d. Private Key File – Click the browse button to choose the location of the private key file (in PEM format)
Follow the steps to generate the private key and fingerprint
https://docs.cloud.oracle.com/iaas/Content/API/Concepts/apisigningkey.htm#How
- Passphrase – Passphrase is the password used while generating the private key
e. fingerprint
f. username:
Specify the user api.user, need to be same with item c. User OCID
Caution: Upload the public key to Object Storage.
You can upload the PEM public key in the Console, located at https://console.us-ashburn-1.oraclecloud.com. If you don't have a login and password for the Console, contact an administrator.
- Open the Console, and sign in.
- View the details for the user who will be calling the API with the key pair:
- If you're signed in as this user, click your username in the top-right corner of the Console, and then click User Settings.
- If you're an administrator doing this for another user, instead click Identity, click Users, and then select the user from the list.
- Click Add Public Key.
- Paste the contents of the PEM public key in the dialog box and click Add.
Test Connection
5. Creating an Oracle Object Storage Physical Schema
Create an Oracle Object Storage physical schema using the standard procedure, in Administering Oracle Data Integrator.
Oracle Object Storage specific parameters are:
- Name: Name of the physical schema created
- Bucket (Schema): It specifies the Oracle Object Storage Bucket name from which upload, download or the delete operation will happen. Select the required bucket from the Bucket Name drop-down list.
- Directory (Work Schema): This is the temporary folder on the local system used for getting files from Oracle Object Storage bucket during reverse engineering. If the directory does not exist it will be created. Specify the required location in the local system.
And the logical schema:
6. Creating and Reverse-Engineering an Oracle Object Storage Model
Creating an Oracle Object Storage Model
An Oracle Object Storage model is a set of data stores, corresponding to files stored in an Oracle Object Storage bucket. In a given context, the logical schema corresponds to one physical schema. You can create a model from the logical schema for the Oracle Object Storage technology. The bucket schema of this physical schema is the Oracle Object Storage bucket containing all the files. You can create new ODI Data store that will represent a file in Oracle Object Storage so that it can be used in mappings.
Input the information required and Save.
Reverse-Engineering Delimited Files from Oracle Object Storage
To perform a delimited file reverse engineering:
- In the Models accordion, right click your Object Storage Model and select New Data store. The Data Store Editor opens.
- In the Definition tab, enter the following fields:
- Name: Name of this data store
- Resource Name: Click the Search icon, to select the required file from the list of files present in Oracle Object Storage for the configured bucket.
- Go to the Storage tab, to describe the type of file. Set the fields as follows:
- File Format: Delimited
- Heading (Number of Lines): Enter the number of lines of the header. Note that if there is a header, Oracle Data Integrator uses the first line of the header to name the columns in the file.
- Select a Record Separator.
- Select or enter the character used as a Field Separator.
- Enter a Text Delimiter if your file uses one.
- Enter a Decimal Separator, if your file contains decimals.
- From the File main menu, select Save.
- In the Data Store Editor, go to the Attributes tab.
- In the editor toolbar, click Reverse Engineer.
Click Reverse Engineer, ODI will generate the Metadata based on the header of the file.
- Verify the data type and length for the reverse engineered attributes. Oracle Data Integrator infers the field data types and lengths from the file content, but may set default values (for example 50 for the strings field length) or incorrect data types in this process.
- From the File main menu, select Save.
7. Create a Connection with ADW
Create a Data Server for ADW. Specify the Credential file and choose the connection details from dropdown list.
JDBC information will be there, no need to update.
And Test the connection
And then new a Physical Schema.
New the Model and Reverse Engineer.
8. New a Project, Mapping and Test
Set the AP(Access Point) as below:
Caution: You need to run the store procedure to create credential on ADW before running the Mapping.
set define off
begin
DBMS_CLOUD.create_credential(
credential_name => 'ODI',
username => 'api.user',
password => '.};rKwO6t8***'
);
end;
/
set define on
Mapping run finished.
And Review the data loaded in ADW.
Comparing with the source csv file in Oracle Object Storage:
Looks good.
Conclusion:
With the Oracle Data Integrator 12c releases Oracle introduced several new enhancements, more source and target are supported (Oracle Object Storage, Oracle Autonomous Data Warehouse Cloud (ADW), Oracle Autonomous Transaction Processing (ATP), Oracle Enterprise Resource Planning (ERP) Cloud etc.). This document could help customer to achieve their data integration over different regions or oversea.
The ODI 12c releases continue to improve Oracle’s strategic Data Integration platform while preserving the key product differentiators: Declarative Design, Knowledge Modules, Hot-Plug-ability, and E-LT architecture.
Manipulating Data from Oracle Object Storage to ADW with Oracle Data Integrator (ODI)的更多相关文章
- Oracle Schema Objects(Schema Object Storage And Type)
One characteristic of an RDBMS is the independence of physical data storage from logical data struct ...
- [译]OpenStack Object Storage Monitoring
注:翻译的不完整,主要是有些地方翻译后反而妨碍理解,有些不知道怎么翻,anyway,需要时拿来用用也是可行的,顺便共享啦.欢迎提意见. 一个OpenStack Object Storage(OSOS) ...
- centos6.4 ceph安装部署之ceph object storage
preface: ceph-deploy does not provide a rapid installation for Ceph Object Storage install Configura ...
- [转]Build An Image Manager With NativeScript, Node.js, And The Minio Object Storage Cloud
本文转自:https://www.thepolyglotdeveloper.com/2017/04/build-image-manager-nativescript-node-js-minio-obj ...
- golang 操作ceph object storage
ceph的object storage 提供了和amazon s3兼容的接口以供客户访问. 在ceph的官网上,可以看到它提供了多种语言的访问范本,例如python的(http://docs.ceph ...
- js & h5 & app & object storage
js & h5 & app & object storage API https://developer.mozilla.org/en-US/docs/Web/API Stor ...
- Oracle客户端工具出现“Cannot access NLS data files or invalid environment specified”错误的解决办法
Oracle客户端工具出现"Cannot access NLS data files or invalid environment specified"错误的解决办法 方法一:参考 ...
- 问题-Error creating object. Please verify that the Microsoft Data Access Components 2.1(or later) have been properly installed.
问题现象:软件在启动时报如下错误信息:Exception Exception in module zhujiangguanjia.exe at 001da37f. Error creating obj ...
- openStack 对象存储object storage swift
随机推荐
- H3C端口角色的确定
- 一眼看懂promise async的区别
// promise方法 let p1 = new Promise((resolve,reject) => { setTimeout(() => { resolve('我是p1') },4 ...
- ZR979B. 【十联测 Day 9】唯一睿酱
ZR979B. [十联测 Day 9]唯一睿酱 题目大意: 给定一个数组\(r_i\),表明对于第\(i\)个数来说,他是\([max(1,i - r_i),min(n,i+r_i)]\)中最大的,求 ...
- Linux 内核存取配置空间
在驱动已探测到设备后, 它常常需要读或写 3 个地址空间: 内存, 端口, 和配置. 特别 地, 存取配置空间对驱动是至关重要的, 因为这是唯一的找到设备被映射到内存和 I/O 空间的位置的方法. 因 ...
- 【38.63%】【hdu 3047】Zjnu Stadium
Time Limit: 2000/1000 MS (Java/Others) Memory Limit: 32768/32768 K (Java/Others) Total Submission(s) ...
- 【2016常州一中夏令营Day4】
小 W 走迷宫[问题描述]小 W 被小 M 困在了一个方格矩阵迷宫里,矩阵边界在无穷远处,我们做出如下的假设:a. 每走一步时,只能从当前方格移动一格,走到某个相邻的方格上:b. 走过的格子立即塌陷无 ...
- Dubbo-本地测试直连
一.服务提供方 <?xml version="1.0" encoding="UTF-8"?> <beans xmlns="http: ...
- RecursiveTask和RecursiveAction的使用总结
一:什么是Fork/Join框架 Fork/Join框架是Java7提供了的一个用于并行执行任务的框架, 是一个把大任务分割成若干个小任务,最终汇总每个小任务结果后得到大任务结果的框架.我们再通 ...
- 王雅超的学习笔记-大数据hadoop集群部署(十)
Spark集群安装部署
- Python数据分析:手把手教你用Pandas生成可视化图表
大家都知道,Matplotlib 是众多 Python 可视化包的鼻祖,也是Python最常用的标准可视化库,其功能非常强大,同时也非常复杂,想要搞明白并非易事.但自从Python进入3.0时代以后, ...