Transfer data in and out of Hadoop using only PL/SQL

Hi,

Is there any Oracle product that helps transfer data in and out of Hadoop using only PL/SQL?

And can it also process the data in parallel?
marrowyung, Senior Technical architecture (Data), Asked:

Sujith, Data Architect, Commented:
Sqoop is the Apache tool for transferring data between Hadoop and relational databases. It uses JDBC to connect to the database. Sqoop supports both import (into Hadoop) and export (from Hadoop).

Here is the Sqoop documentation:
http://sqoop.apache.org/docs/1.4.7/SqoopUserGuide.html

If you are using Cloudera, here is the doc:
https://archive.cloudera.com/cdh5/cdh/5/sqoop-1.4.6-cdh5.6.1/SqoopUserGuide.html#_connecting_to_a_database_server
marrowyung, Senior Technical architecture (Data), Author Commented:
Is Sqoop free? Does it integrate well with Oracle? And can Oracle then use PL/SQL to get data from Hadoop?
Sujith, Data Architect, Commented:
Which Hadoop distribution are you using? Sqoop is integrated with the Hadoop installation. As far as I am aware, there is no additional cost involved.

Sqoop resides in your Hadoop cluster and pulls and pushes data from within the cluster. Because it works over JDBC, you can integrate it with Oracle as long as the Oracle JDBC driver is installed in your cluster.

No, it doesn't work with PL/SQL. As I mentioned before, Sqoop is a tool that resides outside the Oracle database, so it has no connection with PL/SQL.
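
As a rough illustration of the JDBC-based setup described above, here is a small Python sketch that only assembles a `sqoop import` command line; it does not run anything. The host, service name, user, table, and paths are hypothetical placeholders, and the flags are the ones listed in the Sqoop 1.4.x user guide. `--num-mappers` sets how many parallel map tasks Sqoop uses for the transfer, which is where the parallelism asked about in the question comes from.

```python
import shlex


def build_sqoop_import(jdbc_url, username, password_file, table,
                       target_dir, num_mappers=4, split_by=None):
    """Return the argument list for a `sqoop import` of one table.

    All values passed in are supplied by the caller; nothing here talks
    to Oracle or Hadoop.
    """
    cmd = [
        "sqoop", "import",
        "--connect", jdbc_url,             # JDBC URL of the Oracle database
        "--username", username,
        "--password-file", password_file,  # file holding the password
        "--table", table,                  # source table in Oracle
        "--target-dir", target_dir,        # destination directory in HDFS
        "--num-mappers", str(num_mappers), # number of parallel map tasks
    ]
    if split_by:
        # Column used to divide the rows among the map tasks.
        cmd += ["--split-by", split_by]
    return cmd


if __name__ == "__main__":
    # Hypothetical connection details, for illustration only.
    cmd = build_sqoop_import(
        jdbc_url="jdbc:oracle:thin:@//dbhost.example.com:1521/ORCLPDB1",
        username="scott",
        password_file="/user/etl/oracle.password",
        table="EMPLOYEES",
        target_dir="/data/raw/employees",
        num_mappers=8,
        split_by="EMPLOYEE_ID",
    )
    print(shlex.join(cmd))
```

The printed command would be run on a node of the Hadoop cluster where Sqoop and the Oracle JDBC driver are available, not from inside the Oracle database.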

marrowyung, Senior Technical architecture (Data), Author Commented:
"Which Hadoop distribution are you using? Sqoop is integrated with the Hadoop installation. As far as I am aware there is no additional cost involved"

Thanks.

"Sqoop resides in your Hadoop cluster and it pulls and push data from within the cluster."

Is Oracle RAC the cluster you are referring to?

"No it doesn't work with Pl/SQL. As I mentioned before; Sqoop is a tool residing outside the Oracle database, hence it has no connection with Pl/SQL."

I am new to this area! Then how can we query the data out without a query language?
Sujith, Data Architect, Commented:
>> oracle RAC is the cluster you are referring to ?
No, I am referring to the Hadoop cluster.

>> i am new to this area! then how can we query data out without query language ?
Your Oracle database and the Hadoop cluster are two physically separate data stores. Sqoop is a tool within Hadoop that can interface with any RDBMS to read and write data. Sqoop has its own methods for reading and writing data, and it also supports SQL when you interface with an RDBMS.

(Oracle Database) <----->  [ Sqoop <-----> (Hadoop Data Store(Hive/Hbase/Impala etc) ) ]

Think of Sqoop as a tool similar to Import/Export in the Oracle world (just for the sake of explanation): it takes a number of arguments that identify the data it has to operate on.
I highly recommend reading about the Hadoop components in the links above.
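
To make the "number of arguments" point a little more concrete, here is a hedged Python sketch that builds two Sqoop command lines: an import driven by a free-form SQL query, and an export from HDFS back into an Oracle table. It only constructs the argument lists. The function names, connection details, query, and paths are hypothetical; the flags come from the Sqoop 1.4.x user guide, which also requires the literal `$CONDITIONS` token in the WHERE clause of a free-form query.

```python
import shlex


def build_sqoop_query_import(jdbc_url, username, password_file,
                             query, split_by, target_dir):
    """Argument list for a `sqoop import` driven by a SQL query."""
    if "$CONDITIONS" not in query:
        # Sqoop replaces this token with a range predicate per map task.
        raise ValueError("query must contain the $CONDITIONS token")
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,
        "--query", query,            # SQL executed against the RDBMS
        "--split-by", split_by,      # column used to partition the work
        "--target-dir", target_dir,  # HDFS directory for the result
    ]


def build_sqoop_export(jdbc_url, username, password_file,
                       table, export_dir):
    """Argument list for a `sqoop export` from HDFS into a table."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,
        "--table", table,            # existing target table in the RDBMS
        "--export-dir", export_dir,  # HDFS directory holding the data
    ]


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    url = "jdbc:oracle:thin:@//dbhost.example.com:1521/ORCLPDB1"
    pw = "/user/etl/oracle.password"
    imp = build_sqoop_query_import(
        url, "scott", pw,
        "SELECT id, name FROM employees WHERE $CONDITIONS",
        "id", "/data/raw/employees_q",
    )
    exp = build_sqoop_export(url, "scott", pw,
                             "EMPLOYEE_SUMMARY", "/data/out/summary")
    print(shlex.join(imp))
    print(shlex.join(exp))
```

Note that the SQL here is ordinary SQL sent over JDBC to the database; it is still Sqoop, not PL/SQL inside Oracle, that drives the transfer.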

Sujith, Data Architect, Commented:
Having talked about Sqoop: it is one of the tools for moving data in and out of an Oracle database (or any RDBMS). It is not the only mechanism for this purpose; there are others. For example, Oracle has connectors for Hadoop. I haven't used them myself, so I can't elaborate much on them. You can read about them here:

http://www.oracle.com/technetwork/database/database-technologies/bdc/big-data-connectors/overview/index.html
marrowyung, Senior Technical architecture (Data), Author Commented:
Thanks.