Creating a Table:-
Managed table:-
Managed tables are sometimes called internal tables
because Hive controls the lifecycle of their data (more or less). By default, Hive stores
the data for these tables in a subdirectory under the directory defined by hive.metastore.warehouse.dir
(e.g., /user/hive/warehouse). When we drop a managed table, Hive
deletes the table's data along with its metadata. However, managed tables are less
convenient for sharing with other tools.
Query-
create table if not exists tab11 (
  id int,
  name string,
  sal bigint,
  sub array<string>,
  dud map<string, int>,
  addr struct<city:string, state:string, pin:bigint>
)
row format delimited
fields terminated by ','
collection items terminated by '$'
map keys terminated by '#';
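As a rough sketch of how the delimiters above are interpreted and what dropping a managed table does, the statements below load one sample row and read it back. The sample row and the file path /tmp/tab11.txt are hypothetical (they are not from the original post); the table name tab11 and its columns are the ones defined above.

```sql
-- Hypothetical input line for tab11, matching its delimiters:
--   1,Ravi,50000,maths$physics,maths#80$physics#75,Pune$MH$411001
-- ','  separates the columns
-- '$'  separates array elements, map entries, and struct fields
-- '#'  separates each map key from its value

-- Assumed local file path; replace it with a real file on your machine.
LOAD DATA LOCAL INPATH '/tmp/tab11.txt' INTO TABLE tab11;

-- Read individual parts of the complex-type columns.
SELECT id,
       name,
       sub[0]       AS first_subject,   -- first element of the array
       dud['maths'] AS maths_value,     -- map lookup by key
       addr.city    AS city             -- struct field access
FROM tab11;

-- Shows the table type (MANAGED_TABLE) and its location under the warehouse directory.
DESCRIBE FORMATTED tab11;

-- For a managed table this removes the metadata and the table's data.
DROP TABLE tab11;
```

Running DESCRIBE FORMATTED before the drop is a simple way to confirm where Hive is keeping the data for the table.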
External table:-
The EXTERNAL keyword tells Hive that the table is external, and the LOCATION clause
tells Hive where the table's data is located. Because the table is external, Hive does not
assume it owns the data. Therefore, dropping the table does not delete
the data, although the metadata for the table will be deleted.
Query-
create external table if not exists tab7 (
  id int,
  name string,
  sal bigint,
  sub array<string>,
  city string
)
row format delimited
fields terminated by ','
collection items terminated by '$'
location '/home/training/Desktop';
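The behaviour described above can be checked with a short sketch like the following. It assumes the table tab7 has been created as shown and that a data file already exists under its location; the sample row is hypothetical, and the last statement assumes you are working in the Hive CLI, which accepts dfs commands.

```sql
-- Hypothetical input line for tab7, matching its delimiters:
--   1,Ravi,50000,maths$physics,Pune

-- Shows the table type (EXTERNAL_TABLE) and the location given in the LOCATION clause.
DESCRIBE FORMATTED tab7;

-- Query the files in place; Hive reads them from the external location.
SELECT id, name, sub[0] AS first_subject, city
FROM tab7;

-- For an external table this removes only the metadata.
DROP TABLE tab7;

-- The data files should still be listed after the drop.
dfs -ls /home/training/Desktop;
```

Because the files survive the drop, the same table can be recreated later with the same LOCATION and the data will be visible again.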