Lokesh

Apache Hive - Table Partition DDL

Discussion created by Lokesh on Jul 31, 2017
Latest reply on Aug 2, 2017 by maprcommunity

Hi All,

 

I understood the idea of partition and how query would run fast if table is partitioned. But i am finding it difficult to understand below DDL query used for creating partitioned table.

Below code snippet could be found in DA_440_SlideGuide.pdf page 51.

 

create table antartica.windspeed ( year:INT, month:string, knots:string) partitioned BY (station:string) ROW format delimited fields terminated by ',' lines terminated by '\n' stored as textfile;

 

Here, going by the query, table windspeed is partitioned on column 'station' but column 'station' is not even part of 3 columns (year, month and knots are only columns listed) mentioned in create table script. 

 

How would one run query

select * from windspeed where station = 'xyz';

to take advantage of partitioning when station is not even part of table columns?

 

Could somebody please guide understand DDL?

 

Regards,

Lokesh

Outcomes