欢迎投稿

今日深度:

Hive总结,

Hive总结,


建表语句
1、分隔符:row format delimited fields terminated by ‘\t’
2、格式:stored as textfile
3、存储路径:location ‘/user/stu2’
4、根据查询结果创建表:create table stu3 as select * from stu2
5、根据已经存在的表结构创建表:create table stu4 like stu2
6、从hdfs加载数据:load data inpath ‘/hivedatas/techer.csv’ into table techer
7、分区表:partitioned by (month string)
8、本地加载数据到分区表:load data local inpath ‘/export/servers/hivedatas/score.csv’ into table score partition (month=‘201806’)
9、多分区查询:select * from score where month = ‘201806’ union all select * from score where month = ‘201807’
10、添加分区:alter table score add partition(month=‘201804’) partition(month = ‘201803’)
添加成功后就会多一个文件夹
11、外部表建立关联:msck repair table score4
12、在实际的项目开发当中,hive表的数据存储格式一般选择:orc或parquet。压缩方式一般选择snappy。

www.htsjk.Com true http://www.htsjk.com/hive/35116.html NewsArticle Hive总结, 建表语句 1、分隔符:row format delimited fields terminated by ‘\t’ 2、格式:stored as textfile 3、存储路径:location ‘/user/stu2’ 4、根据查询结果创建表:create table stu3 as select * from...
相关文章
    暂无相关文章
评论暂时关闭