hive正则匹配,
第一步:创建apachelog表(用于存放apache的访问日志:access_log)
CREATE TABLE apachelog (
host STRING,
identity STRING,
user_d STRING,
time STRING,
request STRING,
status STRING,
size STRING
)ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
"input.regex" = "([^ ]*) ([^ ]*) ([^ ]*) \[(.*)\] \"(.*)\" (-|[0-9]*) (-|[0-9]*)"
)
STORED AS TEXTFILE;
STORED AS TEXTFILE;
第二步:插入数据:load data local inpath "/usr/local/apache/logs/access_log" into table apachelog;
本站文章为和通数据库网友分享或者投稿,欢迎任何形式的转载,但请务必注明出处.
同时文章内容如有侵犯了您的权益,请联系QQ:970679559,我们会在尽快处理。