Six Hive Storage Formats

2021-04-27 11:28:39

The six Hive storage formats:

AVRO, ORC, PARQUET, RCFILE, SEQUENCEFILE, TEXTFILE

(The keywords are case-insensitive, so avro, orc, parquet, rcfile, sequencefile, and textfile work as well.)

Syntax

Each shorthand clause below is followed by its standard (expanded) syntax.

STORED AS AVRO / STORED AS AVROFILE

ROW FORMAT SERDE
  'org.apache.hadoop.hive.serde2.avro.AvroSerDe'
STORED AS INPUTFORMAT
  'org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputFormat'

STORED AS ORC / STORED AS ORCFILE

ROW FORMAT SERDE
  'org.apache.hadoop.hive.ql.io.orc.OrcSerde'
STORED AS INPUTFORMAT
  'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat'

STORED AS PARQUET / STORED AS PARQUETFILE

ROW FORMAT SERDE
  'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe'
STORED AS INPUTFORMAT
  'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'

STORED AS RCFILE

STORED AS INPUTFORMAT
  'org.apache.hadoop.hive.ql.io.RCFileInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.hive.ql.io.RCFileOutputFormat'

STORED AS SEQUENCEFILE

STORED AS INPUTFORMAT
  'org.apache.hadoop.mapred.SequenceFileInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.mapred.SequenceFileOutputFormat'

STORED AS TEXTFILE

STORED AS INPUTFORMAT
  'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.hive.ql.io.IgnoreKeyTextOutputFormat'
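To show where these clauses go in a full statement, here is a minimal sketch that creates the same ORC table twice, once with the shorthand and once with the standard syntax. The table names (demo_orc_short, demo_orc_long) and columns are hypothetical and chosen only for illustration. The class names are the ones listed above for ORC.

```sql
-- Hypothetical tables for illustration; names and columns are assumptions.

-- Shorthand form.
CREATE TABLE demo_orc_short (id INT, name STRING)
STORED AS ORC;

-- Standard (expanded) form, using the ORC classes listed above.
CREATE TABLE demo_orc_long (id INT, name STRING)
ROW FORMAT SERDE
  'org.apache.hadoop.hive.ql.io.orc.OrcSerde'
STORED AS INPUTFORMAT
  'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat';

-- Inspect the storage settings Hive recorded for each table.
DESCRIBE FORMATTED demo_orc_short;
DESCRIBE FORMATTED demo_orc_long;
```

If the two forms are equivalent, the SerDe Library, InputFormat, and OutputFormat rows in the DESCRIBE FORMATTED output should match for both tables. The same pattern should apply to the other five formats by swapping in their keyword and classes.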
