Hive Row Format

Description

Spark supports a Hive row format in the CREATE TABLE and TRANSFORM clauses to specify a SerDe or a text delimiter. There are two ways to define a row format in the row_format of a CREATE TABLE or TRANSFORM clause.

  1. The SERDE clause specifies a custom SerDe class.
  2. The DELIMITED clause specifies a delimiter, an escape character, a null character, and so on for the native SerDe.
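
As a minimal sketch of the first option, the statement below names a SerDe class explicitly and passes it one property. It assumes a Spark session with Hive support enabled; the table name person_serde and its columns are hypothetical, and LazySimpleSerDe is used only because it ships with Hive.

```sql
-- Hypothetical table; assumes Hive support is enabled in the Spark session.
-- The SERDE clause names the SerDe class, and SERDEPROPERTIES passes
-- key/value settings to it (here, a comma as the field delimiter).
CREATE TABLE person_serde (name STRING, age INT)
    ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
    WITH SERDEPROPERTIES ('field.delim' = ',')
    STORED AS TEXTFILE;
```

The same row_format can follow a TRANSFORM clause; only the CREATE TABLE form is shown here.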

Syntax

row_format:    
    SERDE serde_class [ WITH SERDEPROPERTIES (k1=v1, k2=v2, ... ) ]
    | DELIMITED [ FIELDS TERMINATED BY fields_terminated_char [ ESCAPED BY escaped_char ] ] 
        [ COLLECTION ITEMS TERMINATED BY collection_items_terminated_char ] 
        [ MAP KEYS TERMINATED BY map_key_terminated_char ]
        [ LINES TERMINATED BY row_terminated_char ]
        [ NULL DEFINED AS null_char ]
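
To illustrate how the DELIMITED alternative in the grammar above fits together, here is a sketch that uses every optional subclause once. The table name family, its columns, and the chosen delimiter characters are illustrative assumptions, not values required by Spark.

```sql
-- Hypothetical table with array, map, and struct columns so that each
-- delimiter in the DELIMITED clause has something to separate.
CREATE TABLE family (
        name STRING,
        friends ARRAY<STRING>,
        children MAP<STRING, INT>,
        address STRUCT<street: STRING, city: STRING>
    )
    ROW FORMAT DELIMITED
        FIELDS TERMINATED BY ',' ESCAPED BY '\\'   -- column separator and escape character
        COLLECTION ITEMS TERMINATED BY '_'         -- separates array, map, and struct items
        MAP KEYS TERMINATED BY ':'                 -- separates a map key from its value
        LINES TERMINATED BY '\n'                   -- row separator
        NULL DEFINED AS 'foonull'                  -- text that stands for NULL
    STORED AS TEXTFILE;
```

The subclauses are written in the order the grammar lists them, and ESCAPED BY appears only together with FIELDS TERMINATED BY, as the nested brackets indicate.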

Parameters