Indexing JSON logs with Parquet

12月 28, 2016

We frequently use Spark SQL and EMR to analyze terabytes of JSON request logs. The builtin JSON support in Spark is easy to use and works well for most use cases. For example, this small piece of code will infer the schema of the files and provide a table that can be queried with standard SQL:

from Pocket http://ift.tt/2iaM8l5
via IFTTT

このブログを検索

もひかん

Indexing JSON logs with Parquet

このブログの人気の投稿

How to patch OS X for the bash/Shellshock vulnerability | TUAW: Apple news, reviews and how-tos since 2004

(AMD 初のSSD 「Radeon R7 SSD」は国内9月上旬発売。120GB 1万2000円前後から - Engadget...

Beatsの新型完全ワイヤレスイヤホン｢Beats Studio Buds｣は来月に発表へ