Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

You can parse and flatten JSON files with arrays in Wrangler. Because JSON files can be large and because Wrangler supports up to 10 MB for sampling files, the JSON file must be stored in Google Cloud Storage or BigQuery.

To parse JSON files with arrays, follow these steps:

  1. In Wrangler, Wrangler source (GCS or BigQuery), read a JSON file to Wrangle.

  2. Once navigated to Wrangler tab, open the dropdown on the column and choose Parse → JSON.

  3. After Step 4, the fields in the JSON will form the column. Identify the column that has rows.

  4. Open the drop-down from that specific column and select Explode → Array (by flattening).

  5. This will explode the elements in the JSON array as individual data in the same column in Wrangler.

After step 7 further directives can be applied based on cleanup needed.

 

  • No labels