Flatten Nested Struct Columns in Parquet and Export to CSV
Beginner Mode

Start your terminal to use beginner mode.

Scenario

A Parquet file contains customer data with nested struct columns that need to be flattened for easier analysis and export to CSV format.

Task

Write a Python script at /home/interview/flatten_data.py using pandas that reads /home/interview/customers_nested.parquet, flattens all nested struct columns into separate columns with underscore naming (e.g., address_street, contact_phone), and writes the result to /home/interview/flattened_data.csv.

Note: pandas and pyarrow are already installed.

Example

Input (nested structure):

id | first_name | address                                    | contact
1  | John       | {street: "123 Main", city: "NYC", ...}    | {phone: "555-0100", ...}

Expected output (flattened):

id | first_name | address_street | address_city | contact_phone | ...
1  | John       | 123 Main       | NYC          | 555-0100      | ...

Terminal requires a larger screen

Open this page on a desktop or tablet (≥ 768px) to launch the terminal and practice hands-on.

Linux Terminal Environment

Write and execute your solution in the terminal below.

Sign In

Track

Question Difficulty Company Access
Need more practice in this area? Explore more questions →