Apache Spark 3 for Data Engineering and Analytics with Python - Challenge Part 2 – Brief

Assessment

Interactive Video

Information Technology (IT), Architecture

University

Hard

Created by Quizizz Content

7 questions

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the first task in the data cleaning challenge?

Change data types of columns

Remove bad records from the sales data frame

Extract city and state information

Add year and month columns

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

How can you extract the city from the purchase address?

By extracting the first element

By extracting the last element

By extracting the third element

By extracting the second element
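For illustration, the splitting idea behind this question can be sketched in plain Python. The sample address and its "street, city, state zip" layout are assumptions, not taken from the course data, and the helper name split_address is hypothetical. In PySpark, the analogous calls would be pyspark.sql.functions.split together with Column.getItem.

```python
# Hypothetical sample address; the "street, city, state zip" layout is an assumption.
purchase_address = "917 1st St, Dallas, TX 75001"


def split_address(address):
    """Split an address on commas and trim whitespace from each part."""
    return [part.strip() for part in address.split(",")]


parts = split_address(purchase_address)

# Under the assumed layout, each element holds one address component.
street = parts[0]
city = parts[1]
# The last element holds "state zip"; the state is the token before the space.
state = parts[2].split(" ")[0]

print(street, city, state, sep=" | ")
```

Which element holds the city depends entirely on the address layout, so it is worth inspecting a few rows of the real data before fixing an index.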

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What data type should the 'quantity ordered' column be changed to?

String

Boolean

Float

Integer

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which columns should be added to the data frame based on the order date?

Day and hour

Year and month

Week and day

Minute and second
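As a rough sketch of deriving date parts from an order date, the plain-Python example below parses one value and reads fields from it. The sample value and its month/day/two-digit-year format are assumptions, not taken from the course data. In PySpark, the analogous functions would be to_timestamp, year, and month from pyspark.sql.functions.

```python
from datetime import datetime

# Hypothetical order date; the "month/day/year hour:minute" format is an assumption.
order_date = "04/19/19 08:46"

# Parse the string into a datetime so individual fields can be read from it.
parsed = datetime.strptime(order_date, "%m/%d/%y %H:%M")

order_year = parsed.year
order_month = parsed.month

print(order_year, order_month)
```

If the real column uses a different format, only the format string passed to strptime needs to change.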

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What should be done with columns containing null values?

Leave them as they are

Replace them with zeros

Remove them

Fill them with average values

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

How should the final data frame be saved?

As a Parquet file partitioned by year and month

As a JSON file

As an Excel file

As a CSV file

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is a recommended approach if you encounter a Spark function not covered in the course?

Search online and use forums like Stack Overflow

Ignore the task

Wait for the next lesson

Ask a colleague for help