Distributed Computing with Spark SQL

Distributed Computing with Spark SQL

University

39 Qs

quiz-placeholder

Similar activities

Module 1.2 - Governance

Module 1.2 - Governance

University

39 Qs

TR - BITM215 Excel Exam 3

TR - BITM215 Excel Exam 3

University

41 Qs

Computer Network

Computer Network

12th Grade - University

40 Qs

AP Java Exam

AP Java Exam

8th Grade - University

35 Qs

digital fluency - Test

digital fluency - Test

University

40 Qs

B.tech ODD Sem Salesforce Challenge

B.tech ODD Sem Salesforce Challenge

University

40 Qs

CompTIA Security+ SY0-601 Chapters 9 - 12

CompTIA Security+ SY0-601 Chapters 9 - 12

University

40 Qs

ALGO - Prelim Reviewer

ALGO - Prelim Reviewer

University

34 Qs

Distributed Computing with Spark SQL

Distributed Computing with Spark SQL

Assessment

Quiz

Computers

University

Easy

Created by

Rdip play

Used 10+ times

FREE Resource

39 questions

Show all answers

1.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Which of the following are true when it comes to the business value of big data? (Select all that apply.)

The size of the data businesses collect is growing

Businesses are increasingly making data-driven decisions

Automated technologies mean that data scientists and data analysts are no longer needed

2.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Spark uses... (Select all that apply.)

A distributed cluster of networked computers made of a driver node and many executor nodes

A driver node to distribute work across a number of executor nodes

One very large computer that is able to run computation against large databases

Your database technology (e.g., Postgres or SQL Server) to run Spark queries

A distributed cluster of networked computers made of many driver nodes and many executor nodes

3.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

How does Spark execute code backed by DataFrames? (Select all that apply.)

It optimizes your query by figuring out the best "how" to execute what you want

It separates the "logical plan" of what you want to accomplish from the "physical plan" of how to do it so it can optimize the query

It executes code determined in advance

It iterates over all of the source data to exhaustively evaluate queries

4.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

What are the properties of Spark DataFrames? (Select all that apply.)

Resilient: Fault-tolerant

Dataset: Collection of partitioned data

Distributed: Computed across multiple nodes

Tables: Operates as any table in SQL environments

5.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

What is the difference between Spark and database technologies? (Select all that apply.)

Spark is a highly optimized compute engine and is not a database

Spark is a computation engine and is not for data storage

Spark operates for both data storage and computation

Spark does not interact with databases but uses its proprietary DataFrame technology instead

Spark in an alternative to traditional databases

6.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

What is Amdahl's law of scalability? (Select all that apply.)

A formula that gives the theoretical speedup as a function of the percentage of a computation that can be parallelized

Amdahl's law states that the speedup of a task is a function of how much of that task can be parallelized

A formula that gives the expected speed of a single processor performing a computation

A formula that gives the theoretical speedup as a function of the size of a partition (or subset) of data

A formula that gives the number of processors (or other unit of parallelism) needed to complete a task

7.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Spark offers a unified approach to analytics. What does this include? (Select all that apply.)

Spark allows analysts, data scientists, and data engineers to all use the same core technology

Spark is able to connect to data where it lives in any number of sources, unifying the components of a data application

Spark unifies applications such as SQL queries, streaming, and machine learning

Spark code can be written in the following languages: SQL, Scala, Java, Python, and R

Spark unifies databases with optimized computation allowing for faster computation against the data it stores

Create a free account and access millions of resources

Create resources
Host any resource
Get auto-graded reports
or continue with
Microsoft
Apple
Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?