M14 : Processing - EMR

Assessment

Quiz

Other

Professional Development

Medium

Created by

Carina Martin

Used 2+ times

10 questions

1.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Which are components of Elastic MapReduce? (Choose all that apply)

Master nodes

Core nodes

Leader nodes

Task nodes

S3 or HDFS

2.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

When would you use Elastic MapReduce (EMR)? (Choose all that apply)

for analysis of structured data

for on-demand EC2 billing

for analysis of unstructured data

for serverless querying

3.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Which are true statements regarding Elastic MapReduce (EMR)? (Choose all that apply)

EMR clusters can't be used in conjunction with Auto Scaling groups.

It uses S3 to store data for its cluster.

It is a customer-managed, EC2 cluster-based product.

It is an AWS-managed, EC2 cluster-based product.

It is an AWS product that allows the analysis of large sets of structured and unstructured data.

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You have advertising campaign information stored in a DynamoDB table. You need to write queries that join clickstream data to identify the most effective categories of ads that are displayed on websites. Which tool should you use?

QuickSight

Kinesis Data Streams

EMR

Data Pipeline
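
To make the query in this question concrete, here is a minimal sketch in plain Python of joining clickstream events to campaign rows and ranking ad categories by clicks. The sample rows, the field names `ad_id` and `category`, and the function `clicks_per_category` are all hypothetical; the sketch illustrates only the join itself, not any particular AWS service.

```python
from collections import Counter

# Hypothetical rows standing in for the campaign table (field names are assumptions).
campaigns = [
    {"ad_id": "a1", "category": "sports"},
    {"ad_id": "a2", "category": "travel"},
    {"ad_id": "a3", "category": "sports"},
]

# Hypothetical clickstream events; each event records one click on one ad.
clicks = [{"ad_id": "a1"}, {"ad_id": "a3"}, {"ad_id": "a2"}, {"ad_id": "a1"}]


def clicks_per_category(campaigns, clicks):
    """Join clicks to campaigns on ad_id and count clicks per category."""
    category_by_ad = {row["ad_id"]: row["category"] for row in campaigns}
    counts = Counter()
    for event in clicks:
        category = category_by_ad.get(event["ad_id"])
        if category is not None:  # skip clicks with no matching campaign row
            counts[category] += 1
    return counts


# Categories ordered from most to least clicked.
ranking = clicks_per_category(campaigns, clicks).most_common()
print(ranking)
```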

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You need to store and process data quickly in a cost-effective manner. You can move data easily from its location on disk to wherever you'd like without needing to stream the data. Also, you do not know how much data you will be handling in 6 months, and your processing needs spike intermittently. Specifically, you need to transform the data that comes in by aggregating the different disparate metrics into summary information. Which Big Data tools should you use?

DynamoDB and Redshift

Kinesis Data Streams and DynamoDB

S3 and Spark on EMR

S3 and Amazon Machine Learning
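
The transformation described in this question, aggregating disparate metrics into summary information, can be sketched in plain Python as follows. The metric names, sample values, and the function `summarize` are hypothetical and chosen only for illustration; at real data volumes the same group-and-aggregate step would run on a distributed engine rather than in a single process.

```python
from collections import defaultdict

# Hypothetical raw records: (metric name, value).
records = [
    ("cpu", 0.5),
    ("cpu", 0.7),
    ("latency_ms", 120.0),
    ("latency_ms", 80.0),
    ("cpu", 0.6),
]


def summarize(records):
    """Group values by metric name and reduce each group to summary statistics."""
    grouped = defaultdict(list)
    for name, value in records:
        grouped[name].append(value)
    return {
        name: {
            "count": len(values),
            "min": min(values),
            "max": max(values),
            "mean": sum(values) / len(values),
        }
        for name, values in grouped.items()
    }


summary = summarize(records)
for name, stats in summary.items():
    print(name, stats)
```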

6.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Your EMR cluster uses 12 m4.large instances and runs 24 hours per day, but it is only used for processing and reporting during business hours. Which options can you use to reduce the costs? (Choose two answers)

Run 12 d2.8xlarge instances instead, without turning the cluster off.

Use Spot instances for task nodes when needed.

Use the ReduceMapper distribution of EMR.

Migrate the data from HDFS to S3 using S3DistCp and turn off the cluster when not in use.
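
As a rough worked example of the scenario in this question, the arithmetic below compares the instance-hours an always-on cluster accrues with the instance-hours in which it is actually used. The instance count and the 24-hour schedule come from the question; the definition of business hours as 8 hours a day, 5 days a week is an assumption, and no prices are used.

```python
INSTANCES = 12                 # from the question
HOURS_PER_WEEK = 24 * 7        # cluster runs around the clock

# Assumption (not stated in the question): 8 business hours a day, 5 days a week.
BUSINESS_HOURS_PER_WEEK = 8 * 5

always_on = INSTANCES * HOURS_PER_WEEK       # instance-hours accrued per week
used = INSTANCES * BUSINESS_HOURS_PER_WEEK   # instance-hours actually used per week
idle = always_on - used                      # instance-hours with no work to do
idle_share = idle / always_on                # fraction of accrued time spent idle

print(f"accrued: {always_on}, used: {used}, idle: {idle} ({idle_share:.1%})")
```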

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is AWS Glue?

A fully managed extract, transform, and load (ETL) service

A petabyte-scale cloud data warehouse

A real-time data streaming service

None of the above
