
C5M1

Quiz
•
Information Technology (IT)
•
University
•
Medium
Abylai Aitzhanuly
Used 1+ times
FREE Resource
10 questions
Show all answers
1.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Suppose your training examples are sentences (sequences of words). Which of the following refers to the jth word in the ith training example?
x(i)<j>
x<i>(j)
x(j)<i>
x<j>(i)
2.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Tx = Ty
Tx < Ty
Tx > Ty
Tx = 1
3.
MULTIPLE SELECT QUESTION
45 sec • 1 pt
Speech recognition (input an audio clip and output a transcript)
Sentiment classification (input a piece of text and output a 0/1 to denote positive or negative sentiment)
Image classification (input an image and output a label)
Gender recognition from speech (input an audio clip and output a label indicating the speaker’s gender)
4.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Estimating P(y<1>, y<2>, ...., y<t-1>)
Estimating P(y<1>)
Estimating P(y<t> | y<1>, y<2>, ...., y<t-1>)
Estimating P(y<t> | y<1>, y<2>, ...., y<t>)
5.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
(i) Use the probabilities output by the RNN to pick the highest probability word for that time-step as y<t>. (ii) Then pass the ground-truth word from the training set to the next time-step.
(i) Use the probabilities output by the RNN to randomly sample a chosen word for that time-step as y<t>. (ii) Then pass the ground-truth word from the training set to the next time-step.
(i) Use the probabilities output by the RNN to pick the highest probability word for that time-step as y<t>. (ii) Then pass this selected word to the next time-step.
(i) Use the probabilities output by the RNN to randomly sample a chosen word for that time-step as y<t>. (ii) Then pass this selected word to the next time-step.
6.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
You are training an RNN, and find that your weights and activations are all taking on the value of NaN (“Not a Number”). Which of these is the most likely cause of this problem?
Vanishing gradient problem
Exploding gradient problem.
ReLU activation function g(.) used to compute g(z), where z is too large.
Sigmoid activation function g(.) used to compute g(z), where z is too large.
7.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Suppose you are training a LSTM. You have a 10000 word vocabulary, and are using an LSTM with 100-dimensional activations a<t>. What is the dimension of Γu at each time step?
1
100
300
10000
Create a free account and access millions of resources
Similar Resources on Wayground
6 questions
Kuis C++

Quiz
•
University
15 questions
Perl

Quiz
•
University
12 questions
Docker q2

Quiz
•
University
15 questions
Bakytbek

Quiz
•
University
10 questions
Quiz Meet 1 Mini SC Programing

Quiz
•
University
11 questions
Conceptos Básicos HTML5 y CSS3

Quiz
•
University
10 questions
WS&T

Quiz
•
University
10 questions
Rooting React

Quiz
•
University
Popular Resources on Wayground
10 questions
Video Games

Quiz
•
6th - 12th Grade
20 questions
Brand Labels

Quiz
•
5th - 12th Grade
15 questions
Core 4 of Customer Service - Student Edition

Quiz
•
6th - 8th Grade
15 questions
What is Bullying?- Bullying Lesson Series 6-12

Lesson
•
11th Grade
25 questions
Multiplication Facts

Quiz
•
5th Grade
15 questions
Subtracting Integers

Quiz
•
7th Grade
22 questions
Adding Integers

Quiz
•
6th Grade
10 questions
Exploring Digital Citizenship Essentials

Interactive video
•
6th - 10th Grade
Discover more resources for Information Technology (IT)
20 questions
Definite and Indefinite Articles in Spanish (Avancemos)

Quiz
•
8th Grade - University
7 questions
Force and Motion

Interactive video
•
4th Grade - University
36 questions
Unit 5 Key Terms

Quiz
•
11th Grade - University
7 questions
Figurative Language: Idioms, Similes, and Metaphors

Interactive video
•
4th Grade - University
15 questions
Properties of Equality

Quiz
•
8th Grade - University
38 questions
WH - Unit 3 Exam Review*

Quiz
•
10th Grade - University
21 questions
Advise vs. Advice

Quiz
•
6th Grade - University
12 questions
Reading a ruler!

Quiz
•
9th Grade - University