Quiz RL - Temporal Difference Algorithm

Quiz
•
Computers
•
University
•
Hard
meilana siswanto
Used 2+ times
FREE Resource
10 questions
Show all answers
1.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Dalam lingkup kajian Reinforcement Learning, Temporal Difference
Learning termasuk ...
Model-based algorithm
Model free algorithm
Reward based algorithm
Environment-based algorithm
2.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Berikut pernyataan yang benar tentang Temporal Difference
Learning adalah...
Model-based environment
Agent belajar dari lingkungan melalui pemodelan lengkap
Kombinasi dari Monte Carlo dan Dynamic Programming
Tidak ada jawaban yang benar
3.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Mengapa dikatakan bahwa Monte Carlo adalah ide dasar dari Temporal Difference Learning?
Karena dalam Monte Carlo, value-nya dievaluasi tiap episode
Karena pada algoritma Monte Carlo tidak perlu ada termination
Karena Monte Carlo merupakan model free algorithm
Karena setiap episode dalam Monte Carlo tidak independent
4.
MULTIPLE CHOICE QUESTION
45 sec • 1 pt
Berikut merupakan pernyataan yang benar tentang Temporal Difference Learning adalah...
Bersifat episodik dalam melakukan evaluasi value-nya
Bersifat non-episodik dalam melakukan evaluasi value-nya
Tidak memiliki learning rate
Bersifat independent, tidak bootstrapping
5.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Apa yang menyebabkan Dynamic Programming (DP) merupakan ide dari Temporal Difference Learning (TDL)?
DP dalam meng-update value-state harus menyelesaikan 1 episode
DP dapat meng-update value-state per-step dari episode
Semua kemungkinan transisi state tidak dipertimbangkan pada setiap step
TDL tidak bersifat bootstrapping sebagaimana DP
6.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Dua diantara pilihan berikut mana yang merupakan Temporal Difference Control adalah...
Monte Carlo dan Dynamic Programming
Markov Decision Process dan Monte Carlo
SARSA dan Q-Learning
SARSA dan Monte Carlo
7.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Apa yang dimaksud dengan SARSA pada Temporal Difference Learning?
Merupakan Action-Value function
Off policy
Update value secara episodik
Semua jawaban benar
Create a free account and access millions of resources
Similar Resources on Wayground
10 questions
Algoritma dan Modelling Machine Learning

Quiz
•
University
10 questions
Classification in Machine Learning2

Quiz
•
University
15 questions
Kuis Kelas Maya

Quiz
•
12th Grade - University
10 questions
Pertemuan 6 DWBI

Quiz
•
University
10 questions
TEP13 LEARNER - CENTERED PEDAGOGY*

Quiz
•
University
11 questions
Machine Learning Quiz by Vishal Sir

Quiz
•
University
15 questions
Chapter Eleven

Quiz
•
University
10 questions
Deep Learning IF

Quiz
•
University
Popular Resources on Wayground
10 questions
Video Games

Quiz
•
6th - 12th Grade
20 questions
Brand Labels

Quiz
•
5th - 12th Grade
15 questions
Core 4 of Customer Service - Student Edition

Quiz
•
6th - 8th Grade
15 questions
What is Bullying?- Bullying Lesson Series 6-12

Lesson
•
11th Grade
25 questions
Multiplication Facts

Quiz
•
5th Grade
15 questions
Subtracting Integers

Quiz
•
7th Grade
22 questions
Adding Integers

Quiz
•
6th Grade
10 questions
Exploring Digital Citizenship Essentials

Interactive video
•
6th - 10th Grade
Discover more resources for Computers
20 questions
Definite and Indefinite Articles in Spanish (Avancemos)

Quiz
•
8th Grade - University
7 questions
Force and Motion

Interactive video
•
4th Grade - University
36 questions
Unit 5 Key Terms

Quiz
•
11th Grade - University
7 questions
Figurative Language: Idioms, Similes, and Metaphors

Interactive video
•
4th Grade - University
15 questions
Properties of Equality

Quiz
•
8th Grade - University
38 questions
WH - Unit 3 Exam Review*

Quiz
•
10th Grade - University
21 questions
Advise vs. Advice

Quiz
•
6th Grade - University
12 questions
Reading a ruler!

Quiz
•
9th Grade - University