
Reinforcement Learning and Deep RL Python Theory and Projects - Changing Policy Architecture
Interactive Video
•
Information Technology (IT), Architecture
•
University
•
Practice Problem
•
Hard
Wayground Content
FREE Resource
Read more
7 questions
Show all answers
1.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Which policy is recommended for image-related tasks?
None of the above
RNN Policy
CNN Policy
MLP Policy
2.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
What is the default batch size for the MLP policy mentioned in the video?
128
256
512
64
3.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
How many layers are defined in the new MLP network architecture?
3
4
5
2
4.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
What is the purpose of setting 'verbose' to 1 during training?
To increase the training speed
To get detailed statistics
To reduce memory usage
To enable GPU acceleration
5.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
What is the average reward obtained after 20,000 steps?
180
150
220
200
6.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Where is the trained model saved?
In a zip format
In a JSON file
In a text file
In a database
7.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
What is the next topic hinted at the end of the video?
Modifying the algorithm
Improving the reward function
Changing the policy architecture
Adding more layers
Access all questions and much more by creating a free account
Create resources
Host any resource
Get auto-graded reports

Continue with Google

Continue with Email

Continue with Classlink

Continue with Clever
or continue with

Microsoft
%20(1).png)
Apple
Others
Already have an account?