CSF 7.7 Assessment - Machine Learning Model Training
Flashcard
•
Computers
•
9th - 12th Grade
•
Hard
Wayground Content
FREE Resource
Student preview

10 questions
Show all answers
1.
FLASHCARD QUESTION
Front
Which wombat do you most relate to at this moment?
Back
undefined
2.
FLASHCARD QUESTION
Front
Which of the following best describes a corpus in data science? A type of metadata that describes the properties of a dataset, A machine learning model that has been trained on a large dataset, A set of pre-labeled images used to test the accuracy of a machine learning model, A collection of texts or documents used to analyze and extract information
Back
A collection of texts or documents used to analyze and extract information
Answer explanation
A corpus is a large and structured set of texts or documents that is used for natural language processing (NLP) and text analytics in data science. It can be composed of various types of data, including news articles, social media posts, academic papers, or any other written content that can be processed by a computer. The purpose of a corpus is to provide a representative sample of language use in a particular domain, language, or culture, so that researchers and data scientists can analyze it and extract insights about language patterns, semantic relationships, sentiment, or other aspects of language use. In NLP, a corpus is often used for tasks such as training and testing machine learning models, building language models, or developing text classification algorithms.
3.
FLASHCARD QUESTION
Front
Which of the following should be true about a corpus used as input for a machine learning model? 
 It is large, It is sorted, It is labeled, All of these options
Back
It is large
Answer explanation
The statement "It is large" should be true about a corpus used as input for a machine learning model. This is because a larger corpus provides more diverse and representative examples of the language being analyzed, which can improve the accuracy and robustness of the machine learning model.
The statement "It is sorted" is not necessarily true for a corpus used as input for a machine learning model. However, it can be helpful to organize the corpus by categories or topics to facilitate analysis and model training.
The statement "It is labeled" is also not necessarily true for a corpus used as input for a machine learning model, but it can be helpful in supervised learning scenarios where the model is trained on labeled data with known outcomes. However, unsupervised learning techniques can also be used to analyze unstructured text data without the need for labeled data.
4.
FLASHCARD QUESTION
Front
Which of the following is NOT an example of metadata? The date a photo was taken, The resolution of an image, The color of a text in a document, The author of an article
Back
The color of a text in a document
Answer explanation
Metadata refers to descriptive information about data that provides context, structure, or meaning to it. In data science, metadata can be used to describe various properties of a dataset, such as its format, structure, source, or content. Metadata can help data scientists to understand and analyze data more effectively, as well as to retrieve, manage, and share data with others.
Examples of metadata include the date a photo was taken, the resolution of an image, and the author of an article. These pieces of information provide context and descriptive information about the photo, image, or article. However, the color of text in a document is not considered metadata, as it is a visual attribute of the text and does not provide information about the document itself.
5.
FLASHCARD QUESTION
Front
Which of the following is the best example of metadata? The kind of device that captured the image, Pixels, The differences in images of dogs and wolves, The accuracy of the machine learning model for the image
Back
The kind of device that captured the image
Answer explanation
The best example of metadata from the options given is "The kind of device that captured the image".
Metadata refers to data that provides information about other data. In this case, the type of device that captured an image is metadata because it provides information about the image itself, such as the resolution, image format, and other characteristics that are specific to the device used to capture it.
"Pixels" are not metadata, as they refer to the actual content of the image itself and not information about the image.
"The differences in images of dogs and wolves" is not metadata, as it refers to the content of the images and not information about the images.
"The accuracy of the machine learning model for the image" is also not metadata, as it refers to the performance of the machine learning model on the image and not information about the image itself.
6.
FLASHCARD QUESTION
Front
Which of the following is an example of underfit in classification models? Misclassifying images of cats and dogs as horses, Memorizing the training set and failing to generalize to new, unseen data, Correctly classifying images in the training set but performing poorly on the test set, Overfitting the training data and failing to generalize to new, unseen data
Back
Correctly classifying images in the training set but performing poorly on the test set
Answer explanation
Underfitting occurs when a model is too simple to capture the underlying patterns in the data, resulting in poor performance on both the training set and the test set. In classification models, underfitting can occur when the model is not complex enough to distinguish between different classes or when the training data is not representative enough to capture the variability of the data.
Option A) Misclassifying images of cats and dogs as horses is an example of a model that is overfitting the training data and is not generalizing well to new data.
Option B) Correctly classifying images in the training set but performing poorly on the test set is an example of a model that is underfitting the training data and is not capturing the underlying patterns in the data.
Option C) Memorizing the training set and failing to generalize to new, unseen data is an example of a model that is overfitting the training data and is not generalizing well to new data.
Option D) Overfitting the training data and failing to generalize to new, unseen data is also an example of overfitting, similar to option A.
7.
FLASHCARD QUESTION
Front
Which of the following is the best example of overfit? Blueberries, raspberries, and cherries being labeled separately instead of all being labeled as “fruit”, Carrots, oranges, and basketballs all being labeled as a pumpkin, Frogs and dogs being mislabeled as the same thing because they both end in “ogs”, A dress being mislabeled as a skirt
Back
Blueberries, raspberries, and cherries being labeled separately instead of all being labeled as “fruit”
Answer explanation
The best example of overfitting from the options given is "Blueberries, raspberries, and cherries being labeled separately instead of all being labeled as 'fruit'".
Overfitting occurs when a machine learning model is trained on a dataset that is too specific or complex, and as a result, it becomes overly tuned to the training data and performs poorly on new or unseen data. In this case, if the machine learning model were trained to identify individual types of berries instead of recognizing them all as "fruit," it would be overfitting to the training data and may not perform well when presented with new types of fruit.
The other examples provided do not illustrate overfitting. "Carrots, oranges, and basketballs all being labeled as a pumpkin" is an example of mislabeling or incorrect labeling, not overfitting. "Frogs and dogs being mislabeled as the same thing because they both end in 'ogs'" is an example of incorrect labeling based on superficial similarities, not overfitting. "A dress being mislabeled as a skirt" is an example of a simple labeling error.
Create a free account and access millions of resources
Create resources
Host any resource
Get auto-graded reports

Continue with Google

Continue with Email

Continue with Classlink

Continue with Clever
or continue with

Microsoft
%20(1).png)
Apple

Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?
Similar Resources on Wayground
6 questions
unit2-PG-QUANTIFIERS
Flashcard
•
7th Grade - University
13 questions
Money
Flashcard
•
9th - 12th Grade
7 questions
Reference Books and Their Types
Flashcard
•
KG
8 questions
PAI TERBITNYA FAJAR DAKWAH
Flashcard
•
9th - 12th Grade
13 questions
Types and Components of Computer Systems
Flashcard
•
9th - 12th Grade
6 questions
Fetch-Execute Cycle and CPU Registers
Flashcard
•
KG
8 questions
Customer Service Vocabulary
Flashcard
•
7th - 12th Grade
12 questions
Biblia (nat2020 kerettantervi alapfogalmak)
Flashcard
•
9th Grade - University
Popular Resources on Wayground
10 questions
Ice Breaker Trivia: Food from Around the World
Quiz
•
3rd - 12th Grade
20 questions
MINERS Core Values Quiz
Quiz
•
8th Grade
10 questions
Boomer ⚡ Zoomer - Holiday Movies
Quiz
•
KG - University
25 questions
Multiplication Facts
Quiz
•
5th Grade
22 questions
Adding Integers
Quiz
•
6th Grade
20 questions
Multiplying and Dividing Integers
Quiz
•
7th Grade
10 questions
How to Email your Teacher
Quiz
•
Professional Development
15 questions
Order of Operations
Quiz
•
5th Grade