S04 - Speech Recognition (GSLC)

S04 - Speech Recognition (GSLC)

University

10 Qs

quiz-placeholder

Similar activities

Object-Oriented Programming - Prelim Exam

Object-Oriented Programming - Prelim Exam

University

15 Qs

L2 scratch

L2 scratch

7th Grade - University

15 Qs

Chapter 12 : Private Security and Homeland Defense

Chapter 12 : Private Security and Homeland Defense

University

15 Qs

Day1 Quiz

Day1 Quiz

University

10 Qs

CSC305: TOPIC 2

CSC305: TOPIC 2

University

15 Qs

ICF (Theory) Electrical First Quiz

ICF (Theory) Electrical First Quiz

University

10 Qs

NLP - QUIZ

NLP - QUIZ

University

14 Qs

ENSA - TEST 1B

ENSA - TEST 1B

University

5 Qs

S04 - Speech Recognition (GSLC)

S04 - Speech Recognition (GSLC)

Assessment

Quiz

Computers

University

Easy

Created by

Amalia Zahra

Used 6+ times

FREE Resource

10 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What is the main purpose of wav2vec models?

Text-to-Speech (TTS) conversion

Self-supervised learning for speech recognition

Machine Translation

Image classification

2.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What is the primary difference between wav2vec and wav2vec 2.0?

wav2vec 2.0 uses CNNs, while wav2vec uses Transformers

wav2vec 2.0 removes dependency on phoneme-based features

wav2vec is multilingual, while wav2vec 2.0 is not

wav2vec 2.0 is only used for speaker recognition

3.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

Which component of wav2vec 2.0 is responsible for capturing long-range speech dependencies?

Feature Encoder

Transformer Context Network

Quantization Module

Spectrogram Processor

4.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What is the function of the Quantization Module in wav2vec 2.0?

Converts audio into text

Discretizes speech representations into codebook embeddings

Removes noise from speech signals

Translates speech into multiple languages

5.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What is the main advantage of XLSR over wav2vec 2.0?

It is trained on multiple languages

It does not use self-supervised learning

It does not require any training

It is only used for speaker verification

6.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What does the “53” in XLSR-53 refer to?

The number of Transformer layers

The number of audio features extracted

The number of languages used for training

The number of training datasets

7.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

Which model is best suited for low-resource languages?

wav2vec

wav2vec 2.0

XLSR-53

X-LSR

Create a free account and access millions of resources

Create resources
Host any resource
Get auto-graded reports
or continue with
Microsoft
Apple
Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?