PySpark MLlib

Assessment

Quiz

Computers

University

Hard

Created by

FRIKA NUGROHO

20 questions

1.

MULTIPLE CHOICE QUESTION

2 mins • 1 pt

Which method is used to load a CSV dataset into a DataFrame in Spark?

spark.read.file()
spark.load.csv()
spark.read.csv()
spark.import.csv()

2.

MULTIPLE CHOICE QUESTION

2 mins • 1 pt

Fill in the blanks in the code to combine the attributes into a single vector.

assembler = VectorAssembler(.........=selected_features, .........="features")

assembler = VectorAssembler(inputs=selected_features, outputs='features')
assembler = VectorAssembler(inputCols=selected_features, outputCol=features_col)
assembler = VectorAssembler(inputCols=features, outputCol=selected_features)
assembler = VectorAssembler(inputCols=selected_features, outputCol="features")

3.

MULTIPLE CHOICE QUESTION

2 mins • 1 pt

Complete the following code to split the dataset into training data and test data, with the test data receiving 20% of the rows.

train_data, test_data = data.randomSplit(.........)

train_data, test_data = data.randomSplit([0.6, 0.4])

train_data, test_data = data.randomSplit([0.8, 0.2])

train_data, test_data = data.randomSplit([0.7, 0.3])

train_data, test_data = data.randomSplit([0.9, 0.1])
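As background on the argument to `randomSplit`: the weights are relative proportions, and Spark normalizes them if they do not sum to 1. The split is random, so the resulting sizes are approximate rather than exact. The pure-Python sketch below imitates that behavior on a plain list. `random_split_sketch` is a hypothetical helper written for illustration, not Spark's implementation, and the weights `[3, 1]` are chosen arbitrarily.

```python
import random
from itertools import accumulate


def random_split_sketch(rows, weights, seed=None):
    """Assign each row to one split by an independent uniform draw.

    Hypothetical helper for illustration only; not Spark's implementation.
    """
    total = sum(weights)
    # Normalize so the weights sum to 1, then build cumulative upper bounds.
    bounds = list(accumulate(w / total for w in weights))
    bounds[-1] = 1.0  # guard against floating-point drift in the last bound
    rng = random.Random(seed)
    splits = [[] for _ in weights]
    for row in rows:
        x = rng.random()  # uniform in [0, 1)
        for i, upper in enumerate(bounds):
            if x < upper:
                splits[i].append(row)
                break
    return splits


rows = list(range(10_000))
# Weights 3 and 1 normalize to 0.75 and 0.25.
train, test = random_split_sketch(rows, [3, 1], seed=42)
print(len(train), len(test))  # close to 7500 and 2500, but not exactly
```

Passing a seed makes the assignment repeatable, which is also why `randomSplit` accepts an optional `seed` argument.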

4.

MULTIPLE CHOICE QUESTION

2 mins • 1 pt

Which function is used to count the number of rows in a Spark DataFrame?

df.size()

df.rows()

df.count()

df.length()

5.

MULTIPLE CHOICE QUESTION

2 mins • 1 pt

Which function is used to display the schema of a Spark DataFrame?

df.schema()

df.displaySchema()

df.printSchema()

df.showSchema()

6.

FILL IN THE BLANK QUESTION

2 mins • 1 pt

Complete the code below so that it selects only the columns with numeric features.

numeric = [t[0] for t in df.dtypes if t[1] == 'int']

numeric_summary = df.select(............).summary()

numeric_summary.show(truncate=False)
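A note on the filter in the first line: `df.dtypes` returns a list of `(column_name, type_string)` tuples, so the comprehension keeps only columns whose type string is exactly `'int'`. Columns typed `bigint`, `double`, or `float` would be skipped. Below is a minimal sketch of a broader filter, run on a hand-written list in the same shape as `df.dtypes`. The column names, the `NUMERIC_TYPES` set, and the `is_numeric` helper are assumptions made for illustration.

```python
# Hand-written stand-in for df.dtypes: a list of (column name, type string) tuples.
dtypes = [
    ("id", "bigint"),
    ("age", "int"),
    ("income", "double"),
    ("city", "string"),
    ("price", "decimal(10,2)"),
]

# Type strings Spark reports for its numeric types.
NUMERIC_TYPES = {"tinyint", "smallint", "int", "bigint", "float", "double"}


def is_numeric(type_string):
    # Decimal type strings carry precision and scale, so match on the prefix.
    return type_string in NUMERIC_TYPES or type_string.startswith("decimal")


only_int = [name for name, t in dtypes if t == "int"]
numeric = [name for name, t in dtypes if is_numeric(t)]

print(only_int)  # ['age']
print(numeric)   # ['id', 'age', 'income', 'price']
```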

7.

MULTIPLE CHOICE QUESTION

2 mins • 1 pt

Which function is used to remove a column from a Spark DataFrame?

df.dropCol()

df.remove()

df.drop()

df.delete()
