Home › Forums › Assignment courserra › IBM AI Engineering Professional Certificate › Scalable Machine Learning on Big Data using Apache Spark › Week 4 Course Project Quiz
August 26, 2020 at 2:26 pm #1170Abhishek TyagiKeymaster
P.S.- If you know remaining answers feel free to add
Question 1- Please don’t forget to import the assignment notebook before answering questions as instructed the learning item before.
Please use the discussion forums if you have questions
What is the correlation between HOURLYWindSpeed and HOURLYPressureTendency? Please use the already existing code for the colleration matrix and adjust the code respectivly.
Question 2- What is the RMSE metric obtained from the LinearRegression model (1st model in the notebook – cell has comment #LR1)
Question 3- Please change #LR1 in order to use features_norm over features. What’s the RMSE value you get now?
Question 4 What’s the RMSE value we obtain from cell #GBT1?
Question 5- What is the accuracy you get from cell #LGReg1?
Question 6- What is the accuracy you get from cell #RF1?
Question 7- What is the accuracy you get from cell #GBT2?
Question 8- If you change the number of trees in cell #RF1 from 30 to 10, what’s the new accuracy?
Question 9- What data storage format is the used?
CSV, with header, columns separated by comma
Question 10- What correlation methods are supported by the Correlation matrix function?
Question 11- Which Classification Model performs best in this notebook?
1. Logistic Regression
2. Random Forest
3. Gradient Boosted Tree
Possible answers: 1,2 or 3
Question 12- Which Regression Model performs best in this notebook?
1. Linear Regression
2. Gradient Boosted Tree Regressor
Possible answers: 1 or 2
- This topic was modified 10 months ago by Yash Arora.
- You must be logged in to reply to this topic.