Name : Anushka Gupta SRN : PES2UG20CS060 Class : 5A Date : 04 July 2022

Problem Statement 1


Check whether the dataset in gen1.csv is monotonic and find correlation using the same(spearman/Pearson)


question-1.jpg

Problem Statement 2


Use the WEKA Explorer and justify the values

  1. MCC
  2. Kappa Stats
  3. ROC Curve Value

For the different pre-defined datasets present under

C:\\\\Program Files\\\\Weka-3-8-6\\\\data\\\\diabetes.arff


MCC It's a correlation between predicted classes and ground truth.

Kappa Statistics is the ratio of the proportion of times that the appraisers agree (corrected for chance agreement) to the maximum proportion of times that the appraisers could agree (corrected for chance agreement).

ROC Curve Value are frequently used to show in a graphical way the connection/trade-off between clinical sensitivity and specificity for every possible cut-off for a test or a combination of tests. In addition the area under the ROC curve gives an idea about the benefit of using the test(s) in question.