Check whether the column rainfall and potential Evapo transpiration in the file gen.csv
displays a monotonic function or not
Use the WEKA Explorer and justify the values
For the different pre-defined datasets present under C:\\Program Files\\Weka-3-8-6\\data\\diabetes.arff
Data mining techniques ⇒ SEMMA
Data Analysis
The values shall lie between -1 to +1, if anything out of range denoted that the data isn’t processed properly and needs to be redone.
When the function is monotonic, Spearman coefficient variable is considered ⇒ Where the graph of the function is neither increasing or decreasing
When the function is linearly related variable Pearson r correlation coefficient is considered
Market Basket Analysis,
Simple random forest classifiers
Correlation Coefficient | Type of Relationship | Level of Measurement | Data Distribution |
---|---|---|---|
Pearson’s r | Linear | ||
Spearman | |||
Precision Recall Curve (PRC), ROC Curve, Area under the Curve, MCC (Mathews Correlation Coefficient)in any data set, Kappa Statistic Coefficient
Mean Squared Error (MSE) and Mean Average Error (MAE) shall be considered for every structured dataset
> getwd()
> print("hello")
> print('hello')
> rep(c("a", "b"), 2)
[1] "a" "b" "a" "b"
> rep("anushka", 5)
[1] "anushka" "anushka" "anushka" "anushka" "anushka"
> x <- 1:10
> x
[1] 1 2 3 4 5 6 7 8 9 10
> class(x)
[1] "integer"
# to check the working directory
getwd()
# to set the working directory
setwd("G:/My Drive/Semester_5/Summer_Course/tableau/dayOne")