Analysis of Algorithms
Data Science: Introduction to Data Science – Digital Universe – Sources of Data – Information Commons –
Data Science Project Life Cycle: OSEMN Framework
Introduction to Data Preprocessing – Reading, Selecting, Filtering Data – Filtering Missing Values –
Manipulating, Sorting, Grouping, Rearranging, Ranking Data Formulation of Hypothesis – Probabilistic
Approximately Correct Learning - VC Dimension – Hypothesis elimination – Candidate Elimination
Algorithm
R Basics - data types and objects - control structures – data frame -Feature Engineering - scaling, Label
Encoding and One Hot Encoding, Reduction
Regression Models- Linear and Logistic Model, Classification Models – Decision Tree, Naïve Bayes, SVM and
Random Forest, Clustering Models – K Means and Hierarchical clustering
Data visualization: Box plot, histogram, scatter plot, heat map – Working with Tableau – Outlier detection – Data
Balancing
Loss Function and Error: Mean Squared Error, Root Mean Squared Error – Model Selection and Evaluation criteria:
Accuracy, Precision, F1 score, Recall Score – Binary Predictive Classification – Sensitivity – Specificity.