January 10, 2017 Text Classification

Natural Language Processing using python

Introduction Let’s learn from a precise demo on Natural Language Processing on Newsgroup data for Machine Learning What we will do : 1. Read the newsgroup data 2. Use TfIdfVectorizer for converting a collection of raw documents to a matrix of TF-IDF features. 3. Fit random forest and multinomial model (No crossvalidation is used here) […]

Read more
December 28, 2016 deeplearning

Neural Networks For Machine Learning in R

Neural networks almost mimics the working of human brain.The neurons are connected by axons in human brain.Same way we have neural units in neural networks. Neural networks consist of multiple layers.And each layer has neural units .One is input layer and one is output layer.In between them we have more layers also called as hidden […]

Read more
February 6, 2017 Boxplot

How do I learn Machine Learning

Introduction Like anything else ,its not too difficult to learn Machine Learning but yes you need to put time and efforts for practicing it.More you practice,the more you learn.You can have a look on posts on demo in R & python after you have read this.The links are given below or you can browse the […]

Read more
January 25, 2017 Model tuning of Xgboost

Xgboost :Model tuning in Crossvalidation using caret in R

Introduction Here will discuss about the Xgboost model parameter’s tuning using caret package in R.Let’s begin.Open your R console and follow along. To Get Certified for Best Course on Data Science Developed by Data Scientist ,please follow the below link to avail discount https://www.udemy.com/machine-learning-using-r/?couponCode=GREAT_CODE Importing libraries Importing the library mlbench for sonar dataset and caret […]

Read more
January 24, 2017 Biplot in PCA

Principal Component Analysis in R

How to Perform PCA in R We will discuss here how to perform principal component analysis in R.Although PCA is required for data sets which have very high dimentionality,we will use the iris data set for simple demonstration.Importing the library MASS for iris dataset.The dimentionality of iris data set is 4 excluding the species variable […]

Read more
January 24, 2017 Biplot in PCA

What are Dimentionality Reduction Techniques

Introduction Consider the case where you are being provided a data set which has say ten thousand independent variables or columns or predictor variables and say ten millions of records or observations or rows. Let’s now just focus on ten thousand independent variables and also assume they are continuous so that you need not worry […]

Read more
January 20, 2017 AIC and deviance

Deviance and AIC for Logistic Regression in R

Introduction This is for you,if you are looking for Deviance,AIC,Degree of Freedom,interpretation of p-value,coefficient estimates,odds ratio,logit score and how to find the final probability from logit score in logistic regression in R. Course for Beginners: https://www.udemy.com/machine-learning-using-r/?couponCode=GREAT_CODE Importing libraries & Reading Data Importing the required libraries.MASS is used for importing birthwt dataset library(MASS) #### Storing the […]

Read more
January 19, 2017 Logistic Regression

Logistic Regression output interpretation in R

Introduction This is for you if you are looking for interpretation of p-value,coefficient estimates,odds ratio,logit score and how to find the final probability from logit score in logistic regression in R. Let’s begin !! For Best Course on Data Science Developed by Data Scientist ,please follow the below link to avail discount https://www.udemy.com/machine-learning-using-r/?couponCode=DATA_MASTER Importing libraries,Reading […]

Read more
January 16, 2017 Cross-validation in R

Avoid Over fitting & start crossvalidation

Introduction If you want to learn what is K-fold cross-validation and how is it done in R,then please follow along.Open your RStudio and have fun!! Course for Beginners: https://www.udemy.com/machine-learning-using-r/?couponCode=DISFOR123 What is Cross-validation A model is usually given a known data set(training data set) on which training is done and unknown dataset(testing data set) against which […]

Read more
January 14, 2017 Barplot

Plotting Categorical Variable vs continuous variables

Let’s begin Data visualizations from basic to more advanced levels where we can learn about plotting categorical variable vs continuous variable or categorical vs categorical variables.Let’s start RStudio and begin typing in ๐Ÿ™‚ For Best Course on Data Science Developed by Data Scientist ,please follow the below link to avail discount https://www.udemy.com/machine-learning-using-r/?couponCode=DISFOR123 #### Let’s store […]

Read more
January 14, 2017 k-means

K-means Clustering for Data Analytics in R

Introduction Here we will know about “how to perform k-means clustering in R” and “how to find best value of k in k-means clustering” Importing library Let’s open RStudio and follow along !! Let’s import the ggplot2 library which is needed for ggplot visualization library(ggplot2) Reading Dataset Let’s import the data set named โ€œirisโ€ into […]

Read more