R: Text classification using Caret package

This post is a follow up on my previous post “R: Text classification using SMOTE and SVM”. I have since gained more experience in R and improved my code. Here is an example (specific to my project, so many parts may not be relevant). In this example I start by loading my functions, and datasets. Then… Continue reading R: Text classification using Caret package

R: Text classification using SMOTE and SVM

SMOTE algorithm is “an over-sampling approach in which the minority class is over-sampled by creating ‘synthetic’ examples rather than by over-sampling with replacement”. It is a technique used to resolve class imbalance in training data. SVM (Support Vector Machine) is a machine learning algorithm. As Wikipedia describes it “a support vector machine constructs a hyperplane… Continue reading R: Text classification using SMOTE and SVM

Tips to setup Rstudio on Ubuntu cloud server

This past week I’ve been struggling with memory issues when using R. The computations were taking too long and often resulted in ‘cannot allocate vector of size xx ‘ errors. So I finally decided to move to cloud. In this blog post I will share the resources I found helpful and share some tips I… Continue reading Tips to setup Rstudio on Ubuntu cloud server