caret package for R

R is one of the best tools for data science, at least for prototyping and for data that fits in memory. And caret is one of the best packages for building common machine learning models in R.

As described in the package's introduction, caret streamlines the process of creating models.
For common models, it performs cross-validation and parameter tuning automatically.

Here is a quick example on the iris data.
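The listing that went with this post is not preserved here, so what follows is only a minimal sketch of what such an example might look like, not the original code. It assumes the caret and randomForest packages are installed; the choice of a random forest (method = "rf"), the 10 folds, and the seed are illustrative assumptions.

```r
# Minimal sketch: train a classifier on iris with caret.
# Assumes the caret and randomForest packages are installed.
library(caret)

set.seed(42)  # make the resampling reproducible
data(iris)

# 10-fold cross-validation; the fold count is an illustrative choice.
ctrl <- trainControl(method = "cv", number = 10)

# method = "rf" fits a random forest. train() evaluates a small default
# grid of mtry values by cross-validation and keeps the best one.
fit <- train(Species ~ ., data = iris, method = "rf", trControl = ctrl)

print(fit)           # resampled accuracy for each candidate mtry
print(fit$bestTune)  # the mtry value that train() selected
```

The call to train() is where the cross-validation and parameter tuning happen; no explicit loop over folds or parameter values is needed.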

There are still some difficulties with using R.
One issue is memory usage on large data sets. I currently have a random forest model training on about 1 million rows of data, and it takes 24 GB of memory.

Next I will try training a neural network model.


One Response to caret package for R

  1. Xu Weidong says:

    Changing the parameter of function [ train ] to [ method = 'nnet' ] would train a 3-layer neural network model. It works fine for this simple data set.
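The change described in this comment can be sketched roughly as follows. This is an illustration under assumptions, not the commenter's actual code: it assumes the caret and nnet packages are installed, and the 5-fold setting, the seed, and trace = FALSE (passed through to nnet to silence its optimizer log) are illustrative choices.

```r
# Minimal sketch: the same train() call with method = "nnet".
# Assumes the caret and nnet packages are installed.
library(caret)

set.seed(42)  # make the resampling reproducible
data(iris)

# 5-fold cross-validation; the fold count is an illustrative choice.
ctrl <- trainControl(method = "cv", number = 5)

# method = "nnet" fits a single-hidden-layer network (input, hidden, output).
# train() tunes the hidden layer size and the weight decay.
fit_nnet <- train(Species ~ ., data = iris, method = "nnet",
                  trControl = ctrl, trace = FALSE)

print(fit_nnet$bestTune)                    # selected size and decay
print(predict(fit_nnet, newdata = head(iris)))  # predicted species
```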
