Modelling Imbalanced Target Variable


What is a model? A model represents a real-world scenario with some epsilon, where epsilon represents the error factor: Y = f(X) + epsilon. What…


Pre-Modeling Routines in R

Data Preparation and Pre-Modeling – What & Why? Not having correct and complete data is often the most cited reason for analytics project failures,…


The Art of Big Data Science Hiring…Starts with Training

Hiring good-quality talent for Big Data Science is a challenge regardless of location: Silicon Valley, Bangalore, Beijing… it doesn’t matter. As a Big Data…


A Storm in a big cup

All Hindu gods pose with a weapon in their hands. Quite an array of intriguing weapons are used in a variety of wars/battles in Hindu…


A Rose by any other name ..

..would smell as sweet, wrote Shakespeare. An integer in any other form is not the same, is the programmer's dictum. Well, it is clear an…


Understanding Big Data workloads (and more)

Hadoop/MapReduce has jump-started a revolution in large-scale data processing, which earlier was either infeasible or uneconomical. Now, it is possible to use the…


Using the Cloudera Manager API with Splunk

One of the most popular distributions of Hadoop is from Cloudera. Cloudera provides a management/monitoring tool when you have their Enterprise Edition/support contract. With this tool, you…


My Take on Cloud Computing – Part II

"It's a no-brainer" is what any SaaS industry person would say when asked why a customer should buy a SaaS product (actually a service) over an on-premise…
