To (a) or not to (a), that is the question?

While looking for content for the next edition of my book (newsflash! I’m currently working on the next edition of my book!) I went hunting around for that trick using decision trees to predict the states of a single column independently rather than all together.  Turns out – I never wrote it!  So, in case…

1

How can we mine? Let me count the ways…

Recently I received some customer feedback that SQL Server Data Mining "doesn’t have enough algorithms."  More specifically, the comment was that we have the same capabilities are other Data Mining providers, we just "hide" many facilities as algorithm parameters rather than separating out each as a named algorithm.  So let’s count the Microsoft algorithms a…

1

Tree Utilities in Analysis Services Stored Procedures

This past week I was helping out a customer that wanted to reduce the length of a questionairre by using data mining to determine which of the 300+ questions were actually necessary for them to get the understanding they required.  By using a tree model, and playing with the COMPLEXITY_PENALTY parameter, I was able to…

1

Wisconsin Breast Cancer Dataset available

Frequently I use the Wisconsin Breast Cancer Dataset for demonstrating the Data Mining Addins for Office – enough people asked, so I made it available as an Excel 2007 file (free login required).  For purists, the original data is available at the Machine Learning repository, which is a great location for many sample datasets. Here are some screenshots…

0