Errata for Data Mining with Rattle and R.
We also include updates to Rattle that differ from the screen shots in the book and suggestions submitted by readers.
Page vii, Line -6. Replace "understand verify" with "understand, verify".
Page 7, Section 1.4, Paragraph 1: Replace "Cross Industry Process" with "Cross Industry Standard Process". (Kee Siong Ng)
Page 32, Figure 2.6. If we follow the commands from the text we would end up with a different bar plot. The plot in the book results from turning off the Partition check button and hence displays all 366 observations. With the default partitioning, we see 215 and 41 instead of 300 and 66 as in Figure 2.6. Also, for Figure 2.8 it should be noted that RainTomorrow should be reset in the Data tab as the Target variable.
Page 53, Line -3. Replace "WE" with "We".
Page 133, Paragraph 3. Replace "...Figure 5.18...there are only 21 missing..." with "...31 missing...". (Kevin Florey)
Page 140, Figure 6.3. This figure is a repeat of Figure 6.2. The correct figure is available here.
Page 155, Figure 7.2 and Page 160. The book shows "By Group" in the figure and discusses "By Group" while Rattle 2.6.15 and later has replaced "By Group" with "Interval". (Kevin Florey)
Page 186, Last Paragraph. "...and only eight of the ten clusters". The table actually shows all 10 clusters. (Kevin Florey)
Page 240, Paragraph 3, Line 5. Replace "loss=loss=" with "loss=".
Pages 251-266. This is not an errata as such, but reinforces the point made in the book about slight variations in numeric calculations when randomness is involved. For example, on Page 251, the book has "OOB estimate of error rate: 14.06%" but for a 32bit system we might see "OOB estimate of error rate: 14.84%" or "OOB estimate of error rate: 14.45%". Similarly the Confusion Matrix in the book includes "Yes 26 15 0.63415" whereas we might see "Yes 28 13 0.68292683" or "Yes 27 14 0.65853659". (Kevin Florey)
Page 255. Replace "the Error button" with "the Errors button". (Kevin Florey)
Page 274. The train.err and train.kap values are no longer reported in later versions of Rattle. (Kevin Florey)
Page 274. Replace "the Error button" with "the Errors button". (Kevin Florey)
Page 294, Paragraph 3. Replace ", and so the size of the training set is not usually an issue" with ". Nonetheless, training time tends to increase significantly with the size of the training dataset, and can be prohibitive on large datasets" (Kee Siong Ng)
Send any that you notice to Graham.Williams@togaware.com.
Copyright © 2006-2012 Togaware Pty Ltd
Last Modified 2012-02-04 15:18:51 Graham Williams