Togaware
freedom is in everyone's language Frihed Vrijheid Liberté Freiheit Ελευθερία Свобода Bebas Libertad


Home
Services
Freedoms
Resources

- Rattle

- OpenMoko

- Data Mining

- GNU/Linux

- LaTeX

Supporting

- Analytics/IAPA

- AusDM

- PAKDD

Hosting

- Dirt Navigator

- Gallery

About Us


Rattle: To-Do List

This is a brief list of some of the things in the pipeline or suggested by users. Feel free to email me with suggestions and priorities and especially code.

080512 Fix scoring for multiple class classification
080517 Review Tony Nolan's weather dataset for demo inclusion
080513 Remove paradigms - display all tabs (Tony Nolan)
080513 Combine Data/Select tabs into one (Tony Nolan)
080513 Add a R command console where paradigms were (Tony Nolan)
080513 Introduce Ensemble scoring to add scores together (see NetFlix).
080517 Add a Statistics option under Explore (t-test/z-test/chi^2 (Tony Nolan)
080512 Notice $ amounts and , in thousands and treat as numeric (Tony Nolan)
080517 Add a macros menu with canned R templates (Tony Nolan)
080511 Move to using guiDo from plotAndPlayGTK
080511 Move to using playwith from plotAndPlayGTK for plots
080511 Add support for scagnostics package
080504 Allow risk charts without a RISK identified
080502 Put PMML.COPYRIGHT into .Rattle
080502 Add report info to PMML export of GLM
080501 Fix cluster export to CSV
080429 Add a SQLiteDF data option
080427 Report pmml.nnet error to Zementis
080427 Fix Draw of decision tree for Regression
080427 On Score be sure to include the commands in log.
080426 Add "Transform" check button on Evaluate tab (active when Score and CSV)
       that will perform same transformations on the scoring dataset. (Tony)
080427 Fix export of PMML for rpart
080421 For assoc, if ID is not unique, set Baskets to checked
080422 Complete read.pmml function for rpart.
080419 Accumlate the transforms and apply to new CSV score file (Tom Neice)
080418 Change plots to be tabbed plots with second tab being parameters that
       lay behind the model whose performance is being plotted.
080418 Load rattle script files (.ras) for data, score, test:
080418 Allow txt score file (perhaps automatic if training data is txt)
080415 Fix about Quit button
080408 Transform Tab, "Filter" textbox for rows to keep
080330 After Transform refreshes list, keep same selections?
080318 Migrate all global variables into the crv$ structure - avoid warnings
080315 Add a Last Dataset/Project option (Tony Nolan)
080305 Implement silhouette plot for kmeans clusters
080203 Ed Cox: Add row selection to Select Tab (perhaps using "subset").
080203 Ed Cox: Bug when using ODBC data source multiple times.
060921 Pareto, zipf (ginger.hpl.hp.com/shl/papers/ranking/ranking.html) (Stu)
070106 Transform: Implement mean/median from knn imputations (Daniel Medri)
071215 Display rsq.rpart - the r squared fit by splits.
071216 Add date conversion to Remap: as.Date(COLUMN, format="%d-%m-%Y")
071216 Rename binning to not conflict with sm's binning.

071205 Add in Randy Goebel's association rule visualisation (AI07)
071125 Use relaimpo  to calculate relative importance of variables:
       calc.relimp(Adjusted ~ ., audit[,c(2,7,9:10,13)], 
       type = c("lmg", "last", "first", "betasq", "pratt") )
070422 Disable drag and drop from the file open dialogs (Ray Lindsay).
070406 For the cairo plot window, introduce keyboard shortcuts.
070325 Add a Defaults button to each model builder.
070130 Explore->Distr check buttons train/test/csv (csv from Evaluate tab)
070330 Add selection of clustering: diana, agnus, fanny, etc.
070421 Ensure the Help menu is complete.
061019 Numerous graphics extensions (Geir Nilsen)
070401 Add a File Type combo box to graphics save.
070514 FileChooser to also show file sizes (Tony Nolan)
070324 Implement dendrogram zoom using rgl (see plot3d example)?
070321 For the View textview, keep first row (headers) static as "edit()" does
070321 Implement own Edit textview to edit dataset, instead of ugly "edit()"
060703 Kmeans: If a target exists, generate a hot spots display.
070305 Add a "best k" finder using hclust (Daniele Medri)
060821 Explore->Distributions: Clear = clear all radio buttons
070317 Keep the cwd info local to Rattle rather than globally changing.
070317 Add arffname as a parameter to the rattle function call.
070109 Evaluate: add compare distributions radio button (Ray Lindsay)
070311 Replace radio buttons with icons (Graham Williams)
070319 Make the Explore:Summary:Find button do "Find Next" and remove Next.
070317 Make the Explore:Summary:Find search case insensitive.
070121 Each textview have default explanatory text
061001 Choice of Kernel for SVMs. 
070319 Introduce a progress bar for all activities (Tony Nolan)
070114 Add textmining.R to Rattle
060723 Simplify RF rule sets
070220 Auto update feature(Tony Nolan)
070201 Investigate party for rattle (Daniel Medri)
070201 Invetigate rmpi for rattle (Daniel Medri)
070105 Implement knn model builder (Yale)
060001 Implement bagged boosted stumps (Jeremy Barnes)
070316 Implement "expert" settings for each model (Liyin Xue)

Notes

  1. The caret package provides a unifying interface to the model builders and cold be used underneath in Rattle. It isn't at present - Rattle uses the source packages to ensure we teach the use of the underlying model builders. As a data miner moves into R they may like to consider the use of the caret package for command line usage. The package also provides a tuning facility.
  2. 080118: Considered moving to using the VGAM package for regression. Decide not to move now - VGAM is hijacking S4 classes for predict and other functions. The author is now aware of this. library(VGAM) results in all other models no longer working - no "predict" and pmml.lm fails with no coefficients!