DimStiller v 0.2.1

DimStiller Download (updated November 5, 2010 14:00)

DimStiller Overview

DimStiller is a visual tool for dimensional analysis and reduction. Users can create pipelines called "expressions" in which they chain together analysis techniques called "operators".

One way to think about the task of dimension reduction is to consider the space of all data tables. The user's data is a single point in this space. The desired table produced from the users analysis is another point. The task is then to find the mapping or composition of mappings that move from the input table to the output table. DimStiller operators map input points to some other output point and therefore represent edges between the input and output tables in table space. DimStiller expressions, which are compositions of these operators, represent a connected path of points in table space.

DimStiller Instructions

The DimStiller UI consists of three parts indicated in the figure:

  1. WORKFLOWS. The top part labelled "WORKFLOWS" is the workflows selector. Workflows are like expression templates. Click "Add" to create a new expression from a workflow, or click "Apply" to add the workflow to a loaded expression.
  2. EXPRESSION TREE. The Expression Tree
  3. OPERATOR CONTROL. When a user hilights an operator in the expression tree, the operator's control shows up here. The user can adjust the parameters of the operator using widgets that appear here.

DimStiller has three menus:

To use DimStiller, you first create a new expression. This will implicitly ask you for an input table (see format description) and create a new expression with a single "Input:File" operator. Now the user has two choices: They can

Watch a video demo of an earlier version of the tool here.

Input File Format

The first line in DimStiller should be a comma-separated list of dimension titles, one entry for each dimension. The second row in DimStiller should be a comma-separated list of dimension types. Types can be one of the following three case-insensitive values "CATEGORICAL", "ORDINAL", or "NUMERIC". If this line is left out, then all dimensions are considered "NUMERIC" by default. Every following line is interpreted as a point in the dataset with a comma-separated list of point values. DimStiller does not currently support missing entries in data tables.

Here is an example file:

dim1,dim2,dim3
NUMERIC,CATEGORICAL,ORDINAL
1.0,A,X
2.0,B,Y
3.0,C,Z
			

Command Line Arguments

This section is purely for reference. One can use the packaged shell script or batch file to invoke DimStiller rather than using the command line arguments.

Default command line invocation: java -cp .:core.jar:Jama-1.0.2.jar still.gui.DimStiller -D still/operators/ -W workflows/

Operator Description

Operators can be "torn off" of the main control by clicking the "Tear off" button on the bottom of the control. Tearing off controls allows the control to persist onscreen when adjusting other operator controls. To return the control to the main window, click on the window-close button in the upper left corner of the window.

Wish List

The following features are in the queue for addition to DimStiller:

Please see the feedback section to request any desired operators to DimStiller.

Developers

DimStiller is designed to be extensible in that users can create their own operators. This section will eventually contain instructions on how to create and add operators.

Change Log

Feedback

Please send feedback and requests to <sfingram (at) cs.ubc.ca>