Data Mining Assignment 2
Spring 2009
Submission Deadline: Wednesday, February 18, at
4.00pm
Experiments with Weka
- The weka software package is available on mansfield under
/bin/weka/weka-3-3-6. Please set up the CLASSPATH environment variable
to be /bin/weka/weka-3-3-6 in order
to run this installation. With tcsh, add the following line in your
~/.cshrc file:
setenv CLASSPATH
/bin/weka/weka-3-3-6
- Copy the data files from /bin/weka/weka-3-3-6/data to your home
directory.
- Run the Apriori algorithm on each of the seven databases, e.g.,
java weka.associations.Apriori -t weather.nominal.arff
- Try the -N, -C, and -M switches.
-N <required number of rules>
specifies the number of rules.
-C <miniconf> specifies the minimum confidence.
-M <minisup> specifies the minimum support.
- Submit a report in plain text by e-mail to document the
differences the -N, -C, and -M switches make on the soybean.arff file.
You should try at least two different values for each of these
switches; one greater and one smaller than the default value (which is
the value that you do not use the switch).
Please e-mail questions to xwu@cs.uvm.edu.