
record-classification

This project provides an automatic record classification tool.

load Command

The load command reads gold standard and unseen records from a file on the local file system. The options of this command are:

To specify whether the source file contains unseen or gold standard records, the load command must be used in conjunction with one of its subcommands: unseen or gold_standard.

A single execution of the load command reads either gold standard records or unseen records. For example, the following command:

load --from my_dataset.csv gold_standard -t 0.8 -h

loads gold standard records from a file called my_dataset.csv in the current working directory, skipping the first record. Of the loaded records, 80% are used to train the classifier and the remaining 20% are used to evaluate it. Default values are used for any unspecified options.
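To make the effect of the training ratio concrete, the following Python sketch splits a small in-memory dataset the same way the example describes: the first record is skipped and 80% of the rest goes to training. This is only an illustration, not the tool's implementation. The function name split_gold_standard, the shuffling step, the fixed seed, and the rounding rule are all assumptions made for the example.

```python
import csv
import io
import random


def split_gold_standard(rows, training_ratio=0.8, skip_first=True, seed=42):
    """Split rows into (training, evaluation) lists.

    Hypothetical helper: shuffling and rounding are assumptions,
    not documented behaviour of the load command.
    """
    # Drop the first record when requested (for example, a header row).
    records = list(rows[1:]) if skip_first else list(rows)
    # Shuffle a copy with a fixed seed so the split is repeatable.
    random.Random(seed).shuffle(records)
    # Number of records that go to the training set.
    cut = round(len(records) * training_ratio)
    return records[:cut], records[cut:]


# Ten data records preceded by one header record.
sample = "id,label,text\n" + "\n".join(
    f"{i},L{i % 2},text {i}" for i in range(10)
)
rows = list(csv.reader(io.StringIO(sample)))

training, evaluation = split_gold_standard(rows, training_ratio=0.8)
print(len(training), len(evaluation))  # 8 2
```

With ten records and a ratio of 0.8, eight records land in the training set and two in the evaluation set.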

In another example, the following command:

load --from my_other_dataset.csv unseen -ii 4 -li 5

loads unseen records from a file called my_other_dataset.csv in the current working directory, where the ID of each record is in the fifth column and the label of each record is in the sixth column. As the example shows, column indices are zero-based: index 4 refers to the fifth column and index 5 to the sixth. Default values are used for any unspecified options.
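The zero-based column indexing used by the example can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the tool's own parser: the function name read_unseen and the sample data are hypothetical, and the input is assumed to be plain comma-separated text.

```python
import csv
import io


def read_unseen(source, id_index, label_index):
    """Return (id, label) pairs using zero-based column indices.

    Hypothetical helper for illustration only.
    """
    reader = csv.reader(source)
    # Skip empty lines; pick the two columns of interest from each row.
    return [(row[id_index], row[label_index]) for row in reader if row]


# Six columns per row: the ID is in the fifth, the label in the sixth.
sample = io.StringIO(
    "a,b,c,d,rec-1,first label\n"
    "a,b,c,d,rec-2,second label\n"
)

records = read_unseen(sample, id_index=4, label_index=5)
print(records)  # [('rec-1', 'first label'), ('rec-2', 'second label')]
```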
