Data Source Selection

The following options are available on the Data tab in RStat. You may also access the Data tab through the Tools menu.

Source

Different options may be available depending on the data type.

Partition

Partitioning splits a single data set into two: a training data set, used for analysis and modeling, and a test data set, used to evaluate how well a model performs. It is common practice to test a model on new data that was not used to create it.
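As a rough illustration of the idea, the sketch below performs a random training/test split in plain Python. It does not show how RStat implements partitioning internally. The function name partition, the 70% training fraction, and the fixed seed are all assumptions chosen for the example.

```python
import random

def partition(records, train_fraction=0.7, seed=42):
    """Randomly split records into (training, test) lists.

    train_fraction and seed are illustrative defaults, not RStat settings.
    """
    shuffled = list(records)               # copy so the input is left untouched
    random.Random(seed).shuffle(shuffled)  # seeded shuffle for a repeatable split
    n_train = round(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]

# 100 hypothetical records, identified here by the integers 0..99
train, test = partition(range(100), train_fraction=0.7)
print(len(train), len(test))
```

Because the records are shuffled before they are cut, every record lands in exactly one of the two sets, so the model is never evaluated on a record it was trained on.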

You can define the partition size either as a percentage of the total records or as an exact number of records. Changing the percentage automatically updates the count, and vice versa.
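The link between the two settings is simple arithmetic, sketched below in Python. The helper names and the rounding to the nearest whole record are assumptions made for illustration; the rounding rule RStat actually applies is not stated here.

```python
def count_from_percent(total_records, percent):
    """Number of records in a partition given as a percentage of the total."""
    # Rounding to the nearest whole record is an assumption of this sketch.
    return round(total_records * percent / 100)

def percent_from_count(total_records, count):
    """Percentage of the total that a partition of `count` records represents."""
    return 100 * count / total_records

# Hypothetical data set of 2000 records
training_count = count_from_percent(2000, 70)
training_percent = percent_from_count(2000, 1400)
print(training_count, training_percent)
```

With 2000 records, a 70% partition corresponds to 1400 records, and entering 1400 records corresponds to 70%.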

