Multiplying the Mileage of Your Dataset with Subwindowing

Atyabi, Adham
Fitzgibbon, Sean Patrick
Powers, David Martin
Springer Berlin Heidelberg
Copyright © 2011 Springer-Verlag
This study aims to improve the classification performance of EEG data through data restructuring methods. It investigates the trade-off between having more training instances and using shorter window sizes, using the BCI2003 IVa dataset. Not surprisingly, the results indicate that, up to a certain point, a larger number of training instances significantly improves classification performance, while shorter window sizes tend to worsen performance in a way that usually cannot be fully compensated for by the additional instances. Nevertheless, dividing each epoch into two or three subepochs tends to provide a useful gain in overall performance. We have moreover determined that using an incomplete set of overlapping windows can have little effect, and is inapplicable for the smallest divisors, but that using overlapping subepochs from three specific non-overlapping areas (start, middle and end) of a superepoch tends to contribute significant additional information. Examination of a division into five equal non-overlapping areas indicates that, for some subjects, the first or last fifth contributes significantly less information than the middle three fifths.
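The restructuring described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the array layout (trials, channels, samples), the function names `split_epochs` and `sliding_windows`, and the parameters `divisor`, `width` and `step` are all assumptions made for illustration. The first function cuts each superepoch into equal non-overlapping subepochs, so a divisor of k yields k times as many instances, each k times shorter. The second extracts overlapping windows.

```python
import numpy as np


def split_epochs(epochs, labels, divisor):
    """Cut each epoch into `divisor` equal, non-overlapping subepochs.

    epochs: array of shape (trials, channels, samples)  [assumed layout]
    labels: array of shape (trials,)
    Returns (trials * divisor, channels, samples // divisor) and repeated labels.
    """
    n_trials, n_channels, n_samples = epochs.shape
    sub_len = n_samples // divisor
    if sub_len == 0:
        raise ValueError("divisor is larger than the number of samples")
    # Drop trailing samples that do not fill a whole subepoch.
    trimmed = epochs[:, :, :sub_len * divisor]
    # (trials, channels, divisor, sub_len) -> (trials, divisor, channels, sub_len)
    sub = trimmed.reshape(n_trials, n_channels, divisor, sub_len)
    sub = sub.transpose(0, 2, 1, 3)
    sub = sub.reshape(n_trials * divisor, n_channels, sub_len)
    # Every subepoch inherits the class label of its parent epoch.
    return sub, np.repeat(labels, divisor)


def sliding_windows(epochs, labels, width, step):
    """Extract overlapping windows of `width` samples every `step` samples."""
    n_trials, n_channels, n_samples = epochs.shape
    if width > n_samples:
        raise ValueError("window is longer than the epoch")
    starts = range(0, n_samples - width + 1, step)
    # Stack along a new axis: (trials, n_windows, channels, width)
    windows = np.stack([epochs[:, :, s:s + width] for s in starts], axis=1)
    n_windows = windows.shape[1]
    windows = windows.reshape(n_trials * n_windows, n_channels, width)
    return windows, np.repeat(labels, n_windows)


# Toy data: 2 trials, 3 channels, 12 samples per epoch (values are arbitrary).
epochs = np.arange(2 * 3 * 12).reshape(2, 3, 12)
labels = np.array([0, 1])

sub, sub_labels = split_epochs(epochs, labels, divisor=3)
win, win_labels = sliding_windows(epochs, labels, width=6, step=3)
print(sub.shape, win.shape)
```

With a step smaller than the width, `sliding_windows` produces overlapping windows; setting the step equal to the width reduces it to the non-overlapping case.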
Author version made available in accordance with the publisher's policy.
Atyabi, Adham, Fitzgibbon, Sean, Powers, David. 2011. Multiplying the Mileage of Your Dataset with Subwindowing. Brain Informatics: International Conference, BI 2011, Lanzhou, China, September 7-9, 2011. Proceedings. Berlin: Springer. Pp. 173-184