Repository logo
 

Dataset associated with "Detection of non-Gaussian behaviour using machine learning techniques"

dc.contributor.authorGoodliff, Michael
dc.date.accessioned2019-08-29T14:49:35Z
dc.date.available2019-08-29T14:49:35Z
dc.date.issued2019
dc.descriptionThis repository contains eleven data files for the Lorenz63 model: testdata: contains the x, y and z variables test trajectory traindata: contains the x,y and z variables training trajectory skewdata9skew: contains the z-value and p-value on the training trajectory, window length of 9 (radius 4) skewdata17skew: contains the z-value and p-value on the training trajectory, window length of 17 (radius 8) skewdata25skew: contains the z-value and p-value on the training trajectory, window length of 25 (radius 12) MLprediction9diff: Class values for the support vector machine and neural network, window length 9 points (radius 4), differences data classes MLprediction17diff: Class values for the support vector machine and neural network, window length 17 points (radius 8), differences data classes MLprediction25diff: Class values for the support vector machine and neural network, window length 25 points (radius 12), differences data classes MLprediction9skew: Class values for the support vector machine and neural network, window length 9 points (radius 4), skewness data classes MLprediction17skew: Class values for the support vector machine and neural network, window length 17 points (radius 8), skewness data classes MLprediction25skew: Class values for the support vector machine and neural network, window length 25 points (radius 12), skewness data classes.en_US
dc.descriptionCooperative Institute for Research in the Atmosphere
dc.description.abstractAn important assumption made in most variational, ensemble and hybrid based data assimilation systems is that all minimised errors are Gaussian random variables. There has been theory developed at the Cooperative Institute for Research in the Atmosphere (CIRA) that enables for the Gaussian assumption for the different types of errors to be relaxed to a lognormally distributed random variable. While this is a first step towards using more consistent distributions to model the errors involved in numerical weather/ocean prediction, we still need to be able to identify when we need to assign a lognormal distribution in a mixed Gaussian-lognormal approach. In this paper, we present some machine learning techniques and experiments with the Lorenz 63 model. Using these machine learning techniques, we show detection of non-Gaussian distributions can be done using two methods; a support vector machine, and a neural network. This is done by training past data to classify 1) differences with the distribution statistics (means and modes) and 2) the skewness of the probability density function.en_US
dc.description.sponsorshipThis work is supported by the National Science Foundation grant AGS-1738206 at CIRA/CSU.en_US
dc.format.mediumZIP
dc.format.mediumCSV
dc.format.mediumPDF
dc.identifier.urihttps://hdl.handle.net/10217/196642
dc.identifier.urihttp://dx.doi.org/10.25675/10217/196642
dc.languageEnglish
dc.language.isoeng
dc.publisherColorado State University. Librariesen_US
dc.relation.ispartofResearch Data
dc.relation.isreferencedbyGoodliff, M., Fletcher, S., Kliewer, A.,Forsythe, J., & Jones, A. (2020). Detection of non-Gaussian behavior using machine learning techniques: A case study on the Lorenz 63 model. Journal of Geophysical Research: Atmospheres,125, e2019JD031551. https://doi.org/10.1029/2019JD031551
dc.titleDataset associated with "Detection of non-Gaussian behaviour using machine learning techniques"en_US
dc.typeDataseten_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Goodliff_201911_10217-196642.zip
Size:
7.62 MB
Format:
Zip File
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.05 KB
Format:
Item-specific license agreed upon to submission
Description: