> Speaking of which, I think it's important to curate a dataset of > success/failure logs with the expected anomalies to be found. Those will > be super useful to prevent regression when trying out new settings or models. > How to store and manage the dataset remains to be defined too. > To give you an idea, fwiw, you can find my original dataset here: > git clone https://softwarefactory-project.io/r/logreduce-tests > How did you collect and curate the original dataset? And, how do you expect the new set looks like? Cheers, Klérisson -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.openstack.org/pipermail/openstack-infra/attachments/20171124/61114c18/attachment.html>