At StackHPC, we have deployed Pangeo, JupyterHub and Kubeflow on K8s cluster deployed using Magnum. Kubeflow comprises of a lot of things so it is likely there is something in there for those doing data science stuff. Best Bharat
On 24 Jan 2020, at 18:09, Tim Bell <Tim.Bell@cern.ch> wrote:
Which data science platforms are you considering ?
We may run some of them at CERN, we generally use Kubernetes (via Magnum) are the underlying provisioning engine with autoscaling up/down now available in Train. Our SPARK environments are provisioned likewise.
Tim
On 24 Jan 2020, at 15:48, Michael McCune <elmiko@redhat.com> wrote:
On Thu, Jan 23, 2020 at 3:54 PM Ruchi Rajasekhar <RRajasekhar@misoenergy.org> wrote: Would anyone happen to know of any data science platforms that can run on OpenStack? I was looking at Pivotal, Pachyderm but they don't run on OpenStack ☹
if you don't mind about adding another layer, i know that several data science platforms are creating kubernetes tooling. it might be worth investigating using one of the kubernetes on openstack deployment options (magnum, maybe others?) and then layering a data science platform on top of that.
good luck!
peace o/