Hello experts, I have one question I hope you could help me with.
A few days ago MapD was open-sourced and a collaboration between MapD and H2O.ai was announced. I want to know if as of today there is a way to run a logistic regression in a big data set in MapD without leaving the GPU stack?
For example, let say we have a data set of 100 million rows in MapD running on gpus. How can we run a logistic regression in that data set without moving the data outside the gpu stack? It is not clear to me if currently I can use XYZ software on top of MapD without moving data or if unfortunately I have to move the data outside the gpu stack in order to do the logistic regression.