At SnowPlow we're hugely excited about using Sky/Qip to do agile analyses on big volumes of web clickstream data generated by SnowPlow (https://github.com/snowplow/snowplow).
One thing I didn't see in the blog post: what is the recommended way of loading data (e.g. CSV or JSON flat files) into Sky? We'd like to get started with our prototyping :-)
There's no automated loader yet. I'll be building out some external libraries (Ruby, Python, etc.). What language are you guys writing in? It looks like Java and Scala from the repo.
We do a lot in Java and Scala, but most of our ETL is in Ruby. So Ruby would be fine, and we can potentially help with a Java port in due course if that would be helpful!