This is cool. For anyone who has Cascading experience: what tuning can you do for the underlying Hadoop jobs, or does it autotune? And how does performance compare to running multiple MapReduce jobs in sequence?
It would be awesome to see this compared to Microsoft's Dryad (http://research.microsoft.com/research/sv/Dryad/), which also supports DAG-like large-scale computing. I don't think Dryad is publicly available though...