I'm becoming a stronger and stronger advocate of teaching command-line interfaces even to novice programmers...it's easier in many ways to think of data being worked on by "filters" and "pipes"...and more importantly, every time you try a step, something happens, which makes it much easier to iterate interactively through a process.
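A minimal sketch of the filters-and-pipes idea: each stage is a small program that does one thing, and you can run the pipeline one stage at a time to see something happen at every step. The log file and its format are hypothetical, fabricated here just for the demo.

```shell
# Fabricate a tiny sample log (method + path + status per line).
printf 'GET /a 200\nGET /b 404\nGET /c 200\n' > /tmp/access_sample.log

# Count requests per status code: extract -> sort -> count -> rank.
# Try each stage alone (stop after "sort", say) to inspect intermediate data.
awk '{print $3}' /tmp/access_sample.log | sort | uniq -c | sort -rn
```

Because every stage reads stdin and writes stdout, swapping or dropping a filter is a one-edit experiment rather than a rewrite.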
That it also happens to be very fast and powerful (when memory isn't a limiting factor) is nice icing on the cake. I moved over to doing much more on the CLI after realizing that something as simple as "head -n 1 massive.csv" to inspect the headers of corrupt multi-GB CSV files made my data-munging life substantially more enjoyable than opening them in Sublime Text.
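For the CSV-peeking trick, a small sketch: `head` only reads the first block of the file, so file size is irrelevant. The "massive.csv" name is a stand-in; a tiny sample is fabricated here so the commands are runnable.

```shell
# Stand-in for a multi-GB file; only the header matters for this demo.
printf 'id,name,amount\n1,alice,10\n' > /tmp/massive_sample.csv

# Grab just the header row -- constant time regardless of file size.
head -n 1 /tmp/massive_sample.csv

# Same header, one column name per line, easier to scan for wide files.
head -n 1 /tmp/massive_sample.csv | tr ',' '\n'
```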
A few years ago between projects, my coworkers cooked up some satirical "amazing Web 2.0" data science tools. They used git, did a screencast, and distributed it internally.
It was basically a few compiled perl scripts and some obfuscated shell scripts with a layer of glitz. People actually used it and LOVED it... It was supposedly better than the real tools some groups were using.
It was one of the more epic work trolls I've ever seen!
Your CSV-peeking epiphany was in essence a matter of code vs. tools, though, rather than necessarily CLI vs. GUI. On Windows you might just as well have discovered that you could fire up LINQPad and enter File.ReadLines("massive.csv").First(), for example.
In a real production environment, that command line would be put into a script parameterized with named variables, and the embedded awk scripts would be changed to here-docs.
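A sketch of what that production-izing step might look like: named variables up top instead of positional magic, and the awk program held in a here-doc rather than a cramped inline string. The file name, column number, and data are all hypothetical.

```shell
#!/bin/sh
# Named parameters with defaults (hypothetical input file and column).
INPUT=${1:-/tmp/sales_sample.csv}
COLUMN=${2:-2}

# Demo data so the script is self-contained.
printf 'a,10\nb,20\na,5\n' > /tmp/sales_sample.csv

# The awk program as a quoted here-doc: multi-line, no shell-escaping
# headaches, and the shell never expands anything inside it.
PROG=$(cat <<'AWK'
{ sum += $col }
END { print sum }
AWK
)

# Pass the column in via -v rather than interpolating it into the program.
awk -F, -v col="$COLUMN" "$PROG" "$INPUT"
```

Keeping the awk source out of the shell's quoting rules (the quoted `<<'AWK'` delimiter) is the main win: the script stays readable as it grows.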
Sounds good, although at that point it's just programming, and there are tools that are cleaner, faster, and more robust than piping semi-structured strings around from a command line.
The one real benefit that can be argued is ubiquity (on *ix). Not every system has Perl, Python, or Ruby installed - or Hadoop for that matter - but there's usually a programmable shell and some variant of the standard utilities that will get something done in a pinch. If it happens to be 200x faster than some enormous framework, so much the better.
The code you're replying to was carefully and correctly written. You just replied as if you knew how it works so you could look like you know what you're talking about.
If you're unlucky, someone who actually knows how File.ReadLines() works will show up in an hour or two and explain that it's lazily evaluated.