Useful Uses of cat - Readit News

The reason these sequences of commands always start with cat, for me at least, is that I just cat’d the file only to find it was too long or noisy.

    cat filename.txt 

    Up | grep “thing I want”

Is fewer keystrokes than

    cat filename.txt

    grep “thing I want” filename.txt

Or more likely

    cat filename.txt

    grep filename.txt “thing I want” 

    grep “thing I want” filename.txt

hnlmorg · 2 years ago

You can also do:

    grep “thing I want” !$

Bash (and similar) will replace !$ with the last parameter of the previous command.

This is a trick I’ve used lots when wanting to perform a non-piping operation on a command I’ve ‘cat’ed (eg ‘rm -v !$’)

I’d never criticise anyone for “useless” use of ‘cat’ though. If the fork() overhead was really that critical then it wouldn’t be a shell command to begin with.

mmh0000 · 2 years ago

Even easier, press alt+. (fewer keystrokes too! cli golf is fun), it'll copy-and-paste the last parameter from the previous command. If you press . multiple times it'll go further back into your history.

bumblebeast · 2 years ago

I just use <esc>. On the command line to bring back the last argument.

Then I can look at it before hitting return

bee_rider · 2 years ago

I’m sure there are cases where the fork overhead matters. But, alas, I don’t type or read that fast.

77pt77 · 2 years ago

!! is the previous command line BTW

useful shorthand

> sudo !!

for example.

Again, bash specific.

dllthomas · 2 years ago

<!$ grep ... or <$_ grep ... work, too.

neuromanser · 2 years ago

Press alt and dot (full stop) to insert last word from the previous command line:

    $ cat file
    $ grep stuff alt-.

Alternatively, make use off the READNULLCMD mechanism in Zsh:

    $ < file

translates to

    $ ${READNULLCMD:-more} < file

Thus you can

    $ < file

then UP (or ctrl-p which I find more ergonomic) and continue with "grep stuff":

    $ < file grep stuff

(Redirections can be anywhere in the command.)

https://zsh.sourceforge.io/Doc/Release/Redirection.html#Redi...

sophacles · 2 years ago

I look through a lot of logs, so I've aliased `not` to `grep -vE` and using the method you describe over several iterations i end up with a history that has a lot of

    cat log | not spam1 | not spam2 | not 'spam(3|4)' | .... | less

lambdaba · 2 years ago

Zsh global aliases can be nice for commands commonly used in pipelines:

  alias -g V='| grep -vE'
  alias -g L="| $PAGER"

Then you can do

  cat log V spam1 V spam2 L

I also like

  alias -g G='| grep'
  alias -g X='| xargs'

The possibilities are endless...

mmh0000 · 2 years ago

I've been using Linux for a very long time and never thought of that as an alias. I love it! Thank you for sharing.

kqr · 2 years ago

I really like that alias. Thanks!

deathanatos · 2 years ago

I've always though UUoC while working interactively are fine. One shouldn't interrupt their flow to fix minutia in a one time command.

UUoC criticism, to me, belongs when one sits down to script.

0cf8612b2e1e · 2 years ago

Eventually I will want to debug/change the script where we get right back to the situation where I want cat at the front of the line.

skipkey · 2 years ago

Or what I find myself doing often:

cat “filename.txt”

Up | grep “thing”

Up | grep -v “not thing”

Up | grep -v “other thing”

Etc. it’s just easier to build this way even if the initial cat is unnecessary.

mdekkers · 2 years ago

> grep filename.txt “thing I want”

> grep “thing I want” filename.txt.

…every time

hawski · 2 years ago

Try:

  < file.txt grep pattern

Less keystrokes.

FreeFull · 2 years ago

That doesn't work for the first step though, where you want to print the file to stdout without processing it in any way

Deleted Comment

The concept presented is something I can agree with in principle, but "transforming a filename into the content of the file" is a really thin justification for a responsibility.

By all means don't build something where you have cascading effect and need to retest an entire pipeline, but this is _not_ it.

P.S.

And if you really really want to keep it separate, just do "< access.log head -500 etc etc etc" (no I didn't forget a pipe. And yes the "< inputfile" works even if it's in front of what you're calling).

hk__2 · 2 years ago

> And if you really really want to keep it separate, just do "< access.log head -500 etc etc etc" (no I didn't forget a pipe. And yes the "< inputfile" works even if it's in front of what you're calling).

Or just use `cat` and let the pipe separate the different steps. "< access.log head" is nice but it breaks this representation where each step is piped into the next one. Sure, once you’re done fiddling you can rewrite the thing to remove the "cat", but when you are constructing the thing I find it clearer to use cat.

stouset · 2 years ago

> Sure, once you’re done fiddling you can rewrite the thing to remove the "cat"

Or just… don’t?

kristjansson · 2 years ago

> it breaks this representation where each step is piped into the next one

No it doesn't?

  < access.log head -n 500 | grep mail | perl -e …

is completely valid, and reads right-to-left as well as the cat version. IMO using stdin is preferable to either solution in TFA.

ajross · 2 years ago

> "transforming a filename into the content of the file" is a really thin justification for a responsibility.

Uh... I dunno, but my lizard brain thinks that the whole idea of mediating filesystem operations on storage and IPC mechanisms like pipes is a lot more complicated, magic, and deserving of a single command than merely filtering the data on stdin.

I agree with the article and the logic, and think this historic meme was basically wrong originally. You string up your chain of pipelines with the first element being "where does it come from?" and not merely whatever the first operation happens to be just because that operation allows for some kind of file input or redirection syntactically.

TOGoS · 2 years ago

> transforming a filename into the content of the file" is a really thin justification for a responsibility

This is one of those things where I think it is until it isn't.

I sometimes second-guess myself when I think I might be over-single-responsibilifying. "Well in practice these two things are so trivial that this feels a little silly."

It often turns out to have been a good call in hindsight, especially when working with other people who aren't necessarily thinking about these things at all. If the responsibilities have been sufficiently split up, they're more likely to change only the part that needed to be changed, and less likely to complectify the two things together that really shouldn't've been. Or when I go "oh wow that thing that I thought I overly-abstracted sure composes well with this unexpected new thing!"

Hardcore separation of concerns is just another method of defensive programming.

> < access.log head -500 etc etc etc

It's too bad that the syntax is so different. Why does the first stage not end with "|"? There's space for shell syntax improvements, here. Maybe a 'cat'-like builtin that translates `cat foo | bar` into `bar <foo` so you can have the nice syntax but don't needlessly create processes would leave everyone happy.

jiveturkey · 2 years ago

in the example given in the website, it could all be done in the perl! no need for the pipeline at all.

cat /dev/sdb > backup.img # make a disk image cat /dev/sdb > /dev/sdc # clone disk cat ~/Downloads/* # play Russian roulette with your terminal cat > file # minimalistic text editor, ^D to exit saving, ^C to exit erasing the file cat << wq > file # nearly complete emulation of ed grep -r bongo . | cat # shorter than typing --color=never cat -v file # cause 20 points of damage to wizards of bell labs cat file > file # empty a file without removing the file cat meow meow > meows # duplicate file contents

TacticalCoder · 2 years ago

What's really useless are all the "useless use of cat" comments (and shellcheck warnings although I take it for a shell script there could be cases where one less process may be justified [although really if you're at that point you've got other things to worry than cat, sadly]).

I use "cat ... | ... | ... " like in TFA and just like many in this thread because it simply makes sense. It's more intuitive. It's easier to read. It requires less braincycles to remember how this or that command wants its parameter passed, etc.

I think the "useless use of cat" movement made its time: it failed. Many of us are never going to give up our use(less|ful) of cat (you decide). So stop wasting your time complaining about it.

thaumaturgy · 2 years ago

I use shellcheck a lot, and I've found so far that its complaints fall neatly into one of two buckets: "oh, whoops, nice catch" and "shut up, shellcheck".

"useless use of cat" goes into the latter bucket. Complaining at me about it does not actually improve the code; it's just a nag about a bad habit that, arguably, isn't even a bad habit.

dataflow · 2 years ago

I think you're mistaken here, and confusing two different usages of cat.

"Useless" uses of cat aren't bad habits during interactive usage, for all the reasons people mention here which I won't rehash.

For scripts, however, the story is different than for one-off commands. For one thing, it's slower due to the extra forks and copying of data across pipes, so there's at least that. For another, it prevents the command from inspecting the other end of its pipe, which can negatively impact usage in some case. (For example, if the program knows its input is from a terminal, it may flush its output on every newline it sees.) Moreover, a bunch of the arguments for the interaction case (like "it's fewer keystrokes" or whatever) don't even apply to the script case in the first place...

The end result here is that you definitely shouldn't assume some habit is just fine with scripting merely because it's fine when you're typing on the terminal, or vice-versa.

gruturo · 2 years ago

VSpike · 2 years ago

I definitely think if you want to use `cat` then just go ahead, it's fine. Sometimes these things are a power play, a way to distinguish between people who know the social codes and those who don't. In this case, it probably had a reasonable origin even if it's now more of a way to beat on newcomers. On old systems, memory was limited, disk was slow and forking was expensive. Saving a process in a script or one liner was a noticeable improvement performance-wise.

I learned some bash from an old-timer who would write an infinite-loop like this:

  while :; do 
    # loop body here
  done

This works because the `:` is a way to set a label, and it implicitly returns 0. It's just a weird wrinkle of the language. So, why not do `while true`? On old systems, `true` was not a builtin and would call `/usr/bin/true`. Writing the loop this way saves a process fork on each iteration.

On a modern system, you'd be hard pushed to measure the difference, so it really doesn't matter which style you prefer.

OJFord · 2 years ago

> This works because the `:` is a way to set a label, and it implicitly returns 0. It's just a weird wrinkle of the language.

Do you have a source for that? I thought it was just POSIX built-in for true. Like `.` vs. `source`. What's a label in this context anyway?

Hah, yeah I was completely wrong on that! Should have fact checked myself. That's a falsehood I absorbed at some point and didn't question.

cryptonector · 2 years ago

No need to ask for a source. The word "label" in the POSIX shell documentation only occurs in the description of `case`, and it doesn't happen in the manual page for bash, dash, zsh, etc.

penguin_booze · 2 years ago

Equally surprised. I know ':' is a label in sed, but labels in (ba)sh, I'm not aware. If it's indeed a label, is there a goto?

dolmen · 2 years ago

> This works because the `:` is a way to set a label, and it implicitly returns 0.

Nope. Unix shell doesn't have labels (are you mixing with DOS batch files?).

: is a shell builtin that does nothing. In the bash man page, look for the first entry of the "SHELL BUILTIN COMMANDS" section. https://www.gnu.org/software/bash/manual/html_node/Bourne-Sh...

PennRobotics · 2 years ago

infinite loop in C:

  for(;;){
      // loop body here
  }

jxy · 2 years ago

I'm in the camp of using

    <input X|Y|Z >output

The point of this syntax is that I can readily replace it with

    F() { X|Y|Z }
    <input F >output

iforgotpassword · 2 years ago

Since I like spaces around stuff I started putting two spaces after the infile at some point.

  < infile  x | y | z > outfile

I just didn't like how the filename was so close to the command name ;)

CBLT · 2 years ago

I'm in this camp as well, starting with input file redirection just makes so much sense to me.

    </proc/0/environ xargs -0

I also don't tend to want enormous volumes of text in my terminal scrollback so I generally view files or pipe verbose commands to `less`, then when I find what I want to send to the terminal I use the `|` less command to pipe it to `cat`.

Or to grab just a few lines for my later reference:

    kubectl get po/my-pod -o yaml | less
    /* find the lines I'm interest in */
    -N
    |^sed -n 34,35p

adityaathalye · 2 years ago

Oh `F()`. I should use redirection more with my Bash functions, and add it to my list (pun intended) of functional tricks [1].

[1] https://evalapply.org/posts/shell-aint-a-bad-place-to-fp-par...

bloopernova · 2 years ago

This is by far the best explanation I've read for using that method. Thank you!

unhammer · 2 years ago

I have an old note named the same as this blog post:

_pvxk · 2 years ago

Oh and another one I use all the time is as an "identity transform" when selecting between filters, e.g.

    dostuff () { 
        if [[ $1 = clean ]]; then 
            grep -v dirt
        else 
            cat
        fi | do_other_stuff
    }

hiAndrewQuinn · 2 years ago

`cat x | Y | Z` is Subject-Verb-Object.

`Y x | Z` is Verb-Subject-Object.

That's why I prefer using cat "uselessly".

kazinator · 2 years ago

   < x Y | Z > w

Y takes input redirected from x, piped into Z, which outputs into w.

_v7gu · 2 years ago

No, that's obviously taking the inner product of the <x| bra and |Z> ket while applying the Y operator, and multiplying by a scalar w

TeMPOraL · 2 years ago

> Y takes input redirected from x, piped into Z, which outputs into w.

I.e.

  x | Z | tee w | Y

? that's... something else entirely.

The shell < has never intuitive to me, for some reason, but this has helped me see it in a new light. Thanks!

cat is a verb though

bowsamic · 2 years ago

cat x is a subject

pwdisswordfishc · 2 years ago

"x" is not a subject.

GuB-42 · 2 years ago

There is another reason: to make sure the program doesn't do anything funny with the file, like modifying it. I know it won't happen with "head" but for commands I am not familiar with, it is a way to be sure. And example of a command that does "something funny" is gunzip. With just a file as an argument, it will decompress the .gz file and erase the original instead of reading it and dumping its content to stdout.

I usually prefer to do "< file command" though. "cat" adds an extra layer of indirection, forcing stream processing and hiding the original file, but that's usually unnecessary. If you really don't trust the program, it is not an adequate solution anyways.

If you want to hide the file's device and inode number from the program that will be consuming the file's contents, then, yes, using cat makes sense. I've never had to do that. Just redirecting stdin is enough.

Or use zcat.