How to ignore certain filenames using "find"?

148

24

One of my favorite BASH commands is:

find . -name '*.*' -exec grep 'SearchString' {} /dev/null \;

which searches the contents of all of the files at and below the current directory for the specified SearchString. As a developer, this has come in handy at times.

Due to my current project, and the structure of my codebase, however, I'd like to make this BASH command even more advanced by not searching any files that are in or below a directory that contains ".svn", or any files that end with ".html"

The MAN page for find kind of confused me though. I tried using -prune, and it gave me strange behavior. In an attempt to skip only the .html pages (to start), I tried :

find . -wholename './*.html' -prune -exec grep 'SearchString' {} /dev/null \;

and did not get the behavior I was hoping for. I think I might be missing the point of -prune. Could you guys help me out?

Thanks

Cody S

Posted 2012-03-05T23:28:16.087

Reputation: 1 704

2You can search inside file with grep -rl 'SearchString' – emanuele – 2014-11-10T14:43:35.743

@emanuele Hi, welcome to SuperUser (and the Stack Exchange network). This is a question I asked, and that was answered, 2 1/2 years ago. Typically, if you would like to add an answer to the question, please do so by scrolling to the bottom and answering there, instead of in a comment. Since this question already has an accepted answer (the one with the green checkmark), it's unlikely that your answer is going to get much attention, however. FYI. – Cody S – 2014-11-10T22:52:47.440

2Hi, it is not an answer to your question. It is only a tip, as you stated in preamble that use find to search inside a file. – emanuele – 2014-11-11T12:48:14.980

2FWIW, -name '*.*' does not find all files: only those with a . in their name (the use of *.* is typically an DOS-ism, whereas in Unix, you normally use just * for that). To really match them all, just remove the argument altogether: find . -exec .... Or if you want to only apply grep to files (and skip directories) then do find . -type f -exec .... – Stefan – 2016-02-20T01:49:50.717

1Just fyi: find is not a build-in bash command but a separate program – jhenninger – 2012-03-06T00:54:43.853

Answers

209

You can use the negate (!) feature of find to not match files with specific names:

find . ! -name '*.html' ! -path '*.svn*' -exec grep 'SearchString' {} /dev/null \;

So if the name ends in .html or contains .svn anywhere in the path, it will not match, and so the exec will not be executed.

Paul

Posted 2012-03-05T23:28:16.087

Reputation: 52 173

5I think you want -wholename '*.svn*' rather than -name. – fuenfundachtzig – 2015-05-09T22:35:24.060

@fuenfundachtzig wholename just does the match against the path as well as the file name doesn't it? How would that improve the answer? If it does in a way I am missing, feel free to edit the answer and enhance it. – Paul – 2015-05-10T11:55:13.810

2Yes, it does, so that the .svn directories are excluded from the search results. – fuenfundachtzig – 2015-05-10T12:41:57.017

@Paul The desired effect is to exclude "files that are in or below a directory that contains .svn", so path (or wholename, but path is more portable) is more accurate than name for the answer. They questioner doesn't appear to have any files with .svn in the name. – Stephen Schrauger – 2016-08-27T23:41:06.170

I tried this with -exec echo mv {} .. and it told me it was going to try to move . to ... Do you need to do something to avoid that? – Noumenon – 2016-10-24T02:42:26.093

1@Noumenon ! -name '.' should exclude . from the find results. – Paul – 2016-10-24T02:57:28.627

Is there a description of which versions of find this '!' syntax is compatible with? It seems to not work with my Ubuntu 14.04 server – user5359531 – 2018-10-02T00:34:15.450

1Should I still specify -name '.' somewhere in there? Would I do that before, or after the negations? – Cody S – 2012-03-06T01:21:06.200

Was the intention of your *.* match to ensure it only matched files containing a .? Find will match all files in the absence of a name directive, so the above will match everything except html and svn – Paul – 2012-03-06T01:23:51.920

12

I've had the same issue for a long time, and there are several solutions which can be applicable in different situations:

  • ack-grep is a sort of "developer's grep" which by default skips version control directories and temporary files. The man page explains how to search only specific file types and how to define your own.
  • grep's own --exclude and --exclude-dir options can be used very easily to skip file globs and single directories (no globbing for directories, unfortunately).
  • find . \( -type d -name '.svn' -o -type f -name '*.html' \) -prune -o -print0 | xargs -0 grep ... should work, but the above options are probably less of a hassle in the long run.

l0b0

Posted 2012-03-05T23:28:16.087

Reputation: 6 306

9

The following find command does prune directories whose names contain .svn, Although it does not descend into the directory, the pruned path name is printed ...(-name '*.svn' is the cause!) ..

You can filter out the directory names via: grep -d skip which silently skips such input "directory names".

With GNU grep, you can use -H instead of /dev/null. As a slight side issue: \+ can be much faster than \;, eg. for 1 million one-line files, using \; it took 4m20s, using \+ it took only 1.2s.

The following method uses xargs instead of -exec, and assumes there are no newlines \n in any of your file names. As used here, xargs is much the same as find's \+.

xargs can pass file-names which contain consecutive spaces by changing the input delimiter to '\n' with the -d option.

This excludes directories whose names contain .svn and greps only files which don't end with .html.

find . \( -name '*.svn*' -prune  -o ! -name '*.html' \) |
   xargs -d '\n' grep -Hd skip 'SearchString'

Peter.O

Posted 2012-03-05T23:28:16.087

Reputation: 2 743

1Thank you for pointing out the \+ variant of the -exec action. Hooray for slight side issues! – Christian Long – 2014-09-12T18:10:09.150

Of course, since + is not a special character to the shell, you don't need to type \ before it. – Scott – 2016-08-28T00:35:38.503