Sponsored Content
Top Forums Shell Programming and Scripting Bash - Find files excluding file patterns and subfolder patterns Post 303003829 by jim mcnamara on Thursday 21st of September 2017 07:13:58 AM
Old 09-21-2017
Maintaining something like that is hard - as you found out.
Consider splitting up the lists, make a file with the exclusion patterns, one pattern per line.
Call it exclusions. Changes in red.

Example:
Code:
find "$PATH1"  -type f \( -name "*" ! -iname "*~"  ! -iname "*.iso"  ! -iname "*.tar"  ! -iname "*.bz2"  ! -iname "*.gz"
  ! -iname "*.tgz"  ! -iname "*.7z"  ! -iname "*.sfx"  ! -iname "*.zip"  ! -iname "*.rpm" 
 ! -iname "*.gz.aa"  ! -iname "*.rpm"  ! -iname "*.pdf"  ! -iname "*.png"  ! -iname "*.jpg"  ! -iname "*.dll" 
 ! -iname "*.exe"  ! -iname "*.xsession*"  \) -a -type d \( -name "/etc/gconf/gconf.*" -o -name ".cache" 
-o -name ".Cache" -o -name ".mozilla" -o -name ".googleearth"  \) -prune  -o -print    | grep -v -f $HOME/exclusions > resultfile 

In order to test exclusions you can feed it output from a directory that had oddball file names
Code:
ls -a /path/to/somedir | grep  -f $HOME/exclusions   # note: no -v option

 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

To find 3 patterns in a file

hy i have a requirement in which my script needs to find 3 patterns in a file and if any pattern is missing it should sent a mail Patterns Interval60min_Daily_readings$a.txt Interval_Daily_readings$a.txt Daily_readings$a.txt Basically i want to test for the above Patterns in the... (2 Replies)
Discussion started by: ali560045
2 Replies

2. Shell Programming and Scripting

Searching patterns in 1 file and deleting all lines with those patterns in 2nd file

Hi Gurus, I have a file say for ex. file1 which has 3500 lines in it which are different account numbers and another file (file2) which has 230000 lines in it. I want to read all the lines in file1 and delete all those lines from file2 which has that same pattern as in file1. I am not quite... (4 Replies)
Discussion started by: toms
4 Replies

3. Shell Programming and Scripting

Find files that do not match specific patterns

Hi all, I have been searching online to find the answer for getting a list of files that do not match certain criteria but have been unsuccessful. I have a directory that has many jpg files. What I need to do is get a list of the files that do not match both of the following patterns (I have... (21 Replies)
Discussion started by: nikos-koutax
21 Replies

4. Shell Programming and Scripting

Excluding patterns from a list

I have the following code that takes the command line arguments. However I want to remove from the command line list the user options. For example, removing --quiet --shift=3 sort=4/5/6 I have written the following code to take care of this situation. set strLst = `echo $argv | tr '... (3 Replies)
Discussion started by: kristinu
3 Replies

5. UNIX for Dummies Questions & Answers

Find diff between two patterns in two files and append

Hi, I'm a newbie at programming in Unix, and I seem to have a task that is greater than I can handle. Trying to learn awk by the way (but in the end, i just need something that works). My goal is to compare two files and output the difference between the two. I've been reading, and I think I... (5 Replies)
Discussion started by: legato22
5 Replies

6. Shell Programming and Scripting

Find matched patterns in multiple files

Hi, I need help to find matched patterns in 30 files residing in a folder simultaneously. All these files only contain 1 column. For example, File1 Gr_1 st-e34ss-11dd bt-wwd-fewq pt-wq02-ddpk pw-xsw17-aqpp Gr_2 srq-wy09-yyd9 sqq-fdfs-ffs9 Gr_3 etas-qqa-dfw ddw-ppls-qqw... (10 Replies)
Discussion started by: redse171
10 Replies

7. Shell Programming and Scripting

Find matched patterns in a column of 2 files with different size and merge them

Hi, i have input files like below:- input1 Name Seq_ID NewID Scores MT1 A0QZX3 1.65 277.4 IVO A0QZX3 1.65 244.5 HPO A0QZX3 1.65 240.5 RgP A0Q3PP 5.32 241.0 GX1 LPSZ3S 96.1 216.9 MEL LPSS3X 4.23 204.1 LDD LPSS3X 4.23 100.2 input2 Fac AddName NewID ... (9 Replies)
Discussion started by: redse171
9 Replies

8. Shell Programming and Scripting

Find matched patterns and print them with other patterns not the whole line

Hi, I am trying to extract some patterns from a line. The input file is space delimited and i could not use column to get value after "IN" or "OUT" patterns as there could be multiple white spaces before the next digits that i need to print in the output file . I need to print 3 patterns in a... (3 Replies)
Discussion started by: redse171
3 Replies

9. Shell Programming and Scripting

Help needed in excluding certain word patterns

Hi, I need help with following. I need to exclude words that match following patterns a. more than length 4 (example SBRAP) b. contains mixture uppercase and lower case regardless of the length (example GSpD) File contains COFpC MCHX SP SNFCA GEH SBRAP DGICA JPMpE WFCpP GSpD AXL... (5 Replies)
Discussion started by: jakSun8
5 Replies

10. Shell Programming and Scripting

Find files not matching multiple patterns and then delete anything older than 10 days

Hi, I have multiple files in my log folder. e.g: a_m1.log b_1.log c_1.log d_1.log b_2.log c_2.log d_2.log e_m1.log a_m2.log e_m2.log I need to keep latest 10 instances of each file. I can write multiple find commands but looking if it is possible in one line. m file are monthly... (4 Replies)
Discussion started by: wahi80
4 Replies
SGMLNORM(1)															       SGMLNORM(1)

NAME
sgmlnorm - normalize SGML documents An SGML System Conforming to International Standard ISO 8879 -- Standard Generalized Markup Language SYNOPSIS
sgmlnorm [ -Cdemnv ] [ -bbctf ] [ -ccatalog ] [ -Ddir ] [ -iname ] [ -wwarning ] sysid... DESCRIPTION
sgmlnorm prints on the standard output a normalized document instance for the SGML document contained in the concatenation of the entities with system identifiers sysid... When the normalized instance is prefixed with the original SGML declaration and prolog, it will have the same ESIS as the original SGML document, with the following exceptions: o The output of sgmlnorm does not protect against the recognition of short reference delimiters, so any USEMAP declarations must be removed from the DTD. o The normalized instance will use the reference delimiters, even if the original instance did not. o If marked sections are included in the output using the -m option, the reference reserved names will be used for the status keywords even if the original instance did not. o Any ESIS information relating to the SGML LINK feature will be lost. The normalized instance will not use any markup minimization features except that: o Any attributes that were not specified in the original instance will not be included in the normalized instance. (Current attributes will be included.) o If the declared value of an attribute was a name token group, and a value was specified that was the same as the name of the attribute, then the attribute name and value indicator will be omitted. For example, with HTML sgmlnorm would output <DL COMPACT> rather than <DL COMPACT="COMPACT">. OPTIONS
-bbctf Use the BCTF name bctf for output. See nsgmls(1) for more information about this option. -cfile Use the catalog entry file file. -C The filename... arguments specify catalog files rather than the document entity. The document entity is specified by the first DOCU- MENT entry in the catalog files. -d Output a document type declaration with the same external identifier as the input document, and with no internal declaration subset. No check is performed that the document instance is valid with respect to this DTD. -Ddirectory Search directory for files specified in system identifiers. Multiple -D options are allowed. See nsgmls(1) for more information about this option. -e Describe open entities in error messages. -iname Pretend that <!ENTITY % name "INCLUDE"> occurs at the start of the document type declaration subset in the SGML document entity. Since repeated definitions of an entity are ignored, this definition will take precedence over any other definitions of this entity in the document type declaration. Multiple -i options are allowed. If the SGML declaration replaces the reserved name INCLUDE then the new reserved name will be the replacement text of the entity. Typically the document type declaration will contain <!ENTITY % name "IGNORE"> and will use %name; in the status keyword specification of a marked section declaration. In this case the effect of the option will be to cause the marked section not to be ignored. -m Output any marked sections that were in the input document instance. -n Output any comments that were in the input document instance. -r Raw output. Don't perform any conversion on RSs and REs when printing the entity. The entity would typically have the storage man- ager attribute records=asis. -v Print the version number. -wtype Control warnings and errors according to type. See nsgmls(1) for the allowed values of type. AUTHOR
James Clark <jjc@jclark.com> Conversion to DocBook and man page from original HTML documention by Steve Cheng <steve@ggi-project.org>. SEE ALSO
nsgmls(1) /usr/share/doc/sp/sgmlnorm.htm SP
05 July 1999 SGMLNORM(1)
All times are GMT -4. The time now is 05:37 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy