04-02-2010
Get List of Unique File Names
I have a large directory of web pages. I am doing a search through the web pages using grep and would like to get a list of unique file names of search results. The following command works fine to give me a list of file names where
term appears:
grep -l
term *.html
However, since these are web pages, the
term that I'm looking for might also be an html tag (i.e. <strong>, <body>, etc) and I don't want those to to be in the search results. To filer out certain tags, I'm using the following command:
grep
term *.html | grep -v
filter
But if
term appears more than once in a web page, the file name shows up more than once in the results.
Is there a way to get a unique list of filenames after applying a "grep filter (grep -v)"? Piping things to
grep -l doesn't seem to work.
I've tried using these commands, but they don't work:
- grep term *.html | grep -v filter | grep -l
- grep term *.html | grep -v filter | grep -l term
- grep term *.html | grep -v filter | xargs grep -l
- grep -l term `grep term *.html | grep -v filter`
Ray
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hi Gurus,
I need to list only those files which are most recent and the latest log file.
For example if I have the following file list:
JOB001.LOG_00uv02_00006 - Jul 7 12:16
JOB001.LOG_00vi0t_00001 - Aug 4 21:58
JOB001.LOG_00vi0t_00002 - Aug 5 09:15
JOB001.LOG_00vi0t_00003 - Aug 5... (6 Replies)
Discussion started by: shash
6 Replies
2. Shell Programming and Scripting
I have the following list of file names stored in $fnames, so that if I do
foreach f ($fnames)
echo "$f"
end
I will get
n02-z30-sr65-rgdt0p50-dc0p002-16x12drw-run1
n02-z30-sr65-rgdt0p50-dc0p002-16x12drw-run2
n02-z30-sr65-rgdt0p50-dc0p002-16x12drw-run3... (3 Replies)
Discussion started by: kristinu
3 Replies
3. Shell Programming and Scripting
Hi Experts,
Here is my scenario:
Am maintaining a file which has list of logs with complete path and file names like bleow
a/b/c/Daily/file1_20111012.log
d/e/f/Monthly/file1_20111001.log
g/h/Daily/file1_20110120.log
i/Daily/file1_20110220.log
How to copy the file names frm the list... (7 Replies)
Discussion started by: laxm
7 Replies
4. Shell Programming and Scripting
I need to take the second column of a .csv file and count the number of instances of each unique value in that same second column. I'd like the output to be value,count sorted by most instances. Thanks for any guidance!
Data example:
317476,317756,0
816063,318861,0
313123,319091,0... (4 Replies)
Discussion started by: batcho
4 Replies
5. Shell Programming and Scripting
I have 84 files with the following names splitseqs.1, spliseqs.2 etc.
and I want to change the .number to a unique filename.
E.g.
change splitseqs.1 into splitseqs.7114_1#24
and
change spliseqs.2 into splitseqs.7067_2#4
So all the current file names are unique, so are the new file names.... (1 Reply)
Discussion started by: avonm
1 Replies
6. Shell Programming and Scripting
Hi All,
I have a .txt file with the list of filenames as given below.
/dev_data/dev3/ctl/20120622_Employee.txt
/dev_data/dev3/ctl/20120623_Employee.txt
/dev_data/dev3/ctl/20120624_Employee.txt
I want to read this file & write the dates alone from the filenames into a .done file.
... (6 Replies)
Discussion started by: dsfreddie
6 Replies
7. Shell Programming and Scripting
Hello,
I have two directories - prev and current . They both have same multiple subdirectories and files. Now the current directory can have some updated files and some new files added that is not in prev. I want to find the list of file names that differ. I am doing this because i can not... (2 Replies)
Discussion started by: jakSun8
2 Replies
8. UNIX for Dummies Questions & Answers
Hi All,
Can any one help me to list out the directory names which contain the specified file.
See for example
File name : file.201307014.LKT
Have the directory structure as below.
/app/work/data/INDIA/file.201307014.LKT
/app/work/data/AMERICA/file.201307014.KTP... (5 Replies)
Discussion started by: Balasankar
5 Replies
9. Shell Programming and Scripting
I'm attempting to write a script that will look into a directory, discover all unique names, create directories for their prefix names and place the files inside. I do this now one by one with a simple script but I'd like to automate the process.
The directory would contain something like: ... (6 Replies)
Discussion started by: scribling
6 Replies
10. UNIX for Beginners Questions & Answers
Dear Users,
Appreciate your help if you could help me with splitting a large file > 1 million lines with sed or awk. below is the text in the file
input file.txt
scaffold1 928 929 C/T +
scaffold1 942 943 G/C +
scaffold1 959 960 C/T +... (6 Replies)
Discussion started by: kapr0001
6 Replies
LEARN ABOUT XFREE86
zgrep
ZGREP(1) General Commands Manual ZGREP(1)
NAME
zgrep - search possibly compressed files for a regular expression
SYNOPSIS
zgrep [ grep_options ] [ -e ] pattern filename...
DESCRIPTION
Zgrep invokes grep on compressed or gzipped files. These grep options will cause zgrep to terminate with an error code:
(-[drRzZ]|--di*|--exc*|--inc*|--rec*|--nu*). All other options specified are passed directly to grep. If no file is specified, then the
standard input is decompressed if necessary and fed to grep. Otherwise the given files are uncompressed if necessary and fed to grep.
If the GREP environment variable is set, zgrep uses it as the grep program to be invoked.
EXIT CODE
2 - An option that is not supported was specified.
AUTHOR
Charles Levert (charles@comm.polymtl.ca)
SEE ALSO
grep(1), gzexe(1), gzip(1), zdiff(1), zforce(1), zmore(1), znew(1)
ZGREP(1)