01-06-2009
awk grep sed or something better
Hello all,
Can anyone help with the following?
I have file1 with 150,000 words in a list and file2 with 148,000 words in a list, all of which are also in file1. I want to create a new file containing the words that do NOT match (i.e. about 2,000 words). I tried this very simple command, which is still running after 4 hours and has not completed:
grep -v -f file2 file1 > output
Does anyone know of any quicker method?
Thank you
Layla
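One possible cause of the slowness is that, without further options, grep treats every line of file2 as a regular expression and looks for it anywhere inside each line of file1. A minimal sketch, assuming both files hold one word per line and that only whole-line matches count: -F treats the patterns as fixed strings and -x requires a pattern to match the entire line. The temporary directory and the sample words below are made up for illustration; only the file names file1, file2 and output come from the post.

```shell
#!/bin/sh
# Hypothetical sample data standing in for the real word lists.
workdir=$(mktemp -d)
printf '%s\n' apple banana cherry date elderberry > "$workdir/file1"
printf '%s\n' banana date apple > "$workdir/file2"

# -F: treat each line of file2 as a fixed string, not a regular expression
# -x: a pattern must match a whole line of file1
# -v: print the lines of file1 that match none of the patterns
grep -F -x -v -f "$workdir/file2" "$workdir/file1" > "$workdir/output"

cat "$workdir/output"
```

Unlike comm, this form does not need sorted input and keeps the words in their file1 order. Whether it is fast enough on the real lists depends on the grep implementation, so treat it as something to try rather than a guaranteed fix.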
comm(1) User Commands comm(1)
NAME
comm - select or reject lines common to two files
SYNOPSIS
comm [-123] file1 file2
DESCRIPTION
The comm utility reads file1 and file2, which must be ordered in the current collating sequence, and produces three text columns as output:
lines only in file1; lines only in file2; and lines in both files.
If the input files were ordered according to the collating sequence of the current locale, the lines written will be in the collating
sequence of the original lines. If not, the results are unspecified.
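The ordering requirement above can be illustrated with a short sketch. It assumes unsorted one-word-per-line inputs and sorts both under the same locale (LC_ALL=C here, an arbitrary choice) before calling comm, so that sort and comm agree on the collating sequence. The directory, file names and words are hypothetical.

```shell
#!/bin/sh
# Hypothetical unsorted sample files.
workdir=$(mktemp -d)
printf '%s\n' pear apple fig cherry > "$workdir/file1"
printf '%s\n' fig apple > "$workdir/file2"

# Sort both inputs under one locale so comm sees the order it expects.
LC_ALL=C sort "$workdir/file1" > "$workdir/file1.sorted"
LC_ALL=C sort "$workdir/file2" > "$workdir/file2.sorted"

# Default output has three tab-indented columns:
#   no tab   = only in file1
#   one tab  = only in file2
#   two tabs = in both files
LC_ALL=C comm "$workdir/file1.sorted" "$workdir/file2.sorted" > "$workdir/columns"

cat "$workdir/columns"
```

With this sample data, cherry and pear appear in the first column and apple and fig in the third; the second column is empty because every word of file2 is also in file1.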
OPTIONS
The following options are supported:
-1 Suppresses the output column of lines unique to file1.
-2 Suppresses the output column of lines unique to file2.
-3 Suppresses the output column of lines duplicated in file1 and file2.
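Because each option removes a column, combining two of them leaves exactly one. The sketch below, on hypothetical files a and b that are already sorted in the C locale, shows the three useful pairs. The -23 pair is the one that would list words present only in the first file, which is the task described in the question at the top of the thread.

```shell
#!/bin/sh
# Hypothetical sample files, already in C-locale sorted order.
workdir=$(mktemp -d)
printf '%s\n' apple cherry fig pear > "$workdir/a"
printf '%s\n' apple fig kiwi > "$workdir/b"

# -23: drop columns 2 and 3, leaving lines only in the first file
only_a=$(LC_ALL=C comm -23 "$workdir/a" "$workdir/b")
# -13: drop columns 1 and 3, leaving lines only in the second file
only_b=$(LC_ALL=C comm -13 "$workdir/a" "$workdir/b")
# -12: drop columns 1 and 2, leaving lines common to both files
both=$(LC_ALL=C comm -12 "$workdir/a" "$workdir/b")

printf 'only in a:\n%s\n' "$only_a"
printf 'only in b:\n%s\n' "$only_b"
printf 'in both:\n%s\n' "$both"
```

comm makes a single pass over each sorted file, so once the inputs are sorted the comparison itself is a simple merge.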
OPERANDS
The following operands are supported:
file1 A path name of the first file to be compared. If file1 is -, the standard input is used.
file2 A path name of the second file to be compared. If file2 is -, the standard input is used.
USAGE
See largefile(5) for the description of the behavior of comm when encountering files greater than or equal to 2 Gbyte (2**31 bytes).
EXAMPLES
Example 1: Printing a list of utilities specified by files
If file1, file2, and file3 each contain a sorted list of utilities, the command
example% comm -23 file1 file2 | comm -23 - file3
prints a list of utilities in file1 not specified by either of the other files. The entry:
example% comm -12 file1 file2 | comm -12 - file3
prints a list of utilities specified by all three files. And the entry:
example% comm -12 file2 file3 | comm -23 - file1
prints a list of utilities specified by both file2 and file3, but not specified in file1.
ENVIRONMENT VARIABLES
See environ(5) for descriptions of the following environment variables that affect the execution of comm: LANG, LC_ALL, LC_COLLATE,
LC_CTYPE, LC_MESSAGES, and NLSPATH.
EXIT STATUS
The following exit values are returned:
0 All input files were successfully output as specified.
>0 An error occurred.
ATTRIBUTES
See attributes(5) for descriptions of the following attributes:
+-----------------------------+-----------------------------+
|       ATTRIBUTE TYPE        |       ATTRIBUTE VALUE       |
+-----------------------------+-----------------------------+
|Availability                 |SUNWesu                      |
+-----------------------------+-----------------------------+
|CSI                          |enabled                      |
+-----------------------------+-----------------------------+
|Interface Stability          |Standard                     |
+-----------------------------+-----------------------------+
SEE ALSO
cmp(1), diff(1), sort(1), uniq(1), attributes(5), environ(5), largefile(5), standards(5)
SunOS 5.10 3 Mar 2004 comm(1)