Removing lines from a file


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Removing lines from a file
# 1  
Old 12-16-2005
Removing lines from a file

Hello

i have 2 files file1 and file2 as shown below

file1
110010000000206|567810008161509
110010000000207|567810072227627
110010000000208|567811368851555
110010000000209|567811422513652
110010000000210|567812130217683
110010000000211|567813220211182
110010000000212|567813449322589
110010000000213|567813741319623
110010000000214|567816323171591
110010000000215|567816660521463
110010000000216|567818208711973
110010000000217|567819516604228
110010000000218|567819540685909
110010000000219|567820748714137
110010000000220|567821948536668
110010000000221|567822556413253

file2

110010000000206
110010000000210
110010000000211
110010000000214
110010000000217
110010000000221

Now i want a third file obtained from file1 which will not have the entries from file2.

ie

110010000000207|567810072227627
110010000000208|567811368851555
110010000000209|567811422513652
110010000000212|567813449322589
110010000000213|567813741319623
110010000000215|567816660521463
110010000000216|567818208711973
110010000000218|567819540685909
110010000000219|567820748714137
110010000000220|567821948536668

Now my problem is that file1 has 10 million entries and file2 has half a million entries. So grep -v option is out. Pls suggest an easy way out.

I seem to get stuck with problems working with big files. Smilie

Regards
Pradeep
# 2  
Old 12-16-2005
It's going to be difficult to parse that many row in any shell script.

Are you not able to load the files into a DB via isql, bcp, or sqlplus?
# 3  
Old 12-16-2005
nawk -f pra.awk file2 file1

pra.awk:
Code:
BEGIN {
  FS=OFS="|"
}
NR==FNR { arr[$1]; next}
!($1 in arr) && !($2 in arr)

# 4  
Old 12-16-2005
Try this

egrep -v -f tmp2 tmp1
# 5  
Old 12-17-2005
Thanks nawk worked!!!

Thats a load off my chest

Thanks a lot man

Regards
Smilie
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Removing lines from a file

I have a file `/tmp/wrk` containing filenames with paths. I want to remove filenames from this file, for example remove all filenames containing alja cagr cavt clta cmdo or corl remove all filenames containing data for days in region `d.2016.001` to `d.2016.207` remove all filenames... (10 Replies)
Discussion started by: kristinu
10 Replies

2. Shell Programming and Scripting

Removing multiple lines from input file, if multiple lines match a pattern.

GM, I have an issue at work, which requires a simple solution. But, after multiple attempts, I have not been able to hit on the code needed. I am assuming that sed, awk or even perl could do what I need. I have an application that adds extra blank page feeds, for multiple reports, when... (7 Replies)
Discussion started by: jxfish2
7 Replies

3. Shell Programming and Scripting

Removing lines from a file

Hi, I have a linux server that was hacked and I have a bunch of files that sporadically contain the following lines through out the file: <?php eval(base64_decode("Xxxxxxxxxxxxxx/xxxxxxxx")); I did't put the exact lines of the file in this post. The "Xxxx" are random letters/numbers.... (8 Replies)
Discussion started by: nck
8 Replies

4. UNIX for Dummies Questions & Answers

removing several lines from a file

Hi folks, I have a long string of DNA sequences, and I need to remove several lines, as well as the line directly following them. For example, here is a sample of my starting material: >548::GY31UMJ02DLYEH rank=0007170 x=1363.5 y=471.0 length=478... (1 Reply)
Discussion started by: kkohl78
1 Replies

5. Shell Programming and Scripting

removing lines from file

Hi I have many files all with 1 field per line as in 12345 abcde john.paul.net 6789101 how do I remove ceratin lines from these files. Have tried sed but sed wrecks my head! Many thanks in advance for any help (9 Replies)
Discussion started by: rob171171
9 Replies

6. Shell Programming and Scripting

Removing Lines From a File

Hi Does anybody know of a command that will enable me to remove all entries in a file that have the format (name & time) more testfile anthony 2003 anthonyr 2008 amorel 15:00 anthonyp 14:35 anthonyp 14:35 anthonyr 2008 ardean 13:28 arlene 2003 arlenem 08:15 arlenem 08:15... (5 Replies)
Discussion started by: jamba1
5 Replies

7. Shell Programming and Scripting

Removing the first and last lines in a file

Hi Gurus, I'm a little new to UNIX. How can I do remove the first and last line in a file? Say, supppose I have a file as below: Code: 1DMA 400002BARRIE 401002CALGARY/LETHBRI 402002CARLETON 500001PORTLAND-AUBRN 501001NEW YORK, NY 502001BINGHAMTON, NY ... (2 Replies)
Discussion started by: naveendronavall
2 Replies

8. AIX

Removing the first and last lines in a file

Hi Gurus, I'm a little new to UNIX. How can I do remove the first and last line in a file? Say, supppose I have a file as below: 1DMA 400002BARRIE 401002CALGARY/LETHBRI 402002CARLETON 500001PORTLAND-AUBRN 501001NEW YORK, NY ... (1 Reply)
Discussion started by: naveendronavall
1 Replies

9. UNIX for Dummies Questions & Answers

Removing lines from a file

I'm trying to find a command which will allow me to remove a range of lines (2-4) from a .dat file from the command line without opening the file. Someone mentioned using the ex command? Does anyone have any ideas? thanks (6 Replies)
Discussion started by: computersaysno
6 Replies

10. Shell Programming and Scripting

Removing lines within a file

Hi There, I've written a script that processes a data file on our system. Basically the script reads a post code from a list file, looks in the data file for the first occurrence (using grep) and reads the line number. It then tails the data file, with the line number just read, and outputs to a... (3 Replies)
Discussion started by: tookers
3 Replies
Login or Register to Ask a Question