Huge files manipulation


 
# 8  
Old 11-11-2008
Quote:
Originally Posted by Klashxx
...
I tried all the usual methods (awk / sort / uniq / sed / grep ...) but it always ended with the same result (memory core dump).

I'm using large HP-UX servers.
...
IMHO the definitive answer is to install gawk; at least that's the routine advice given over at the HP forums to people facing the same situation. It really helps resolve this issue, and it will also come in very handy in the future. In my experience, sed is able to manipulate huge files on HP-UX 11.00.
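For reference, once gawk is installed, the usual awk idiom should run as-is on large files; a minimal sketch of the deduplication on the first seven pipe-delimited fields (file names are hypothetical):

Code:
# Keep only the first occurrence of each combination of the
# first seven pipe-delimited fields.
gawk -F'|' '!seen[$1 FS $2 FS $3 FS $4 FS $5 FS $6 FS $7]++' huge_file.txt > deduped.txt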

If that's not an option, then Perl will be your friend. In the meantime you can use the awk-to-Perl converter (a2p code.awk) that comes with the Perl installation to get an equivalent Perl solution from awk code. The following script might be helpful for your problem, and I'm sure it can be optimized further, but it'll get the job done:

Code:
#!/usr/bin/perl
# Print only the first occurrence of each line, keyed on the
# first seven pipe-delimited fields -- the Perl equivalent of
# awk -F'|' '!a[$1 FS $2 ... FS $7]++'.

$\  = "\n";   # terminate every print with a newline
$FS = "|";

while (<>) {
    chomp;
    @Fld = split(/\|/, $_, -1);        # -1 keeps trailing empty fields
    $key = join($FS, @Fld[0 .. 6]);    # first seven fields form the key
    print $_ if $a{$key}++ == 0;       # print only on first sighting
}

[ Tested on Cygwin; not near an HP box right now. ]
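A quick sketch of the a2p workflow mentioned above, in case it helps (file names are hypothetical):

Code:
# Convert an existing awk program to Perl, then run the result
# on the big file.
a2p dedup.awk > dedup.pl
chmod +x dedup.pl
./dedup.pl huge_file.txt > deduped.txt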
______________________________________

EDIT: It worked fine on HP-UX, though it took a while: about 4 minutes on a file with ~200,000 lines (~13 MB).

Last edited by rubin; 11-11-2008 at 09:49 PM.. Reason: Tested on HP-UX
# 9  
Old 12-01-2008
Hi, I know this is old, but couldn't you do this?

Code:
#!/usr/bin/ksh
# Read the input file one line at a time, so the whole file never
# has to be held in memory.
while read name
do
    something_to "$name"
done < "$data_to_read"

This processes the file line by line; it's from "Mastering Shell Scripting".
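If the input is the same pipe-delimited data as above, read can also split the fields itself via IFS; a minimal sketch, with a hypothetical file name:

Code:
#!/usr/bin/ksh
# Hypothetical sketch: read pipe-delimited records field by field.
data_to_read=/path/to/huge_file.txt

while IFS='|' read -r f1 f2 f3 rest
do
    # work with the individual fields here
    print "first field: $f1"
done < "$data_to_read"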

Last edited by nj78; 12-01-2008 at 06:16 PM.. Reason: early submit