12-14-2009
hiiiiii Friends..Thier is a error occuring..Its not giving correct output for my huge data.. Check this type of file out..
I hav a file like
a.dat:
HTML Code:
ISC 1976 8 12 23 26 47.09 26.6967 97.0421 31.0 326 6.20 79 0.00 5.90 6.10 0 0.00 6.20 0 7.99e+2
PDE 1976 8 12 23 26 47.09 26.6967 97.0421 31.0 326 6.40 79 0.00 5.90 6.10 0 0.00 6.40 0 7.99e+2
HFS 1984 5 6 15 18 20.00 18.9000 99.2000 0.0 0 6.00 0 0.00 0.00 0.00 0 0.00 6.00 0 NULL
ISC 1984 5 6 15 19 11.32 24.2152 93.5256 32.0 480 5.70 85 0.00 6.00 5.60 14 5.80 6.00 0 1.19e+25
MOS 1984 5 6 15 19 11.32 24.2152 93.5256 32.0 480 6.20 85 0.00 6.00 5.60 14 5.60 6.20 0 1.19e+25
NAO 1984 5 6 15 19 11.32 24.2152 93.5256 32.0 480 5.60 85 0.00 6.00 5.60 14 0.00 6.00 0 1.19e+25
ISC 1986 11 1 5 45 4.82 27.1726 96.3983 82.0 9 4.10 2 0.00 0.00 0.00 0 0.00 4.10 0 NULL
MOS 1986 11 1 5 2 40.27 26.8483 96.3965 11.0 335 5.60 68 0.00 5.20 5.00 5 5.00 5.60 0 6.96e+2
NAO 1986 11 1 5 2 40.27 26.8483 96.3965 11.0 335 5.10 68 0.00 5.20 5.00 5 0.00 5.20 0 6.96e+23
NDI 1986 11 1 5 2 40.27 26.8483 96.3965 11.0 335 5.30 68 0.00 5.20 5.00 5 0.00 5.30 0 6.96e+23
HFS 1988 2 6 14 50 45.38 24.6677 91.5619 33.0 496 6.20 89 6.10 5.80 5.80 20 5.90 6.20 0 6.7e+2
ISC 1988 2 6 14 50 45.38 24.6677 91.5619 33.0 496 5.80 89 0.00 5.80 5.80 20 5.80 5.80 0 6.7e+24
MOS 1988 2 6 14 50 45.38 24.6677 91.5619 33.0 496 6.10 89 0.00 5.80 5.80 20 5.70 6.20 0 6.7e+24
THe output i must get is
b.dat:
HTML Code:
PDE 1976 8 12 23 26 47.09 26.6967 97.0421 31.0 326 6.40 79 0.00 5.90 6.10 0 0.00 6.40 0 7.99e+2
MOS 1984 5 6 15 19 11.32 24.2152 93.5256 32.0 480 6.20 85 0.00 6.00 5.60 14 5.60 6.20 0 1.19e+25
MOS 1986 11 1 5 2 40.27 26.8483 96.3965 11.0 335 5.60 68 0.00 5.20 5.00 5 5.00 5.60 0 6.96e+2
HFS 1988 2 6 14 50 45.38 24.6677 91.5619 33.0 496 6.20 89 6.10 5.80 5.80 20 5.90 6.20 0 6.7e+2
It must check for 2,3,4,5, columns to same & remain the duplicates based on the longest row with values & largest 19th column ..
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
It is my first post, hoping to get help from the forum.
In a directory, I have 5000 multiple files that contains around 4000 rows with 10 columns in each file containing a unique string 'AT' located at 4th column.
OM 3328 O BT 268 5.800 7.500 4.700 0.000 ... (9 Replies)
Discussion started by: asanjuan
9 Replies
2. Shell Programming and Scripting
Hi,
I want to print column value based on row number say multiple of 8.
Input file:
line 1 67 34
line 2 45 57
. . .
. . .
line 8 12 46
. . .
. . .
line 16 24 90
. . .
. . .
line 24 49 67
Output
46
90
67 (2 Replies)
Discussion started by: Surabhi_so_mh
2 Replies
3. Shell Programming and Scripting
I am a newbie to shell scripting ..
I have a .csv file. It has 1000 some rows and about 7 columns...
but before I insert this data to a table I have to parse it and clean it ..basing on the value of the first column..which a string of phone number type...
example below..
column 1 ... (2 Replies)
Discussion started by: mitr
2 Replies
4. Shell Programming and Scripting
Given a file such as this I need to remove the duplicates.
00060011 PAUL BOWSTEIN ad_waq3_921_20100826_010517.txt
00060011 PAUL BOWSTEIN ad_waq3_921_20100827_010528.txt
0624-01 RUT CORPORATION ad_sade3_10_20100827_010528.txt
0624-01 RUT CORPORATION ... (13 Replies)
Discussion started by: script_op2a
13 Replies
5. Shell Programming and Scripting
cat file1.txt
field1 "user1":
field2:"data-cde"
field3:"data-pqr"
field4:"data-mno"
field1 "user1":
field2:"data-dcb"
field3:"data-mxz"
field4:"data-zul"
field1 "user2":
field2:"data-cqz"
field3:"data-xoq"
field4:"data-pos"
Now i need to have the date like below.
i have just... (7 Replies)
Discussion started by: ckaramsetty
7 Replies
6. Shell Programming and Scripting
Hi,
I have a file which consists of two columns but the first one can be varying in length like
123456789 0abcd
123456789 0abcd
4015 0 0abcd
5000 0abcd
I want to go through the file reading each line, count the number of characters in the first column and delete... (2 Replies)
Discussion started by: swasid
2 Replies
7. Shell Programming and Scripting
Hi all
I have following kind of input file
ESR1 PA156 leflunomide PA450192 leflunomide
CHST3 PA26503 docetaxel Pa4586; thalidomide Pa34958; decetaxel docetaxel docetaxel
I want to remove duplicates and I want to separate anything before and after PAxxxx entry into columns or... (1 Reply)
Discussion started by: manigrover
1 Replies
8. Shell Programming and Scripting
Dear All,
I have input like this,
J_15TEST_ASH05_33A22.13885.txt: $$ 1 MAKE SP1501 1 1 4 6101 7392 2 2442 2685 18 3201 4008 20 120 4158
J_15TEST_ASH05_33A22.13885.txt: $$ 1 MAKE SP1502 1 1 4 5125 6416 2 ... (4 Replies)
Discussion started by: attila
4 Replies
9. Shell Programming and Scripting
I am trying to see if I can use awk to remove duplicates from a file. This is the file:
-==> Listvol <==
deleting /vol/eng_rmd_0941
deleting /vol/eng_rmd_0943
deleting /vol/eng_rmd_0943
deleting /vol/eng_rmd_1006
deleting /vol/eng_rmd_1012
rearrange /vol/eng_rmd_0943
... (6 Replies)
Discussion started by: newbie2010
6 Replies
10. Shell Programming and Scripting
Dear all,
How can I remove duplicated column in a text file?
Input:
LG10_PM_map_19_LEnd 1000560 G AA AA AA AA AA GG
LG10_PM_map_19_LEnd 1005621 G GG GG GG AA AA GG
LG10_PM_map_19_LEnd 1011214 A AA AA AA AA GG GG
LG10_PM_map_19_LEnd 1011673 T TT TT TT TT CC CC... (1 Reply)
Discussion started by: huiyee1
1 Replies
LEARN ABOUT REDHAT
dlaqgb
DLAQGB(l) ) DLAQGB(l)
NAME
DLAQGB - equilibrate a general M by N band matrix A with KL subdiagonals and KU superdiagonals using the row and scaling factors in the
vectors R and C
SYNOPSIS
SUBROUTINE DLAQGB( M, N, KL, KU, AB, LDAB, R, C, ROWCND, COLCND, AMAX, EQUED )
CHARACTER EQUED
INTEGER KL, KU, LDAB, M, N
DOUBLE PRECISION AMAX, COLCND, ROWCND
DOUBLE PRECISION AB( LDAB, * ), C( * ), R( * )
PURPOSE
DLAQGB equilibrates a general M by N band matrix A with KL subdiagonals and KU superdiagonals using the row and scaling factors in the vec-
tors R and C.
ARGUMENTS
M (input) INTEGER
The number of rows of the matrix A. M >= 0.
N (input) INTEGER
The number of columns of the matrix A. N >= 0.
KL (input) INTEGER
The number of subdiagonals within the band of A. KL >= 0.
KU (input) INTEGER
The number of superdiagonals within the band of A. KU >= 0.
AB (input/output) DOUBLE PRECISION array, dimension (LDAB,N)
On entry, the matrix A in band storage, in rows 1 to KL+KU+1. The j-th column of A is stored in the j-th column of the array AB as
follows: AB(ku+1+i-j,j) = A(i,j) for max(1,j-ku)<=i<=min(m,j+kl)
On exit, the equilibrated matrix, in the same storage format as A. See EQUED for the form of the equilibrated matrix.
LDAB (input) INTEGER
The leading dimension of the array AB. LDA >= KL+KU+1.
R (output) DOUBLE PRECISION array, dimension (M)
The row scale factors for A.
C (output) DOUBLE PRECISION array, dimension (N)
The column scale factors for A.
ROWCND (output) DOUBLE PRECISION
Ratio of the smallest R(i) to the largest R(i).
COLCND (output) DOUBLE PRECISION
Ratio of the smallest C(i) to the largest C(i).
AMAX (input) DOUBLE PRECISION
Absolute value of largest matrix entry.
EQUED (output) CHARACTER*1
Specifies the form of equilibration that was done. = 'N': No equilibration
= 'R': Row equilibration, i.e., A has been premultiplied by diag(R). = 'C': Column equilibration, i.e., A has been postmulti-
plied by diag(C). = 'B': Both row and column equilibration, i.e., A has been replaced by diag(R) * A * diag(C).
PARAMETERS
THRESH is a threshold value used to decide if row or column scaling should be done based on the ratio of the row or column scaling factors.
If ROWCND < THRESH, row scaling is done, and if COLCND < THRESH, column scaling is done.
LARGE and SMALL are threshold values used to decide if row scaling should be done based on the absolute size of the largest matrix element.
If AMAX > LARGE or AMAX < SMALL, row scaling is done.
LAPACK version 3.0 15 June 2000 DLAQGB(l)