01-29-2009
zaxxon..
i went with your suggestion and it worked great...
but the more i dug into the data, the more i saw how crappy it was..
lots of leading and trailing whitespace on most of the fields, so i ended up using this approach..
awk 'BEGIN{FS=OFS="\t"};{ for (i=NF; i>0; i--) gsub(/^[ \t]+|[ \t]+$/, "",$i); print}' < SALES_data.dat> SALES_data_cleansed.dat
thanks for the help
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I have many messages such as the test message below:
00:00000:00021:2002/05/13 13:57:00.51 ERROR:- Test error, my test error!!!
I am writing a script in which I need to get everything from the word "ERROR:-" onwards.
I normally use awk for these things, but I am not an expert at it so i am... (6 Replies)
Discussion started by: baileyr1
6 Replies
2. Shell Programming and Scripting
Hi,
I know sed is stream text editor and not a bit more than that. Can anyone explain its usage and advantages?
How is awk different from sed?
I donno i am a bit confused about it. But i have coded in awk and shell.
Thanks,
Nisha
:confused: (7 Replies)
Discussion started by: Nisha
7 Replies
3. UNIX for Advanced & Expert Users
Hey all,
Can I put sed command inside the awk action ?? If not then can i do grep in the awk action ??
For ex:
awk '$1=="174" { ppid=($2) ; sed -n '/$ppid/p' tempfind.txt ; }' tempfind.txt
Assume: 174 is string.
Assume: tempfind.txt is used for awk and sed both.
tempfind.txt... (11 Replies)
Discussion started by: varungupta
11 Replies
4. UNIX for Advanced & Expert Users
Hi,
I have a data file with 5 columns - like this:
"20080401 09:43:08.770798 +0100s","TEST 1","R 1","A TEST","Nov 27 2007","1"
"20080401 09:43:08.770798 +0100s","THIS IS A TEST","R 2","B TEST","Nov 30 2007","10"
"20080401 09:43:08.770798 +0100s","ANOTHER TEST","R 3","B TEST","Nov 05... (7 Replies)
Discussion started by: MrG-San
7 Replies
5. UNIX for Dummies Questions & Answers
I've got an inventory database with eight columns with things like product name, manufacturer, UPC code, etc. on each line. Our PO (purchase order) number is in the first column. I can grep the date and get the full line of data but I would like to strip out everything but the PO number in the... (5 Replies)
Discussion started by: NetJones
5 Replies
6. Shell Programming and Scripting
What if I wanted to add a word such as IT after the first character and if theres 3 characters, after the 2nd character?
output would be:
G, it H
G, H it P
G, H, P it L
I'm thinking that AWK would be the easiest way to do this... Currently looking it up.
Right now I'm using awk but I... (13 Replies)
Discussion started by: puttster
13 Replies
7. Shell Programming and Scripting
Hi All,
Is there a way of comparing two columns in the same file and deleting the row if the values of the columns match.
I have the sample data file as below.
M024900|175309.00|968.00|17
M025001|19861.79|97.90|148
M025002|431.70|159.00|3
M025003|912.30|159.90|6 ... (6 Replies)
Discussion started by: nua7
6 Replies
8. UNIX for Dummies Questions & Answers
I have a file that contain the data below:
B1
1
2
3
B2
20
30
40
B3
7
8
B4
100
B5
21
22
23How can I retrieve the data for B1 into a seperate file. (8 Replies)
Discussion started by: bobo
8 Replies
9. Shell Programming and Scripting
Dear Geeks,
I want to manipulate a file with certain modifications for that using sed or AWK how to do this process for one file i have this type of data.
Input File:
"Restricted and Reserved names .ANISH",3798,"TEST.CO",1201208,6/16/10 0:00,6/16/13 0:00,,,"CO","2nd"^M
"Restricted and... (4 Replies)
Discussion started by: anishkumarv
4 Replies
10. Shell Programming and Scripting
Hi,
I am running a script sample.sh in bash environment .In the script i am using sed and awk commands which when executed individually from terminal they are getting executed normally but when i give these sed and awk commands in the script it is giving the below errors :-
./sample.sh: line... (12 Replies)
Discussion started by: satishmallidi
12 Replies
LEARN ABOUT DEBIAN
unknown
UNKNOWN(1) General Commands Manual UNKNOWN(1)
NAME
unknown - identify possible genotypes for unknowns
SYNOPSIS
A program to rapidly identify which genotypes are possible for individuals typed as unknowns in the input pedigree.
unknown [ -cl ]
DESCRIPTION
unknown infers possible genotypes and mating combinations for parents with unknown genotypes for ilink(1), mlink(1) and linkmap(1).
OPTIONS
-c Use conditional allele frequencies.
-l Choose a good set of loop breakers automatically.
RETURN VALUE
0 Successful completion
ERRORS
10 File not found
255 Failure
EXAMPLES
Normally, unknown(1) is run immediately prior to its sister programs, ilink(1), mlink(1) and linkmap(1), like this:
unknown
mlink
FILES
unknown(1) reads the two files pedfile.dat and datafile.dat as its own input and produces various temporary files that are used as input to
the next program. These temporary files are ipedfile.dat, upedfile.dat, speedfile.dat and newspeedfile.dat.
NOTES
unknown(1) is part of the FASTLINK package, which is a re-implementation of the LINKAGE suite of computer tools that help investigate
genetic linkage as first proposed G.M. Lathrop, J.M. Lalouel, C. Julier, and J. Ott.
AUTHORS
Dylan Cooper, Alejandro Schaffer, and Tony Schurtz based on work originally by Jurg Ott, Ph.D, et. al.
This manual page was written by Elizabeth Barham <lizzy@soggytrousers.net> for the Debian GNU/Linux system (but may be used by others).
WORD-WIDE-WEB
http://www.ncbi.nlm.nih.gov/CBBResearch/Schaffer/fastlink.html
SEE ALSO
ilink(1), linkmap(1), lodscore(1), mlink(1).
April 15, 2003 UNKNOWN(1)