awk how to


 
Thread Tools Search this Thread
Top Forums UNIX for Advanced & Expert Users awk how to
# 1  
Old 02-08-2010
awk how to

Hello all
I have a file with this type of information:

Code:
data1;data2;data3
data4:data5;data6
data7:data8,data9

As you see, fileds are separated by diferents caracters each time.
I would like to have always the same caracter the first time for exemple :
as this:
Code:
data1:data2;data3
data4:data5;data6
data7:data8,data9

How can I do this, with awk maybe?
Carefull that if there are 2 ; in the same line, I want to change only the first occurence.
Thanks a lot
# 2  
Old 02-08-2010
Try...
Code:
sed 's/\;/\:/' infile

# 3  
Old 02-08-2010
Ok it works.
Can you tell me why the second ; have not been changed?
in this case for exemple
"PRINCETON:PRINCETON UNIVERSITY, PLASMA PHYSICS LABORATORY;1989"

It's exactly what I want, but I do not anderstand why it works !! jaajaj
Cheers

---------- Post updated at 06:34 AM ---------- Previous update was at 06:24 AM ----------

Hello again
all the file have not been changed well, look this line for exemple:

KALSRUHE;KERNFORSCHUNGSZENTRUM KARLSRUHE GMBH:1990

May I pass the sed command twice?
# 4  
Old 02-08-2010
Hi,

As far as i know you should use "g" option (global) to apply the substitution process each lines of your input file:

Code:
sed 's/\;/\:/g' infile

Besides if you need to change more than one field seperator you can do it with "-e" option ( connecting a few SED commands with -e).
# 5  
Old 02-08-2010
Maybe with a real exemple it will be easy to anderstand, sorry...
So part of the file is this:

Code:
MADRID : JUNTA DE ENERGIA NUCLEAR, 1957
MADRID : JUNTA DE ENERGIA NUCLEAR, 1958
MADRID : JUNTA DE ENERGIA NUCLEAR, 1958
MADRI : :CIEMAT, 1988
MADRID : CIEMAT, 1988
MADRID : JUNTA DE ENERGIA NUCLEAR, 1985


As you see, the 3 first lines are good, it's easy to split the sring with excel to obtain 3 colums (that is the final objective)
But the nex one is problematic because we have : :
The idea will be to convert all the FIRST separate caracter, in this case : with an other, never used in the file, such as @ or ##
This way I'll be abble to use it in excel to separate values.

---------- Post updated at 09:38 AM ---------- Previous update was at 09:36 AM ----------

Me again
If I aplly the sed you told me earlier, The result is:
Code:
MADRID ## JUNTA DE ENERGIA NUCLEAR, 1958
MADRID ## JUNTA DE ENERGIA NUCLEAR, 1958
MADRI ## ##CIEMAT, 1988
MADRID ## CIEMAT, 1988
MADRID ## JUNTA DE ENERGIA NUCLEAR, 1985

and again, this line will be impossible to parse by excel: MADRI ## ##CIEMAT, 1988

the correct should be: MADRI ## :CIEMAT, 1988

Last edited by Scott; 02-08-2010 at 10:43 AM.. Reason: Code tags
# 6  
Old 02-08-2010
Code:
awk '{sub(":", "##"); print}' file

# 7  
Old 02-08-2010
Yes it works...
Very good
now last dubt, is it possible to identify a 4 digit field?? (the date in this case)
I ask because in this file, all the dates are separate with , .... but a few (andreds) are separated with : or ;
So the idea will be to identify the date and replace only the separate caracter just before.
Is it possible?
Thanks (this is the last question...)
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk output yields error: awk:can't open job_name (Autosys)

Good evening, Im newbie at unix specially with awk From an scheduler program called Autosys i want to extract some data reading an inputfile that comprises jobs names, then formating the output to columns for example 1. This is the inputfile: $ more MapaRep.txt ds_extra_nikira_usuarios... (18 Replies)
Discussion started by: alexcol
18 Replies

2. Shell Programming and Scripting

Pass awk field to a command line executed within awk

Hi, I am trying to pass awk field to a command line executed within awk (need to convert a timestamp into formatted date). All my attempts failed this far. Here's an example. It works fine with timestamp hard-codded into the command echo "1381653229 something" |awk 'BEGIN{cmd="date -d... (4 Replies)
Discussion started by: tuxer
4 Replies

3. Shell Programming and Scripting

Passing awk variable argument to a script which is being called inside awk

consider the script below sh /opt/hqe/hqapi1-client-5.0.0/bin/hqapi.sh alert list --host=localhost --port=7443 --user=hqadmin --password=hqadmin --secure=true >/tmp/alerts.xml awk -F'' '{for(i=1;i<=NF;i++){ if($i=="Alert id") { if(id!="") if(dt!=""){ cmd="sh someScript.sh... (2 Replies)
Discussion started by: vivek d r
2 Replies

4. Shell Programming and Scripting

HELP with AWK one-liner. Need to employ an If condition inside AWK to check for array variable ?

Hello experts, I'm stuck with this script for three days now. Here's what i need. I need to split a large delimited (,) file into 2 files based on the value present in the last field. Samp: Something.csv bca,adc,asdf,123,12C bca,adc,asdf,123,13C def,adc,asdf,123,12A I need this split... (6 Replies)
Discussion started by: shell_boy23
6 Replies

5. Shell Programming and Scripting

awk command to compare a file with set of files in a directory using 'awk'

Hi, I have a situation to compare one file, say file1.txt with a set of files in directory.The directory contains more than 100 files. To be more precise, the requirement is to compare the first field of file1.txt with the first field in all the files in the directory.The files in the... (10 Replies)
Discussion started by: anandek
10 Replies

6. Shell Programming and Scripting

Comparison and editing of files using awk.(And also a possible bug in awk for loop?)

I have two files which I would like to compare and then manipulate in a way. File1: pictures.txt 1.1 1.3 dance.txt 1.2 1.4 treehouse.txt 1.3 1.5 File2: pictures.txt 1.5 ref2313 1.4 ref2345 1.3 ref5432 1.2 ref4244 dance.txt 1.6 ref2342 1.5 ref2352 1.4 ref0695 1.3 ref5738 1.2... (1 Reply)
Discussion started by: linuxkid
1 Replies

7. Shell Programming and Scripting

Problem with awk awk: program limit exceeded: sprintf buffer size=1020

Hi I have many problems with a script. I have a script that formats a text file but always prints the same error when i try to execute it The code is that: { if (NF==17){ print $0 }else{ fields=NF; all=$0; while... (2 Replies)
Discussion started by: fate
2 Replies

8. Shell Programming and Scripting

awk: assign variable with -v didn't work in awk filter

I want to filter 2nd column = 2 using awk $ cat t 1 2 2 4 $ VAR=2 #variable worked in print $ cat t | awk -v ID=$VAR ' { print ID}' 2 2 # but variable didn't work in awk filter $ cat t | awk -v ID=$VAR '$2~/ID/ { print $0}' (2 Replies)
Discussion started by: honglus
2 Replies

9. Shell Programming and Scripting

scripting/awk help : awk sum output is not comming in regular format. Pls advise.

Hi Experts, I am adding a column of numbers with awk , however not getting correct output: # awk '{sum+=$1} END {print sum}' datafile 2.15291e+06 How can I getthe output like : 2152910 Thank you.. # awk '{sum+=$1} END {print sum}' datafile 2.15079e+06 (3 Replies)
Discussion started by: rveri
3 Replies

10. Shell Programming and Scripting

Awk problem: How to express the single quote(') by using awk print function

Actually I got a list of file end with *.txt I want to use the same command apply to all the *.txt Thus I try to find out the fastest way to write those same command in a script and then want to let them run automatics. For example: I got the file below: file1.txt file2.txt file3.txt... (4 Replies)
Discussion started by: patrick87
4 Replies
Login or Register to Ask a Question