concatenate lines using shell scripting


Login or Register to Reply

 
Thread Tools Search this Thread
# 1  
concatenate lines using shell scripting

i have a mega file in this format:

a,
b,
c,
d,
=
a2,
b2,
c2,
d2,
=
a3,
b3
=

i want to combine lines until = meet. the result should be :
a,b,c,d,
a2,b2,c3,d2
a3,b3

need your help. thanks
# 2  
You can of course do this via shell scripting, but using sed would be a faster and by far clearer way to do it.

Why do you want to do it with a shell script?

bakunin
# 3  
sorry, i didn't mean that. sed or awk are all good.

---------- Post updated at 01:11 PM ---------- Previous update was at 12:58 PM ----------

would you please show me how to do it in sed or awk? thx
# 5  
perhaps.....bur certainly better....

Code:
awk '{if($0 !~ /=/) ORS=""}{if($0 ~/=/) ORS="\n"} {print}' | sed -e 's/=//g' -e 's/,$//g' file

# 6  
in sed it is easy: we have two types of lines, the ones reading "=" and the others. When we encounter a "="-line, we want to print out what we have so far, minus the newlines. If we encounter one of the other lines we want to store its contents until we encounter a "="-line.

sed has a so-called "hold space", think of it as a variable, where you can store things until you need them. We append everything to this hold space until we encounter a "="-line, then we recall the hold space, filter out all embedded newlines and print it, then start over.

In the following script i have put in comment for your understanding, remove them, because sed doesn't allow inline commenting in scripts. Furthermore, you can put the whole script on one line, replacing linefeeds with semicolons:

Code:
sed -n '/^=/ {                   # if a line starts with "="
          s/.*//                 # delete this lines content
          x                      # exchange the pattern space (empty) and the hold space
          s/\n//g                # delete newlines
          p                      # then print what you have
     }
     /^=/ ! {                    # if a line doesn't start with "="
          H                      # append it to the hold space
     }' /your/file > newfile

for short:

Code:
sed -n '/^=/{s/.*//;x;s/\n//gp};/^=/!{H}' /your/file > newfile

I hope this helps.

bakunin
# 7  
Quote:
Originally Posted by protocomm
perhaps.....bur certainly better....
Did you check the OP requirements Smilie , certainly not.
Login or Register to Reply

|
Thread Tools Search this Thread
Search this Thread:
Advanced Search

More UNIX and Linux Forum Topics You Might Find Helpful
Delete all CONSECUTIVE text lines from file shell scripting
RJSKR28
Hi I have a text file like below. THe content of the text will vary. Entire text file have four consecutive lines followed with blank line. I want to delete the occurrence of the two consicutive lines in the text file. I don't have pattern to match and delete. Just i need to delete all...... Shell Programming and Scripting
5
Shell Programming and Scripting
Shell script to read lines in a text file and filter user data Shell Programming and Scripting
VikrantD
sxsaaas... UNIX for Dummies Questions & Answers
3
UNIX for Dummies Questions & Answers
Concatenate lines in file shell script
systemoper
Hi colleagues, I have a file in this format. "/cccc/pppp/dddd/ggg/prueba.txt". ERROR" THE error bbbbbbbbbb finish rows. "/kkkk/mmmm/hhhh/jjj/ejemplo.txt". ERROR This is other error rows.I need my file in this format. "/cccc/pppp/dddd/ggg/prueba.txt". ERROR" THE error bbbbbbbbbb finish rows....... UNIX for Advanced & Expert Users
3
UNIX for Advanced & Expert Users
searching multiple lines and replacing in shell scripting
anil8103
Hi, I have a file with below contents, ssenthil = rw anilkg = rw I want to search for "ssenthil" and need to delete line 1 and 2 , if the third line starts with "" respectively and blank line immediately and third line starts with " anilkg = rw Please help me . Great day...... Shell Programming and Scripting
5
Shell Programming and Scripting
Concatenate lines between lines starting with a specific pattern
s052866
Hi, I have a file such as: --- >contig00001 length=35524 numreads=2944 gACGCCGCGCGCCGCGGCCAGGGCTGGCCCA CAGGCCGCGCGGCGTCGGCTGGCTGAG >contig00002 length=4242 numreads=43423 ATGCCGAAGGTCCGCCTGGGGCTGG CGCCGGGAGCATGTAGCG --- I would like to concatenate the lines not starting with ">"...... Shell Programming and Scripting
9
Shell Programming and Scripting