Remove multiline text between brackets Post: 302879334

Sponsored Content

Top Forums Shell Programming and Scripting Remove multiline text between brackets Post 302879334 by climatron on Wednesday 11th of December 2013 11:26:35 PM

12-12-2013

Registered User

The awk answer to this problem

This one intrigued me because I'm more of an awk person than a perl person...

Code:

awk '{print $2} NR==1' RS={ FS="} " ORS= file

I wasn't familiar with RS, FS and ORS as built in variables and now realize that's some very powerful stuff there. (by the way, I don't know how I made it so many years never knowing about FNR... argh!!).

To explain your example, RS is the record separator, which is how a line is defined. Normally I think the default RS is "\n" i.e. the newline. FS is the field separator and normally that's whitespace. ORS is output record separator.

In other words, define a new line by finding {. Break the "line" into two fields separated by "} " (note space after brackets... So "my big}date yesterday" is not a match). The first field is what's inside the {}, the second is what comes after. Print the second field (printing the first field would only show what was inside the brackets). Make the output record separator nothing (otherwise it would insert line breaks any time there's a {}. One downside of this though is that if the line break is within the {}, that break disappears. So

Yesterday we ate {some
giant} cake and
drank tea

becomes

Yesterday we ate cake and
drank tea

which is not the end of the world I suppose.

Great answer, Scrutinizer!

Last edited by Franklin52; 12-12-2013 at 03:39 AM.. Reason: Please use code tags

climatron

View Public Profile for climatron

Find all posts by climatron

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Delete text between square brackets and also delete those square brackets using sed or awk

Hi All, I have a text file which looks like this: computer programming systems engineering I want to get rid of these square brackets and also the text that is inside these brackets. So that my final text file looks like this: computer programming systems engineering I am using...

2. Shell Programming and Scripting

Remove text between brackets

How can I use bash to remove all text between "<" and ">" in a file?

3. Shell Programming and Scripting

Remove brackets { } in the data

Hello folks, I have a data file in which each line has 54 numbers, and every 3 numbers are bracketed. So totally 18 pairs of brackets in each line. A typical line is like: {29.187000274658203 -16.148000717163086 -0.9380000233650208} {30.63800048828125 -15.977999687194824...

4. Shell Programming and Scripting

remove brackets and put it in a column and remove repeated entry

Hi all, I want to remove the remove bracket sign ( ) and put in the separate column I also want to remove the repeated entry like in first row in below input (PA156) is repeated ESR1 (PA156) leflunomide (PA450192) (PA156) leflunomide (PA450192) CHST3 (PA26503) docetaxel...

5. Shell Programming and Scripting

Remove brackets repeats and separate in columns

6. Shell Programming and Scripting

Remove whatever is mention in brackets

Hi all My previous question was complicated let me simplify it I have to just remove whatever is present in bracket () along with brackets ERCC1 (PA155) Platinum compounds (PA164713176) Allele A is not associated with response to Platinum compounds in women with Ovarian Neoplasms as...

7. Shell Programming and Scripting

Remove everything inside of brackets

I need to use something bash related to remove everything inside of brackets. For example. In the following: abc<def>ghi<jkl>mno the result should be: abcghimno

8. Shell Programming and Scripting

How to remove multiline HTML tags from a file?

I am trying to remove a multiline HTML tag and its contents from a few HTML files following the same basic pattern. So far using regex and sed have been unsuccessful. The HTML has a basic structure like this (with the normal HTML stuff around it): <div id="div1"> <div class="div2"> <other...

9. Shell Programming and Scripting

Remove lines from multiline json

Hi , When extracting the data from API end point ,its giving multi line json .I want to remove certain lines with group": "tag" or tget and respect "item" values python test.py /data{" Id":" 7554317""group":"get", "item":"xx5e1"],"fields":} { "time": 1520460953, "...

10. Shell Programming and Scripting

How to remove the text between all curly brackets from text file?

Hello experts, I have a text file with lot of curly brackets (both opening { & closing } ). I need to delete them alongwith the text between opening & closing brackets' pair. For ex: Input:- 59. Rh1 Qe4 {(Qf5-e4 Qd8-g8+ Kg6-f5 Qg8-h7+ Kf5-e5 Qh7-e7+ Ke5-f5 Qe7-d7+ Qe4-e6 Qd7-h7+ Qe6-g6...

LEARN ABOUT MINIX

join

JOIN(1) 						      General Commands Manual							   JOIN(1)

NAME

       join - relational database operator

SYNOPSIS

       join [-an] [-e s] [-o list] [-tc] file1 file2

DESCRIPTION

       Join  forms,  on the standard output, a join of the two relations specified by the lines of file1 and file2.  If file1 is `-', the standard
       input is used.

       File1 and file2 must be sorted in increasing ASCII collating sequence on the fields on which they are to be joined, normally the  first	in
       each line.

       There  is  one line in the output for each pair of lines in file1 and file2 that have identical join fields.  The output line normally con-
       sists of the common field, then the rest of the line from file1, then the rest of the line from file2.

       Fields are normally separated by blank, tab or newline.	In this case, multiple separators count as one, and leading  separators  are  dis-
       carded.

       These options are recognized:

       -an    In addition to the normal output, produce a line for each unpairable line in file n, where n is 1 or 2.

       -e s   Replace empty output fields by string s.

       -o list
	      Each output line comprises the fields specified in list, each element of which has the form n.m, where n is a file number and m is a
	      field number.

       -tc    Use character c as a separator (tab character).  Every appearance of c in a line is significant.

SEE ALSO

       sort(1), comm(1), awk(1).

BUGS

       With default field separation, the collating sequence is that of sort -b; with -t, the sequence is that of a plain sort.

       The conventions of join, sort, comm, uniq, look and awk(1) are wildly incongruous.

7th Edition							  April 29, 1985							   JOIN(1)