Adding tags in between sentences with awk
Post 302934168 by Scrutinizer on Thursday 5th of February 2015 04:42:28 AM
Hi, that shows that the seemingly empty lines in the input files are not really empty but contain some characters, presumably spaces or TABs. Try this instead:
Code:
awk '!NF{$0=x}1' file |  awk '{$1=$1; print "<s>\n" $0 "\t.\n</s>"}' RS=  FS='\n' OFS='\t-\n'

There is still some ambiguity. In the first half there is a trailing dot, in the second half there is a trailing dash.
Also, your samples appear not to be TAB-delimited, contrary to what you say in the description.
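
To make it easier to see what each stage does, here is the same pipeline wrapped in a small function and run on made-up input. The function name `tag_blocks` and the sample text are only illustrations and are not taken from your files; the sample has one "empty" line that actually holds three spaces. This is a sketch of how I read the command, so adjust it to your real data.

```shell
# tag_blocks: hypothetical wrapper around the two awk stages shown above.
# Reads the named files, or standard input when no file is given.
tag_blocks() {
  # Stage 1: a line without any fields (empty, or only spaces/TABs) is
  # replaced by a truly empty line; x is an unset variable, i.e. "".
  awk '!NF{$0=x}1' "$@" |
  # Stage 2: RS= switches to paragraph mode, so every block between empty
  # lines is one record. FS='\n' makes each line a field, and $1=$1
  # rebuilds the record with OFS (TAB, dash, newline) between the lines.
  # The last line of the block gets TAB and dot, then the closing tag.
  awk '{$1=$1; print "<s>\n" $0 "\t.\n</s>"}' RS= FS='\n' OFS='\t-\n'
}

# Made-up sample: two blocks separated by a line containing three spaces.
printf 'the cat\nsat\n   \na dog\nbarked\n' | tag_blocks
```

With this sample you should get two `<s>` ... `</s>` blocks: inside each block every line except the last ends in TAB plus dash, and the last line ends in TAB plus dot. Without the first awk the whitespace-only line would not count as a separator in paragraph mode, and everything would end up in a single block.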

Last edited by Scrutinizer; 02-05-2015 at 07:42 AM..
 
