05-28-2007
Copying Text between two unique text patterns
Dear Colleagues:
I have .rtf files of a collection of newspaper articles. Each newspaper article starts with a variation of the phrase "Document * of 20" and is separated from the next article with the character string "==================="
I would like to be able to take the text composing each news article from between these two patterns and dump them into separate, uniquely named files. I've been playing around with SED, grep, cut and csplit, but nothing seems to be working. I have the regular expressions developed to capture the two lines "Document * of 20" and "--------" independently, but I can't figure out how to capture and play with the text between the two lines. I hope you can help.
Yours,
Simon J. Kiss
Queen's University
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
I have a file with 14million lines and I would like to extract all the unique lines from the file into another text file.
For example:
Contents of file1
happy
sad
smile
happy
funny
sad
I want to run a command against file one that only returns the unique lines (ie 1 line for happy... (3 Replies)
Discussion started by: soliberus
3 Replies
2. Shell Programming and Scripting
Hi all! Im trying to extract a portion of text from a file and put it into a new file. I need all the lines between <Placement> and </Placement> including the Placemark lines themselves. Is there a way to extract all instances of these and not just the first one found? I've tried using sed and... (4 Replies)
Discussion started by: Grizzly
4 Replies
3. Shell Programming and Scripting
I'm attempting to write a script to identify users who have sudo access on a server. I only want to extract the ID's of the sudo users after a unique line of text. The list of sudo users goes to the EOF so I only need the script to start after the unique line of text. I already have a script to... (1 Reply)
Discussion started by: bouncer
1 Replies
4. UNIX for Advanced & Expert Users
Is there a trick for copying from something like a powerpoint into vi? Every time I try to copy text from something like a powerpoint to vi my spacing gets messed up. I think it has something to do with my .vimrc file. When I renamed it was able to copy it in just fine so can someone please... (2 Replies)
Discussion started by: cokedude
2 Replies
5. UNIX for Dummies Questions & Answers
Hi All,
I'm hoping this is an easy question, but I'm having a weird problem trying to simply copy and paste text from MS Windows (XP) Notepad and then pasting into vi or vim in AIX. When I type "oslevel" I get "5.3.0.0". The problem is that once the text is pasted, there are sections of text... (2 Replies)
Discussion started by: PlainInverted
2 Replies
6. Shell Programming and Scripting
I would like to replace ], with ]]], between /* SECTION2-BEGIN */ and /* SECTION2-END */ in my file. My file contains the following information:
/* SECTION1-BEGIN */
,
/* SECTION1-END */
/* SECTION2-BEGIN */
,
/* SECTION2-END */
/*... (5 Replies)
Discussion started by: azdps
5 Replies
7. Shell Programming and Scripting
Hi All,
I want to extract the text between some pattern which occurs repeatedly in a file. For example my input is like,
/home/.....
..........java:25: cannot find symbol
............
/home/......
/home/.......
I want to display... (2 Replies)
Discussion started by: Vignesh58
2 Replies
8. Shell Programming and Scripting
I need to filter the text in between two patterns and output that to a different file. Please help me how to do it.
Ex:
.............
<some random text>
.............
Pattern_1
<Few lines that need to be output to different file>
Pattern_2
................
...............
<more text in... (4 Replies)
Discussion started by: metturr
4 Replies
9. Shell Programming and Scripting
awk -F "" '/<TestName>|<testname>|<Offerer>|<offerer>|<Line1>|<line1>|<City>|<city>|<State>|<state>/ {print $2, $3}' OFS='\t' UBE3A.xml > UBE3A.txt
Is it possible to use the code above to search for a pattern that is non-unique?
For example, if I wanted to capture the<MethodList>|<string>... (6 Replies)
Discussion started by: cmccabe
6 Replies
10. Shell Programming and Scripting
I am trying to use awk to print the unique entries in $2
So in the example below there are 3 lines but 2 of the lines match in $2 so only one is used in the output.
File.txt
chr17:29667512-29667673 NF1:exon.1;NF1:exon.2;NF1:exon.38;NF1:exon.4;NF1:exon.46;NF1:exon.47 703.807... (5 Replies)
Discussion started by: cmccabe
5 Replies
NEWS(1) USER COMMANDS NEWS(1)
NAME
news - display system news
SYNOPSIS
news [-adDeflnpvxs] [[article1] [article2] ..]
DESCRIPTION
The news command keeps you informed of news concerning the system. Each news item is contained in a separate file in the /var/lib/sysnews
directory. Anyone having write permission to this directory can create a news file.
If you run the news command without any flags, it displays every unread file in the /var/lib/sysnews directory.
Each file is preceded by an appropriate header. To avoid reporting old news, the news command stores a currency time. The news command con-
siders your currency time to be the date the $HOME/.news_time file was last modified. Each time you read the news, the modification time of
this file changes to that of the reading. Only news item files posted after this time are considered unread.
OPTIONS
-a, --all
Display all news, also the already read news.
-d, --datestamp
Add a date stamp to each article name printed. this can only be used with the -nl flags.
-D, --datefmt <fmt>
Specify a date format, see the strftime(3) man page for more details. the default format is (%b %d %Y)
-f, --newsdir <dir>
Read news from an alternate newsdir.
-l, --oneperline
One article name per line.
-n, --names
Only show the names of news articles.
-p, --page
Pipe articles through $PAGER or more(1) if the $PAGER environment variable is not set.
-s, --articles
Reports the number of news articles.
MAINTAINER OPTIONS
-e, --expire #
Expire news older than # days.
-x, --exclude a,b,c
A comma separated list of articles which may not be expired. if a file named .noexpire exists in the /var/lib/sysnews direcory,
filenames are read from it also. names in this file may be comma separated, and/or one per line.
AUTHOR
Charles, <int@link.xs4all.nl>
Linux 18 January 1995 NEWS(1)