04-11-2011
extract text between two words on a single line
Hi Guys,
Can someone help me with a way to extract the text between two words on a single line?
For example, if the file has the content below, I want to extract all text between b and f, inclusive of b and f. Apparently sed can do this, but it works line by line, and I guess it cannot read word by word.
a b c d e f -> this is single line
Thanks!
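One possible approach, offered as a minimal sketch rather than a definitive answer: sed does read line by line, but a substitution can still keep only part of each line. The function names `extract_chars` and `extract_between` below are hypothetical helpers made up for this illustration. The first matches characters (from the first "b" to the last "f"); the second uses awk to compare whole whitespace-separated words, which is closer to reading "word by word".

```shell
#!/bin/sh
# Character-level version: keep the text from the first "b" to the last "f"
# on each line. Lines without such a span print nothing (-n plus the p flag).
extract_chars() {
    sed -n 's/^[^b]*\(b.*f\).*$/\1/p'
}

# Word-level version: $1 is the start word, $2 is the end word.
# Prints the words from the first occurrence of the start word up to the
# next occurrence of the end word, inclusive. Prints nothing for a line
# where that pair is not found.
extract_between() {
    awk -v first="$1" -v last="$2" '{
        out = ""; on = 0
        for (i = 1; i <= NF; i++) {
            if ($i == first) on = 1
            if (on) out = (out == "" ? $i : out " " $i)
            if (on && $i == last) { print out; break }
        }
    }'
}

# Sample line taken from the question
printf '%s\n' 'a b c d e f' | extract_chars
printf '%s\n' 'a b c d e f' | extract_between b f
```

Both commands print `b c d e f` for the sample line. The sed version would also match a "b" or "f" inside a longer word, so the awk version is the safer choice when the delimiters are real words.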
LEARN ABOUT DEBIAN
text::affixes
Affixes(3pm) User Contributed Perl Documentation Affixes(3pm)
NAME
Text::Affixes - Prefix and suffix analysis of text
SYNOPSIS
use Text::Affixes;
my $text = "Hello, world. Hello, big world.";
my $prefixes = get_prefixes($text);
# $prefixes now holds
# {
#     3 => {
#         'Hel' => 2,
#         'wor' => 2,
#     }
# }
# or
$prefixes = get_prefixes({min => 1, max => 2}, $text);
# $prefixes now holds
# {
#     1 => {
#         'H' => 2,
#         'w' => 2,
#         'b' => 1,
#     },
#     2 => {
#         'He' => 2,
#         'wo' => 2,
#         'bi' => 1,
#     }
# }
# the use for get_suffixes is similar
DESCRIPTION
Provides methods for prefix and suffix analysis of text.
METHODS
get_prefixes
Extracts prefixes from text. You can specify the minimum and maximum number of characters of prefixes you want.
Returns a reference to a hash, where the specified limits are mapped in hashes; each of those hashes maps every prefix in the text into the
number of times it was found.
By default, both minimum and maximum limits are 3. If the minimum limit is greater than the maximum one, an empty hash is returned.
A prefix is considered to be a sequence of word characters (\w) at the beginning of a word (that is, after a word boundary) that does not
reach the end of the word ("regular expressionly" speaking, a prefix is the $1 of /(\w+)\w/).
# extracting prefixes of size 3
$prefixes = get_prefixes( $text );
# extracting prefixes of sizes 2 and 3
$prefixes = get_prefixes( {min => 2}, $text );
# extracting prefixes of sizes 3 and 4
$prefixes = get_prefixes( {max => 4}, $text );
# extracting prefixes of sizes 2, 3 and 4
$prefixes = get_prefixes( {min => 2, max=> 4}, $text);
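The prefix definition above can be approximated outside Perl as well. The following is only a rough sketch and not how Text::Affixes is implemented: `count_prefixes` is a hypothetical helper, it handles a single prefix length, and it assumes that words consist of letters only (digits and underscores are treated as separators).

```shell
#!/bin/sh
# count_prefixes N: read text on stdin and print "prefix count" pairs for
# prefixes of exactly N characters. A word contributes a prefix only if it
# is longer than N, so that the prefix does not reach the end of the word.
count_prefixes() {
    tr -cs '[:alpha:]' '\n' |
    awk -v n="$1" '
        length($0) > n { count[substr($0, 1, n)]++ }
        END { for (p in count) print p, count[p] }
    ' | LC_ALL=C sort
}

# Same sample text as in the SYNOPSIS
printf '%s' 'Hello, world. Hello, big world.' | count_prefixes 3
```

For the sample text this prints `Hel 2` and `wor 2`, matching the counts shown in the SYNOPSIS; "big" is skipped because a three-character prefix would reach the end of the word.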
get_suffixes
The get_suffixes function is similar to the get_prefixes one. You should read the documentation for that one and then come back to this
point.
A suffix is considered to be a sequence of word characters (\w) at the end of a word (that is, before a word boundary) that does not start
at the beginning of the word ("regular expressionly" speaking, a suffix is the $1 of /\w(\w+)/).
# extracting suffixes of size 3
$suffixes = get_suffixes( $text );
# extracting suffixes of sizes 2 and 3
$suffixes = get_suffixes( {min => 2}, $text );
# extracting suffixes of sizes 3 and 4
$suffixes = get_suffixes( {max => 4}, $text );
# extracting suffixes of sizes 2, 3 and 4
$suffixes = get_suffixes( {min => 2, max=> 4}, $text);
OPTIONS
Apart from deciding on a minimum and maximum size for prefixes or suffixes, you can also decide on some configuration options.
exclude_numbers
Set to 0 if you consider numbers as part of words. Default value is 1.
# this
get_suffixes( {min => 1, max => 1, exclude_numbers => 0}, "Hello, but w8" );
# returns this:
{
    1 => {
        'o' => 1,
        't' => 1,
        '8' => 1
    }
}
lowercase
Set to 1 to extract all prefixes in lowercase mode. Default value is 0.
ATTENTION: This does not mean that prefixes with uppercased characters won't be extracted. It means they will be extracted after being
lowercased.
# this...
get_prefixes( {min => 2, max => 2, lowercase => 1}, "Hello, hello");
# returns this:
{
    2 => {
        'he' => 2
    }
}
TO DO
o Make it more efficient (use C for that)
AUTHOR
Jose Castro, "<cog@cpan.org>"
COPYRIGHT & LICENSE
Copyright 2004 Jose Castro, All Rights Reserved.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
perl v5.10.0 2005-11-19 Affixes(3pm)