10-21-2011
Extraction of strings from a file, after pattern matching
I need to extract strings from a file.
The file contains data like:
Plan ABCD
IN-+-172BB---118C2C---GGN_342-+-MM77_23--+-LAS24_3|GGK_774
   |                          |          \-LAS24_2|GGN_774
   |                          +-AA_800_1-+-BAS_000|GGK_362
   |                          |          \-BAS_001|GGK_360
   |                          \-DD_000T1---DAM_001|STEEL_0
Plan SHELL_1
IN-+-CRCBB---118C2D---FRB_342-+-SS77_23--+-LAS20_1|GGK_734
   |                          +-AB_800_1-+-BAS_001|GGK_332
   |                          |          \-BAS_003|GGK_700
The file shows a sort of tree chart, from which I need a list of all the unique strings:
1. All the strings that follow the keyword Plan (e.g. ABCD, SHELL_1).
2. All the strings enclosed between two - characters (e.g. 118C2C, AB_800_1, FRB_342, 172BB etc.).
3. Separately, all the strings that have a - on one side only and contain a | in the middle (e.g. LAS20_1|GGK_734, BAS_001|GGK_332, BAS_003|GGK_700, LAS24_3|GGK_774 etc.).
It's a long file; can someone help me extract these strings?
I tried two approaches. First, I read the file line by line and used awk to print the fields until NF was reached, but I got an error saying the file could not be opened.
Second, I converted the file into a simple delimited file with sed, but I was still unable to extract these strings.
Last edited by abkush; 10-21-2011 at 08:27 AM..
XSTR(1) General Commands Manual XSTR(1)
NAME
xstr - extract strings from C programs to implement shared strings
SYNOPSIS
xstr [ -c ] [ - ] [ file ]
DESCRIPTION
Xstr maintains a file strings into which strings in component parts of a large program are hashed. These strings are replaced with
references to this common area. This serves to implement shared constant strings, most useful if they are also read-only.
The command
xstr -c name
will extract the strings from the C source in name, replacing string references by expressions of the form (&xstr[number]) for some number.
An appropriate declaration of xstr is prepended to the file. The resulting C text is placed in the file x.c, to then be compiled. The
strings from this file are placed in the strings data base if they are not there already. Repeated strings and strings which are suffixes
of existing strings do not cause changes to the data base.
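The suffix rule above, and the ordering caveat listed under BUGS, can be illustrated with a toy model. This is a Python sketch of a NUL-terminated string pool, not xstr's actual implementation (which hashes strings into the strings file); the class name StringPool and the method name intern are hypothetical.

```python
class StringPool:
    """Toy shared string area: one buffer of NUL-terminated strings."""

    def __init__(self):
        self.data = ""  # all stored strings, each followed by "\0"

    def intern(self, s):
        """Return the offset of s in the pool, appending it only if needed."""
        needle = s + "\0"
        # A hit at the start of an entry is a repeated string; a hit in the
        # middle of an entry means s is a suffix of a string already stored.
        pos = self.data.find(needle)
        if pos == -1:
            pos = len(self.data)
            self.data += needle
        return pos


if __name__ == "__main__":
    pool = StringPool()
    print(pool.intern("hello world"))  # new string, stored at offset 0
    print(pool.intern("world"))        # suffix of an existing string, reuses offset 6
    print(pool.intern("hello world"))  # repeated string, offset 0 again

    # Ordering caveat: if the shorter string is seen first, both are stored.
    other = StringPool()
    other.intern("world")
    other.intern("hello world")
    print(len(other.data))             # 18 characters instead of 12
```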
After all components of a large program have been compiled a file xs.c declaring the common xstr space can be created by a command of the
form
xstr
This xs.c file should then be compiled and loaded with the rest of the program. If possible, the array can be made read-only (shared),
saving space and swap overhead.
Xstr can also be used on a single file. A command
xstr name
creates files x.c and xs.c as before, without using or affecting any strings file in the same directory.
It may be useful to run xstr after the C preprocessor if any macro definitions yield strings or if there is conditional code which contains
strings which may not, in fact, be needed. Xstr reads from its standard input when the argument `-' is given. An appropriate command
sequence for running xstr after the C preprocessor is:
cc -E name.c | xstr -c -
cc -c x.c
mv x.o name.o
Xstr does not touch the file strings unless new items are added, thus make can avoid remaking xs.o unless truly necessary.
FILES
strings Data base of strings
x.c Massaged C source
xs.c C source for definition of array `xstr'
/tmp/xs* Temp file when `xstr name' doesn't touch strings
SEE ALSO
mkstr(1)
BUGS
If a string is a suffix of another string in the data base, but the shorter string is seen first by xstr, both strings will be placed in the
data base, when just placing the longer one there would do.
3rd Berkeley Distribution May 7, 1986 XSTR(1)