11-02-2012
If you load one file into suitable associative arrays and then update them from the second file, you can write the updated data back out as a normalized file. Deletes fall out of how updates are handled: each new section replaces the whole corresponding old section. You build the string key for the associative array just like the prefix needed for the comm solution. Think of a system that appears to have just one disk with just one big directory, where / is just another file name character. It works OK because that directory is a hash container with random access by key.
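A minimal sketch of the associative-array approach, assuming an INI-like layout where a line such as `[name]` opens a section (the thread does not show the real format). The function name `merge_sections` and the file arguments are hypothetical. Sections that appear only in the old file are kept, and any section that appears in the new file replaces the old one whole.

```shell
# Hypothetical helper: merge_sections OLD NEW
# Assumes an INI-like layout in which a line such as "[name]" opens a section;
# the real file format is not shown in the thread.
merge_sections() {
    awk '
        FNR == 1 { sec = "" }                       # new input file: no open section yet
        /^\[.*\]$/ {                                # section header: the header text is the key
            sec = $0
            if (!(sec in body)) order[++n] = sec    # remember first-seen order for output
            body[sec] = ""                          # a repeated section replaces the old one whole
            next
        }
        sec != "" { body[sec] = body[sec] $0 "\n" } # collect lines under the open section
        END {
            for (i = 1; i <= n; i++)
                printf "%s\n%s", order[i], body[order[i]]
        }
    ' "$1" "$2"
}
```

Called as `merge_sections old.conf new.conf > merged.conf`, this prints every section once, in first-seen order. Lines that come before the first header are ignored in this sketch.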
Processing the data into a working form with a hierarchy prefix on every line means you can sort and comm the lines, but as I say, if the order is important, you need to number the lines at that level of the prefix. After processing the changes, you can reverse the process to produce a normalized output file with its hierarchy restored. (The funny thing about communication protocols is that each protocol essentially adds more prefixes to the message, like HTML in HTTP in TCP in IP in Ethernet.)
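The prefix-then-comm idea might look something like the sketch below. It assumes the same INI-like `[name]` section headers as an illustration, and the helper names `flatten`, `changed_lines` and `unflatten` are made up for this example. Each data line gets its section as a tab-separated prefix, the two flattened files are sorted and compared with comm, and the result is turned back into sectioned form.

```shell
# flatten FILE: put the enclosing section header in front of every data line,
# separated by a tab, so each line carries its own place in the hierarchy.
flatten() {
    awk '
        /^\[.*\]$/ { sec = $0; next }
        { printf "%s\t%s\n", sec, $0 }
    ' "$1"
}

# changed_lines OLD NEW: prefixed lines that occur only in NEW.
# The body runs in a subshell so the work directory variable stays local.
changed_lines() (
    work=$(mktemp -d) || exit 1
    flatten "$1" | LC_ALL=C sort > "$work/old"
    flatten "$2" | LC_ALL=C sort > "$work/new"
    LC_ALL=C comm -13 "$work/old" "$work/new"   # drop columns 1 and 3, keep NEW-only lines
    rm -rf "$work"
)

# unflatten: read prefixed lines on stdin and rebuild the sectioned layout,
# printing a header whenever the prefix changes.
unflatten() {
    awk -F '\t' '
        $1 != sec { sec = $1; print sec }
        { print substr($0, length($1) + 2) }
    '
}
```

For example, `changed_lines old.conf new.conf | unflatten` would list the added or altered lines under their section headers. Sorting keeps sections together but loses the original order inside a section; if that order matters, a zero-padded counter placed after the section prefix in `flatten` would be the numbering described above.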
Last edited by DGPickett; 11-02-2012 at 01:16 PM..
JOIN(1) General Commands Manual JOIN(1)
NAME
join - relational database operator
SYNOPSIS
join [ options ] file1 file2
DESCRIPTION
Join forms, on the standard output, a join of the two relations specified by the lines of file1 and file2. If file1 is `-', the standard
input is used.
File1 and file2 must be sorted in increasing ASCII collating sequence on the fields on which they are to be joined, normally the first in
each line.
There is one line in the output for each pair of lines in file1 and file2 that have identical join fields. The output line normally
consists of the common field, then the rest of the line from file1, then the rest of the line from file2.
Fields are normally separated by blank, tab or newline. In this case, multiple separators count as one, and leading separators are
discarded.
These options are recognized:
-an In addition to the normal output, produce a line for each unpairable line in file n, where n is 1 or 2.
-e s Replace empty output fields by string s.
-jn m Join on the mth field of file n. If n is missing, use the mth field in each file.
-o list
Each output line comprises the fields specified in list, each element of which has the form n.m, where n is a file number and m is a
field number.
-tc Use character c as a separator (tab character). Every appearance of c in a line is significant.
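As an illustration of the -an option, here is a small sketch that sorts both inputs on the first field and then joins them, keeping unpairable lines from file1. The helper name `join_keep_left` is hypothetical, and the `sort -k 1b,1` form is an assumption borrowed from current sort implementations rather than something this page specifies.

```shell
# Hypothetical helper: join_keep_left FILE1 FILE2
# Sorts both files on field 1, then joins them, also printing lines of
# FILE1 that have no partner in FILE2 (the -a1 option).
join_keep_left() (
    work=$(mktemp -d) || exit 1
    LC_ALL=C sort -k 1b,1 "$1" > "$work/1"   # join needs input sorted on the join field
    LC_ALL=C sort -k 1b,1 "$2" > "$work/2"
    LC_ALL=C join -a1 "$work/1" "$work/2"
    rm -rf "$work"
)
```

Running `join_keep_left file1 file2` prints one line per matching pair plus each unmatched line of file1 as it stands.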
SEE ALSO
sort(1), comm(1), awk(1)
BUGS
With default field separation, the collating sequence is that of sort -b; with -t, the sequence is that of a plain sort.
The conventions of join, sort, comm, uniq, look and awk(1) are wildly incongruous.