02-07-2012
Grab unique record from different files on a condition
Hi,
I think this is the toughest problem I have ever come across, and I would be grateful to all of you for helping me get past it.
cat 1.txt
chr1 100 200
chr1 300 400
chr1 350 467
chr1 450 700
chr2 500 600
chr2 345 765
chr3 101 300
chr3 132 456
cat 2.txt
chr1 156 199
chr1 165 230
chr1 201 299
chr1 525 600
chr2 800 1000
chr2 534 676
chr2 200 400
chr2 100 200
chr3 200 400
chr3 500 600
chr3 400 700
OK, now this is what I am looking for.
Output.txt
chr1 300 400 1.txt
chr1 350 467 1.txt
chr1 201 299 2.txt
chr2 800 1000 2.txt
chr2 100 200 2.txt
chr3 500 600 2.txt
Here is how my output has been generated.
First, column one of each file has to be matched to column one of the other file: chr1 to chr1, chr2 to chr2 and chr3 to chr3 only. Records with different column-one values must not be compared.
Second, if the range given by columns 2 and 3 of a record intersects, or falls inside, the range given by columns 2 and 3 of a record in the other file, those records have to be eliminated.
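The intersection test in the second rule can be sketched as a small predicate. This is only an illustration in Python: the function name `overlaps` is hypothetical, and it assumes that both endpoints of a range are inclusive, which the post does not state.

```python
def overlaps(start_a, end_a, start_b, end_b):
    """Return True when two ranges share at least one position.

    Assumption: both endpoints are inclusive.
    """
    return start_a <= end_b and start_b <= end_a


# Ranges taken from the chr1 records in 1.txt and 2.txt.
print(overlaps(100, 200, 156, 199))  # True
print(overlaps(300, 400, 201, 299))  # False
```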
Examples from the given input:
chr1 100 200 (1.txt) intersects with chr1 156 199 (2.txt) and chr1 165 230 (2.txt), so all three are eliminated.
chr1 450 700 (1.txt) intersects with chr1 525 600 (2.txt), so these two are eliminated from the output.
Similarly,
chr2 500 600 (1.txt) intersects with chr2 534 676 (2.txt), so both are eliminated.
chr2 345 765 (1.txt) intersects with chr2 200 400 (2.txt), so both are eliminated from the output file.
The same applies to chr3. My files contain different numbers of records, and the records are not sorted. The last column in the output file indicates the file from which the record originates. If you have any questions or suggestions, please post a reply and I will respond as soon as possible to clarify anything that might help solve this problem. Your time, patience and attention are highly appreciated.
Thanks in advance.
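One possible way to express the whole request, given only as a rough sketch in Python rather than a definitive solution. The function names (`read_records`, `unique_records`, `grab_unique`) are hypothetical, range endpoints are assumed to be inclusive, and the input is assumed to be three whitespace-separated columns per line.

```python
import sys
from collections import defaultdict


def read_records(path):
    """Read whitespace-separated (name, start, end) records from a file."""
    records = []
    with open(path) as handle:
        for line in handle:
            fields = line.split()
            if len(fields) < 3:
                continue  # skip blank or malformed lines
            records.append((fields[0], int(fields[1]), int(fields[2])))
    return records


def unique_records(records, others):
    """Keep records that overlap no record in `others` with the same name."""
    by_name = defaultdict(list)
    for name, start, end in others:
        by_name[name].append((start, end))
    return [
        (name, start, end)
        for name, start, end in records
        # Assumption: endpoints are inclusive, so touching ranges overlap.
        if not any(start <= o_end and o_start <= end
                   for o_start, o_end in by_name.get(name, ()))
    ]


def grab_unique(path_a, path_b):
    """Return output lines, each tagged with the file the record came from."""
    a, b = read_records(path_a), read_records(path_b)
    lines = [f"{n} {s} {e} {path_a}" for n, s, e in unique_records(a, b)]
    lines += [f"{n} {s} {e} {path_b}" for n, s, e in unique_records(b, a)]
    return lines


if __name__ == "__main__":
    # Runs only when exactly two file names are given on the command line.
    if len(sys.argv) == 3:
        print("\n".join(grab_unique(sys.argv[1], sys.argv[2])))
```

Saved as a script (the file name grab_unique.py is hypothetical) and run as `python grab_unique.py 1.txt 2.txt`, this prints the surviving records of the first file and then those of the second, each in input order. For the sample 1.txt and 2.txt above it should reproduce the six lines listed under Output.txt. Every record is compared against all same-name records of the other file, so no sorting is needed, but this may be slow on very large inputs.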