03-31-2009
Fuzzy Match Logic for Numerical Values
I have searched the internet (including these forums) and perhaps I'm not using the right wording.
What I'm looking for is a function (preferably C) that analyzes the similitude of two numerical or near-numerical values, and returns either a true/false (match/nomatch) or a return code that corresponds to the type of near-match relationship that was encountered. The former would require the user to pre-define (through arguments/environ/variables) the acceptable tolerance or conditions, whereas the latter would be subsequently evaluated by the caller to see if the return code indicated a relationship (between the two numbers) within the tolerances of the application.
For example, each of these could be a different return code and the caller could determine whether or not the relationship is considered a "match" for the purposes of his application:
2401 and 2410 (trailing juxtaposition)
2401 and 4201 (leading juxtaposition)
2401 and 2041 (imbedded juxtaposition)
2479 and 24799 (substring match/trailing dupe)
12 and 12A (this is what I meant above by "near numerical" values)
etc.
Any information that would help me find some code that accomplishes this would be greatly appreciated.
~Marcus
9 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hi everyone,
I have file1 and file2 comma separated both.
file1 is:
Header1,Header2,Header3,Header4,Header5,Header6,Header7,Header8,Header9,Header10
Code7,,,,,,,,,
Code5,,,,,,,,,
Code3,,,,,,,,,
Code9,,,,,,,,,
Code2,,,,,,,,,file2... (17 Replies)
Discussion started by: cgkmal
17 Replies
2. UNIX for Dummies Questions & Answers
I have a text file where the second column is a list of numbers going from small to large. I want to extract the rows where the second column is smaller than or equal to 0.0001.
My input:
rs10082730 9e-08 12 46002702
rs2544081 1e-07 12 46015487
rs1425136 1e-06 7 35396742
rs2712590... (1 Reply)
Discussion started by: evelibertine
1 Replies
3. Shell Programming and Scripting
Hello ALL,
Hope all fine for you.
I have the following task but no idea about how to do.
I have 2 files (ascii)
one of them is a list of bib records, looks like this:
=LDR 01228nam 2200301 b 4500
=001 00000000000001
=005 20090429:10082000
=008 ... (2 Replies)
Discussion started by: ldiaz2106
2 Replies
4. Shell Programming and Scripting
I am debugging a script and have stuck up at one code line
awk -F , '{if (match($3,001)) { print $2 } }' Master20120307090511.tmp
The Master20120307090511.tmp is
001,ARE , 001
002,ARE , 002
003,ARE , 003
006,ARE , 006
011,ARE , 011
012,ARE , 012
What happens is when i fire this ... (5 Replies)
Discussion started by: vee_789
5 Replies
5. Shell Programming and Scripting
Hi Forum:
I have struggle with it and decide to use my eye ball to accomplish this.
Basically I am looking for sequence of date inside a file.
If one of the sequence repeat 2-3 time or skip once; it's still consider a match.
input text file:
Sep 6 A
Sep 6 A
Sep 10 A
Sep 7 B
Sep 8... (7 Replies)
Discussion started by: chirish
7 Replies
6. UNIX for Dummies Questions & Answers
I'm new to Unix and I have been trying to fix this problem for the past week.
How would I use grep to display only certain numbers for a list. For example, if I have this list:
Joe senior 4/50
John junior 25/50
Mary junior 41/50
Martha sophomore 2/50
...How do I get a file... (1 Reply)
Discussion started by: PTcharger
1 Replies
7. Shell Programming and Scripting
In the bash below the unique headers of each vcf.gz are stored in a text file with the same name. That is if 16-0000-file.vcf.gz was used the header text file would be 16-0000-file_header.txt.
There can be multiple vcf.gz in a directory, usually 3, that I need to fix the header in each file before... (6 Replies)
Discussion started by: cmccabe
6 Replies
8. Shell Programming and Scripting
In the below bash I am trying to rename eachof the 3 text files in /home/cmccabe/Desktop/percent by matching the numerical portion of each file to lines 3,4, or 5 in /home/cmccabe/Desktop/analysis.txt. There will always be a match between the files. When a match is found each text file in... (2 Replies)
Discussion started by: cmccabe
2 Replies
9. UNIX for Beginners Questions & Answers
Hi,
PROCINFO seems to be a great function but I don't manage to make it works.
input:
B,A,C
B
B,B
As an example, just want to count the occurence of each letter across the input and sort them by decreased order.
Wanted output:
B 4
A 1
C 1
When I use this command, the PROCINFO... (4 Replies)
Discussion started by: beca123456
4 Replies
ZGREP(1) General Commands Manual ZGREP(1)
NAME
zgrep - search possibly compressed files for a regular expression
SYNOPSIS
zgrep [ grep_options ] [ -e ] pattern filename...
DESCRIPTION
Zgrep invokes grep on compressed or gzipped files. These grep options will cause zgrep to terminate with an error code:
(-[drRzZ]|--di*|--exc*|--inc*|--rec*|--nu*). All other options specified are passed directly to grep. If no file is specified, then the
standard input is decompressed if necessary and fed to grep. Otherwise the given files are uncompressed if necessary and fed to grep.
If the GREP environment variable is set, zgrep uses it as the grep program to be invoked.
EXIT CODE
2 - An option that is not supported was specified.
AUTHOR
Charles Levert (charles@comm.polymtl.ca)
SEE ALSO
grep(1), gzexe(1), gzip(1), zdiff(1), zforce(1), zmore(1), znew(1)
ZGREP(1)