04-01-2009
Hi Corona. Thanks for your feedback. We tried LD but it really lacked intelligence for our purposes. For example, the numbers 2401 and 24 it reported as being 50% alike. Clearly by human eye you can see it is and we wouldn't want to include this as a "match." i.e., John Doe at 2401 Main St and John Doe at 24 Main St - probably not the same person. Yet, 2401 and 2410 is a strong possibility. We'd allow this if we did not find John Doe at 2401 Main but we did find a John Doe at 2410, we'd call it a match. Again, LD gives these two numbers a weight of 50% similitude. So in order to catch 2401 and 2410 (simple juxtaposition) we had to set our threshold at 50; yet then we get really crappy matches like 2401 and 24.
In order to catch 310 and 301 with LD, we'd have to lower our threshold to 33%, but then you really let in the junk!
I've no doubt LD has its applications (in fact, we do use it for street names), but it's not so hot for fuzzy number matches. Thank you for your thoughts though.
~Marcus
9 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hi everyone,
I have file1 and file2 comma separated both.
file1 is:
Header1,Header2,Header3,Header4,Header5,Header6,Header7,Header8,Header9,Header10
Code7,,,,,,,,,
Code5,,,,,,,,,
Code3,,,,,,,,,
Code9,,,,,,,,,
Code2,,,,,,,,,file2... (17 Replies)
Discussion started by: cgkmal
17 Replies
2. UNIX for Dummies Questions & Answers
I have a text file where the second column is a list of numbers going from small to large. I want to extract the rows where the second column is smaller than or equal to 0.0001.
My input:
rs10082730 9e-08 12 46002702
rs2544081 1e-07 12 46015487
rs1425136 1e-06 7 35396742
rs2712590... (1 Reply)
Discussion started by: evelibertine
1 Replies
3. Shell Programming and Scripting
Hello ALL,
Hope all fine for you.
I have the following task but no idea about how to do.
I have 2 files (ascii)
one of them is a list of bib records, looks like this:
=LDR 01228nam 2200301 b 4500
=001 00000000000001
=005 20090429:10082000
=008 ... (2 Replies)
Discussion started by: ldiaz2106
2 Replies
4. Shell Programming and Scripting
I am debugging a script and have stuck up at one code line
awk -F , '{if (match($3,001)) { print $2 } }' Master20120307090511.tmp
The Master20120307090511.tmp is
001,ARE , 001
002,ARE , 002
003,ARE , 003
006,ARE , 006
011,ARE , 011
012,ARE , 012
What happens is when i fire this ... (5 Replies)
Discussion started by: vee_789
5 Replies
5. Shell Programming and Scripting
Hi Forum:
I have struggle with it and decide to use my eye ball to accomplish this.
Basically I am looking for sequence of date inside a file.
If one of the sequence repeat 2-3 time or skip once; it's still consider a match.
input text file:
Sep 6 A
Sep 6 A
Sep 10 A
Sep 7 B
Sep 8... (7 Replies)
Discussion started by: chirish
7 Replies
6. UNIX for Dummies Questions & Answers
I'm new to Unix and I have been trying to fix this problem for the past week.
How would I use grep to display only certain numbers for a list. For example, if I have this list:
Joe senior 4/50
John junior 25/50
Mary junior 41/50
Martha sophomore 2/50
...How do I get a file... (1 Reply)
Discussion started by: PTcharger
1 Replies
7. Shell Programming and Scripting
In the bash below the unique headers of each vcf.gz are stored in a text file with the same name. That is if 16-0000-file.vcf.gz was used the header text file would be 16-0000-file_header.txt.
There can be multiple vcf.gz in a directory, usually 3, that I need to fix the header in each file before... (6 Replies)
Discussion started by: cmccabe
6 Replies
8. Shell Programming and Scripting
In the below bash I am trying to rename eachof the 3 text files in /home/cmccabe/Desktop/percent by matching the numerical portion of each file to lines 3,4, or 5 in /home/cmccabe/Desktop/analysis.txt. There will always be a match between the files. When a match is found each text file in... (2 Replies)
Discussion started by: cmccabe
2 Replies
9. UNIX for Beginners Questions & Answers
Hi,
PROCINFO seems to be a great function but I don't manage to make it works.
input:
B,A,C
B
B,B
As an example, just want to count the occurence of each letter across the input and sort them by decreased order.
Wanted output:
B 4
A 1
C 1
When I use this command, the PROCINFO... (4 Replies)
Discussion started by: beca123456
4 Replies
LEARN ABOUT PHP
sqlite_num_rows
SQLITE_NUM_ROWS(3) SQLITE_NUM_ROWS(3)
sqlite_num_rows - Returns the number of rows in a buffered result set
SYNOPSIS
int sqlite_num_rows (resource $result)
DESCRIPTION
Object oriented style (method):
int SQLiteResult::numRows (void )
Returns the number of rows in the buffered $result set.
PARAMETERS
o $result
- The SQLite result resource. This parameter is not required when using the object-oriented method.
Note
This function cannot be used with unbuffered result handles.
RETURN VALUES
Returns the number of rows, as an integer.
EXAMPLES
Example #1
Procedural example
<?php
$db = sqlite_open('mysqlitedb');
$result = sqlite_query($db, "SELECT * FROM mytable WHERE name='John Doe'");
$rows = sqlite_num_rows($result);
echo "Number of rows: $rows";
?>
Example #2
Object-oriented example
<?php
$db = new SQLiteDatabase('mysqlitedb');
$result = $db->query("SELECT * FROM mytable WHERE name='John Doe'");
$rows = $result->numRows();
echo "Number of rows: $rows";
?>
SEE ALSO
sqlite_changes(3), sqlite_query(3), sqlite_num_fields(3).
PHP Documentation Group SQLITE_NUM_ROWS(3)