Search Results

Search: Posts Made By: gimley
3,383
Posted By gimley
Many thanks to all. I was using an older version of AWK. I installed the new version and got the results.
3,383
Posted By gimley
Out of memory message
I have written a script to generate trigrams. The script runs perfectly on a relatively small file. When a large file is given, around 700 KB, I get the message

Counting trigrams ... 1614375...
1,523
Posted By gimley
Find Syllable count mismatch
Hello,
I have written a syllable splitter for Pseudo English [conforming to the rules of Indic] and Indic.
I have a large database with the following structure

Syllables in Pseudo English...
3,318
Posted By gimley
Remove dupes in a large file
I have a large file (1.5 GB) and want to sort the file.
I used the following AWK script to do the job
!x[$0]++
The script works but it is very slow and takes over an hour to do the job. I suspect...
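For readers unfamiliar with the one-liner quoted in this snippet: `!x[$0]++` prints a line only the first time it is seen, so it removes duplicates while keeping the input order; it does not sort. A minimal sketch follows. The function name `dedupe_keep_first` and the sample input are illustrative assumptions, not taken from the thread.

```shell
# dedupe_keep_first: print each distinct line once, in first-seen order.
# seen[$0] is 0 (false) the first time a line appears, so !seen[$0] is true
# and awk's default action (print) runs; the ++ then marks the line as seen.
dedupe_keep_first() {
    awk '!seen[$0]++' "$@"
}

# Hypothetical sample input: the second "b" and second "a" are dropped.
printf 'b\na\nb\nc\na\n' | dedupe_keep_first
```

Because the array keeps every distinct line in memory, the memory needed grows with the number of unique lines in the input.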
1,340
Posted By gimley
Many thanks for the answer. That is what I plan to do. The number of files in each directory is considerable but I have no choice but to use this method.
1,340
Posted By gimley
Could you please explain in what way? Thanks.
1,340
Posted By gimley
Sorry for the late reply. Many thanks. I work in a Windows environment, hence the request for Perl or awk.
1,340
Posted By gimley
Alignment tool to join text files in 2 directories to create a parallel corpus
I have two directories called English and Hindi. Each directory contains the same number of files with the only difference being that in the case of the English Directory the tag is
.english
and...
1,781
Posted By gimley
Many thanks for pointing out the blooper. I guess in my excitement, I forgot the Field separator.
Tried it out and it works perfectly. Many thanks.
1,781
Posted By gimley
Am replying from my phone. Thanks very much. Am out at present and should be back in a couple of hours. I will get back to you on this asap.

---------- Post updated at 03:28 AM ---------- Previous...
1,781
Posted By gimley
Creating verbal structures from a dictionary and a template
My main aim here is to create a database of verbs from a language [in this case English] to Hindi. The output, if it works well, will be put up on a university site for researchers to use for Machine...
977
Posted By gimley
Many thanks for the warning. The solutions you have provided are elegant and will ensure data integrity.
I tested all three and the results are excellent.
Thanks for taking time off to write these...
977
Posted By gimley
Thanks a lot. It worked.
How stupid of me!!!
977
Posted By gimley
Help with script to convert rows to columns
Hello
I have a large database with the following structure:

Headword=Gloss1;Gloss2;Gloss3

The Glosses are separated by a ;
What I need is to reduce the multiple glosses on each row to...
3,179
Posted By gimley
Thanks a lot for your kind help. I now also understand why my Perl script goofed up.

---------- Post updated 05-09-18 at 01:45 AM ---------- Previous update was 05-08-18 at 10:49 PM ----------
...
3,179
Posted By gimley
Thanks a lot. Excuse my ignorance, but how many bytes do I allocate?
My data is in UTF-8 format, and if I want to ensure that 5000 characters are chunked, what would the byte size be? In ASCII format...
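As general background to the question in this snippet (not the answer given in the thread): UTF-8 encodes each character (code point) in one to four bytes, so 5000 characters can occupy anywhere from 5000 to 20000 bytes, and no fixed byte size corresponds to a fixed number of characters. A small sketch of the difference follows. `utf8_bytes` is a hypothetical helper name, the sample strings are assumptions, and the script is assumed to be saved as UTF-8.

```shell
# utf8_bytes: print the number of bytes in its argument.
# wc -c counts bytes regardless of locale; tr strips the padding
# that some wc implementations put around the number.
utf8_bytes() {
    printf '%s' "$1" | wc -c | tr -d ' '
}

utf8_bytes 'abc'    # three ASCII letters, one byte each
utf8_bytes 'कखग'    # three Devanagari letters, three bytes each in UTF-8
```

For Indic text, a byte budget therefore holds far fewer characters than the same budget holds for ASCII text.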
3,179
Posted By gimley
Modification of Perl script to split a large file into chunks of 5000 characters
I have a Perl script which splits a large file into chunks. The script is given below

use strict;
use warnings;
open (FH, "<monolingual.txt") or die "Could not open source file. $!";
my $i = 0;...
1,142
Posted By gimley
Many thanks for all your kind help. My broadband connectivity was down all day and hence the delay. All the solutions worked. I had ensured that my data had no trailing spaces so the issue of...
1,142
Posted By gimley
Help to identify blank space in a file
Hello,
I have a dictionary of over 400,000 words with the following structure

source=target

The database contains single words as well as phrases. To train the data, I need only mappings with...
1,222
Posted By gimley
Thanks a lot, especially for the code and the precious comments. I always assumed that awk respected the order in the file and did not disturb the same.
You made my day.
1,222
Posted By gimley
Thanks for your kind words. I am 70 years old and accustomed to C programming, and I guess in my hurry I forgot to check what is already available.

---------- Post updated 01-17-18 at 02:06 AM...
1,222
Posted By gimley
Dear Don Cragun,
Thanks for taking time off to reply. As usual, I hunted for this specific issue but could not find an answer. I hope I did not miss the solution.
I have awk/sed and perl on my...
1,222
Posted By gimley
Creating a master file of conjugated verbs by concatenating root and inflection from separate files
Apologies for the long descriptive title.
I am working with Sindhi and developing a database of all verbal conjugations in that language.
I have generated 2 files:
Verbs.dic contains all the verbs,...
1,074
Posted By gimley
Sorry for the delay.
I had to download a sed which works on Windows 10.
Yes, it works and "does the job". It "cleans" up all the punctuation.
Thanks a lot.
1,074
Posted By gimley
Hello,
I am not too familiar with sed.
If I run the sed script you have given
sed 's/[[:punct:]]/ &/g' file
would it do the job?
A naive question maybe, but how do I run the script?
Thanks a lot...
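To illustrate what the command quoted in this snippet does: `s/[[:punct:]]/ &/g` inserts a space before every punctuation character, where `&` stands for the matched text. A minimal sketch, assuming a POSIX sed; the function name `space_before_punct` and the sample sentence are illustrative and not from the thread.

```shell
# space_before_punct: put a space in front of each punctuation character.
# [[:punct:]] matches any punctuation character, & re-inserts the match,
# and the g flag applies the substitution to every match on the line.
space_before_punct() {
    sed 's/[[:punct:]]/ &/g' "$@"
}

# Hypothetical sample input; prints "Hello , world !"
printf 'Hello, world!\n' | space_before_punct
```

Run against a file from a shell prompt, the same command writes the result to standard output and leaves the file itself unchanged.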
Showing results 1 to 25 of 303

 
All times are GMT -4.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.