Syllable splitter in Perl


 
# 1  
Old 09-24-2011

Hello,
I am a relative newbie and want to split English names into syllables. Does anyone know of a Perl script that does that? Since my main area is linguistics, I would be happy to add rules to it and post the Perl script back for other users. I tried the CPAN Perl modules, but they don't really do what I want.
Any help would be gratefully acknowledged.
Many thanks
# 2  
Old 09-24-2011
Hi

Take a look at the module Lingua::Phonology::Syllable.

I haven't used it, so I'm not sure whether it will be useful for you.

Regards,
Birei
# 3  
Old 09-25-2011
Many thanks. Tried it. Unfortunately it doesn't work too well. Will have to write a program in C to do the job.
# 4  
Old 09-25-2011
There is a package in Linux known as style:

style(1) - Linux man page

It gives you several readability statistics along with the number of syllables. Kindly see if that could be of any help. You may download the package here: Download Diction 1.11 for Linux (Softpedia). Diction and style are two old standard UNIX commands.
# 5  
Old 09-25-2011
Many thanks. Unfortunately, my OS is Vista, so Linux tools do not help me.
Best regards,
Gimley
# 6  
Old 09-25-2011
There is a Perl package: Kim Ryan / Lingua-EN-Fathom - search.cpan.org

I am not sure whether it will serve your purpose, as I haven't used it.

---------- Post updated at 10:15 AM ---------- Previous update was at 10:13 AM ----------

Or will this help?

Diction for Windows
# 7  
Old 09-25-2011
Many thanks.
Will try both and get back to you. The trouble with CPAN libraries is the complexity of installation. If the readme file does not give the correct parameters, you can have a lot of trouble running the program.