Hello,
I am writing a phonetic converter that turns written French into IPA. French elides the final e of certain short words before a word beginning with a vowel, replacing the e with an apostrophe and joining the two words (for example, l'homme).
Loading all such joined forms into my dictionary would simply overload the database.
Apart from this, the engine I have written does not convert words that are followed by punctuation marks.
What I need is an awk or Perl script that can act as a preprocessor, separating the words at the apostrophe with a space.
Below is a sample rule file, in UTF-8, which needs to handle all such cases. The convention is that the left-hand string is converted to the right-hand string, with
as the delimiter.
Some samples for testing are given below.
Many thanks for your kind help. Caveat: I work under Windows.
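The preprocessing step described above could be sketched roughly as follows. This is only an illustration in Python (not awk or Perl), chosen because it runs unchanged under Windows and handles UTF-8 text directly. The names `preprocess`, `split_elision` and `KEEP_JOINED`, the choice of punctuation marks, and the decision to leave the apostrophe attached to the first word are all assumptions, not part of the original rule file.

```python
import re

# Assumption: both the ASCII apostrophe and the typographic one (U+2019) occur.
APOSTROPHES = "'\u2019"

# Assumption: some words contain an apostrophe but are single dictionary
# entries and must not be split. This list is illustrative, not complete.
KEEP_JOINED = {"aujourd'hui"}

# An apostrophe with a word character on each side marks an elision boundary.
ELISION = re.compile(r"(?<=\w)([" + APOSTROPHES + r"])(?=\w)")

# Punctuation marks to detach from neighbouring words (an assumed set).
PUNCT = re.compile(r'([.,;:!?()"«»…])')


def split_elision(word):
    """Insert a space after the apostrophe unless the word is an exception."""
    if word.lower().replace("\u2019", "'") in KEEP_JOINED:
        return word
    return ELISION.sub(r"\1 ", word)


def preprocess(line):
    """Detach punctuation, then split elided forms, one space between tokens."""
    padded = PUNCT.sub(r" \1 ", line)
    return " ".join(split_elision(word) for word in padded.split())


print(preprocess("L'homme arrive."))
```

Two limitations worth noting: the punctuation rule also splits decimal numbers such as 3,5, and any word that must stay joined has to be listed in `KEEP_JOINED` by hand.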
I build several files by using the cut command to grab selected fields (columns) from a really big CSV file. Each file is one column of data. I then put them together using the paste command. Here is the code, written in tcsh:
cut -d , -f 1 some.csv > 1.csv
cut -d , -f 10 some.csv > 10.csv
paste 1.csv... (2 Replies)
Hi,
I have an input file with the following data:
file1.txt
001 aaa_1:abcd
002 bbb_2:abcd
I want output as,
001xabcd
002xabcd
Here I am trying to replace "{1 space}{alphanumeric string with underscore}{:}" with the character "x".
I tried to achieve this using sed, but I am not getting this... (5 Replies)
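The replacement described here can be expressed as a single regular expression. The sketch below uses Python rather than sed, and the name `replace_label` is hypothetical; it assumes the label always consists of letters, digits and underscores and is preceded by exactly one space.

```python
import re

# One space, a run of letters/digits/underscores, then a colon.
LABEL = re.compile(r" \w+:")


def replace_label(line, replacement="x"):
    """Replace the first ' label:' run in the line with the replacement."""
    return LABEL.sub(replacement, line, count=1)


for line in ["001 aaa_1:abcd", "002 bbb_2:abcd"]:
    print(replace_label(line))
```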
Hi everyone,
I am new to the world of shell script programming.
I have a file named Fnd1.txt which has the contents as below.
I need to replace the \t with a tab character. Can anyone help me
to write a Perl script for this?
USA45V1\tG\t341029
USAV1T1\tG\t450545
USAREJ1\tG\t572645... (5 Replies)
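As a rough illustration of the conversion being asked for (in Python rather than Perl), the two-character sequence backslash-t can be swapped for a real tab with a plain string replacement. The function names and the idea of writing to a separate output file are assumptions.

```python
def expand_literal_tabs(text):
    """Turn each literal backslash-t sequence into a real tab character."""
    return text.replace("\\t", "\t")


def convert_file(src_path, dst_path, encoding="utf-8"):
    """Copy src_path to dst_path, expanding literal \\t sequences line by line."""
    with open(src_path, encoding=encoding) as src, \
         open(dst_path, "w", encoding=encoding) as dst:
        for line in src:
            dst.write(expand_literal_tabs(line))
```

Calling `convert_file("Fnd1.txt", "Fnd1_tabs.txt")` would leave the original untouched; the output file name is only an example.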
Is there a way to do a find and replace in a .gz file in a single script?
I can always unzip, find and replace, and then zip it again, but I would hate to do this every time.
Thanks!
Vivek (1 Reply)
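One way to wrap the whole cycle in a single script is sketched below in Python. It still decompresses and recompresses the data, since gzip offers no way to edit compressed content in place, but the steps happen in one pass. The name `replace_in_gzip` is hypothetical, and the sketch assumes a text file in which the search string never spans a line break.

```python
import gzip
import os
import tempfile


def replace_in_gzip(path, old, new, encoding="utf-8"):
    """Rewrite a gzip-compressed text file with every `old` replaced by `new`."""
    directory = os.path.dirname(os.path.abspath(path))
    # Write to a temporary file in the same directory and swap it in at the
    # end, so a failure midway leaves the original archive intact.
    fd, tmp_path = tempfile.mkstemp(suffix=".gz", dir=directory)
    os.close(fd)
    try:
        # newline="" keeps the original line endings unchanged.
        with gzip.open(path, "rt", encoding=encoding, newline="") as src, \
             gzip.open(tmp_path, "wt", encoding=encoding, newline="") as dst:
            for line in src:
                dst.write(line.replace(old, new))
        os.replace(tmp_path, path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```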
Hi -
I am looking for a way to replace a string with a new string in multiple *.sql files in a directory, without using a temporary file.
Normally I can use the sed command as below:
for W in FILE*.sql
do
sed 's/OLD/NEW/g' "$W" > TEMPFILE.dat
mv TEMPFILE.dat "$W"
done
But here in my... (9 Replies)
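Since the rest of the question is cut off, the following is only a guess at what would help: a Python sketch that rewrites each matching file directly, with no explicit temporary file. Each file is read fully into memory first, so it assumes the files are of modest size. The name `replace_in_files` is hypothetical.

```python
from pathlib import Path


def replace_in_files(directory, pattern, old, new, encoding="utf-8"):
    """Replace `old` with `new` in every file matching `pattern`.

    Returns the list of paths that were actually modified.
    """
    old_bytes = old.encode(encoding)
    new_bytes = new.encode(encoding)
    changed = []
    for path in sorted(Path(directory).glob(pattern)):
        data = path.read_bytes()  # bytes, so line endings are preserved
        updated = data.replace(old_bytes, new_bytes)
        if updated != data:
            path.write_bytes(updated)
            changed.append(path)
    return changed
```

For the loop above, the call would be `replace_in_files(".", "FILE*.sql", "OLD", "NEW")`.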
Hi Everyone,
I am new to this forum and new to sed/awk programming too!
I need to find a particular string in file1 (a text file) and replace it with a value from another text file (file2). file2 has only one line, and the replacement value is in its second column.
file 1:
(assert (=... (21 Replies)
Legends,
I have a file /tmp/list.txt
I want to find "/bin/" and replace it with "/log/".
I tried the following but had no luck:
Sandy: /tmp> perl -pi -e 's/\/bin\/\/log\/' /tmp/list.txt >> /tmp/try
Substitution pattern not terminated at -e line 1.
AND,
Sandy: /tmp> perl -pi -e... (2 Replies)
Dear Shell Expert,
I have a requirement to replace the passwd field in a number of files, based on the username in them.
The matching username and "new" password are available in a separate file. Below is a sample of both.
#sample of one of the file content, red highlighted fields are username and... (20 Replies)
Hello Forum.
I have a file called abc.sed with the following commands:
s/1/one/g
s/2/two/g
...
I also have a second file called abc.dat and would like to substitute all occurrences of "1" with "one", "2" with "two", etc., and create a new file called abc_new.dat
sed -f abc.sed abc.dat >... (10 Replies)
Hi All,
I have the sample data below in a file.
I need to find all the lines in this file that contain the word ABC, and in those lines replace the characters at position 120, which are "CO:BOGFDUI" (30 chars), with blank spaces.
I have tried using grep to find the lines with ABC (grep ABC filename),... (3 Replies)
Discussion started by: abhi_123