Hi!
I am trying to remove duplicate entries in a text file, but only between delimiters.
Like in the example below, but I don't know how to do that with sort or a similar tool.
input:
output:
I would be grateful for any help!
Kind regards, Fugitivus
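The example input and output did not survive in the post, so the following is only a sketch under assumptions: blocks are separated by a line that consists solely of a delimiter string (the default "---" is hypothetical), and within each block only the first occurrence of a line should be kept. The function name dedupe_between is made up for illustration. It uses awk rather than sort, so the original line order is preserved.

```shell
# Remove repeated lines inside each delimited block, keeping first occurrences.
# Assumption: a line equal to the delimiter (hypothetical default "---")
# separates the blocks; the delimiter lines themselves are always printed.
dedupe_between() {
    delim=${1:-"---"}
    awk -v delim="$delim" '
        $0 == delim { split("", seen); print; next }  # new block: forget earlier lines
        !seen[$0]++                                   # print a line only the first time
    '
}
```

Possible use: dedupe_between '###' < input.txt > output.txt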
I have a file:
Fred
Fred
Fred
Jim
Fred
Jim
Jim
If sort is run on this file, shouldn't the output be:
Fred
Fred
Fred
Fred
Jim
Jim
Jim (3 Replies)
Using the last, uniq, sort and cut commands, determine how many times the different users have logged in.
I know how to use the last command and cut command...
I came up with last | cut -f1 -d" " | uniq
I don't know if this is right; can someone please help me? Thanks (1 Reply)
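The pipeline in that post is close, but uniq only collapses adjacent duplicates and does not count anything without -c, so the names need to be sorted first. A minimal sketch (the function name count_logins is made up here); it reads from standard input so it can be tried on sample data:

```shell
# Count how often each first space-separated field occurs, most frequent first.
# uniq -c only counts adjacent duplicates, hence the sort in front of it.
count_logins() {
    cut -d' ' -f1 | sort | uniq -c | sort -rn
}
```

Typical use would be last | count_logins. Note that the output of last usually ends with a blank line and a "wtmp begins" trailer, which show up as extra entries unless they are filtered out.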
Does anyone have a quick and dirty way of performing a sort and uniq in perl?
I have an array with data like:
this is bkupArr BOLADVICE_VN
this is bkupArr MLT6800PROD2A
this is bkupArr MLT6800PROD2A
this is bkupArr BOLADVICE_VN_7YR
this is bkupArr MLT6800PROD2A
I want to sort it... (4 Replies)
Hi,
I have data in two columns (such as below). I need to compare rows against each other, and if one row matches another row (except for different case) and their values in the second column are different, then print one of the rows (either is fine).
Here is an... (5 Replies)
Hello,
I have a large data file:
1234 8888 bbb
2745 8888 bbb
9489 8888 bbb
1234 8888 aaa
4838 8888 aaa
3977 8888 aaa
I need to remove duplicate lines (where the first column is the duplicate). I have been using:
sort file.txt | uniq -w4 > newfile.txt
However, it seems to keep the... (11 Replies)
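The post is cut off, so which of the duplicate lines the poster wants to keep is not known. As a sketch under that caveat: uniq -w4 (a GNU extension) compares only the first four characters, and which line of a group reaches it first depends on how sort ordered the full lines. An awk filter keyed on the first column keeps the first line in the original file order instead (the function name first_per_key is made up here):

```shell
# Keep only the first line seen for each distinct value of column 1.
# Input order is preserved; no sorting is needed.
first_per_key() {
    awk '!seen[$1]++' "$@"
}
```

Possible use: first_per_key file.txt > newfile.txt. With the sample data above this keeps "1234 8888 bbb" and drops "1234 8888 aaa". sort -u -k1,1 file.txt is another common way to get one line per key.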
Hi All,
I have a text file with the format shown below. Some of the records are duplicated with the only exception being date (Field 15). I want to compare all duplicate records using subscriber number (field 7) and keep only those records with greater date.
... (1 Reply)
Hi again,
I have files with the following contents
datetime,ip1,port1,ip2,port2,number
How would I find out how many times the ip1 field shows up in a particular file? Then how would I find out how many times ip1 and port2 show up?
Please mind the file may contain 100k lines. (8 Replies)
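One way this could be done, assuming the field order given in the post (so ip1 is field 2 and port2 is field 5) and no header line in the file. The function names are made up for illustration. 100k lines is a small input for sort.

```shell
# Count occurrences of each ip1 value (field 2), most frequent first.
count_ip1() {
    cut -d, -f2 "$@" | sort | uniq -c | sort -rn
}

# Count occurrences of each ip1,port2 combination (fields 2 and 5).
count_ip1_port2() {
    cut -d, -f2,5 "$@" | sort | uniq -c | sort -rn
}
```

Possible use: count_ip1 logfile.csv, where logfile.csv is a placeholder name. To look at a single address, the result can be filtered with grep.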
Hello all,
I need to pick your brains.
I have a 10 GB file where each row is a name, and I am expecting about 50 names in total. So there are a lot of repetitions in clusters.
So I want to do a
sort -u file
Will it be considerably faster or slower to run uniq before piping to sort... (3 Replies)
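For reference, the variant being asked about can be sketched as below (the function name unique_names is made up here). Because the repetitions come in clusters, uniq collapses each run of adjacent identical lines in a single streaming pass, so sort -u receives far fewer lines than the raw file. That would probably make it faster, but this has not been measured here.

```shell
# Collapse adjacent repeats first, then sort and deduplicate what is left.
# The result is the same as plain "sort -u"; only the amount of data
# reaching sort differs.
unique_names() {
    uniq "$@" | sort -u
}
```

Possible use: unique_names file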
Hi All,
Below is the actual file which I would like to sort and run uniq -u on.
/opt/oracle/work/Antony/Shell_Script> cat emp.1st
2233|a.k. shukula |g.m. |sales |12/12/52 |6000
1006|chanchal singhvi |director |sales |03/09/38 |6700... (8 Replies)
Discussion started by: Antony Ankrose
LEARN ABOUT DEBIAN
g2p-sk
g2p-sk(1) USER COMMANDS g2p-sk(1)
NAME
g2p-sk - phonetic transcription for Slovak
SYNOPSIS
g2p-sk [--color] [--dl debug level] [--help] [--stats] [--ofile <file_name>] [<input file>]
DESCRIPTION
Phonetic transcription is essential for some linguistic or speech recognition applications. Depending on the language, either a rule-based
or a statistical approach is used. g2p-sk implements the rule-based approach, but in the future it may be replaced by a statistical one.
Each input word, consisting of a sequence of graphemes, is transcribed into a sequence of phones in SAMPA coding. If no input file
is specified, standard input is read. If an input file is used, the output is also written to a file. The filename is
the input filename with the extension "_trans.txt".
The input and output code page is ISO 8859-2. To use it with a different code page, use a code page converter and pipes. For example, to have input and output
in UTF-8, use (for interactive use):
filterm UTF8-iso2 iso2-UTF8 g2p-sk
or (for batch processing):
iconv -f UTF-8 -t ISO_8859-2 | g2p-sk | iconv -f ISO_8859-2 -t UTF-8
The performance of the phonetic transcription depends on the morphematic segmentation. To improve the quality of the morphematic segmentation, it is
possible to replace the small version of the simple morphematic dictionary in /usr/share/g2p_sk/Exceptions/morfemy.ddat with a better
one. The syllabic segmentation is as important as the morphematic one. The syllabic segmentation is provided by the sylseg-sk package.
The design of g2p-sk is language dependent. To use it for another language, all the rules need to be rewritten.
OPTIONS
--color
Enable color output.
--dl 1..5
Set the debug level, which controls the amount of displayed information. Debug level 0 displays nothing. The maximum level 5 displays
a full debugging report. The default debug level is 1.
--help
Display a short help text.
--ofile <file_name>
Also write output to the given file.
--stats
Count and display statistics for each phone.
EXAMPLES
Use standard input and debug level 3:
g2p-sk --dl 3
Process all the words from file aaa.txt:
g2p-sk aaa.txt
EXIT STATUS
g2p-sk returns zero if it succeeds in processing all the input words.
AUTHOR
Jozef Ivanecky (dodo (at) kanoistika.sk)
SEE ALSO
sylseg-sk(1), filterm(1), iconv(1), konwert(1)
version 0.4 May 17, 2009 g2p-sk(1)