Hi!
I am trying to remove duplicate entries in a text file, but only between delimiters.
It should work like the example below, but I don't know how to do that with sort or a similar tool.
input:
output:
I would be grateful for any help!
Kind regards, Fugitivus
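Since the input and output samples are missing from the post, the following is only a minimal sketch under assumptions: the delimiter is taken to be a complete line (the hypothetical --- is used in the example), and a duplicate is a line that repeats within the same block. The function name dedupe_blocks is made up for illustration. It uses awk rather than sort, so the original line order is kept.

```shell
# dedupe_blocks DELIM [FILE...]
# Print the input, dropping repeated lines within each delimiter-separated block.
# Assumption: the delimiter is a whole line equal to DELIM (hypothetical, e.g. '---').
dedupe_blocks() {
    delim=$1
    shift
    awk -v delim="$delim" '
        # Delimiter line: forget the lines seen so far, print it, start a new block.
        # Appending "" forces a string comparison even for numeric-looking text.
        ($0 "") == (delim "") { split("", seen); print; next }
        # Any other line is printed only the first time it appears in its block.
        !seen[$0]++
    ' "$@"
}

# Example call (file names are placeholders):
#   dedupe_blocks '---' input.txt > output.txt
```

The same line may still appear in different blocks, because the list of seen lines is cleared at every delimiter.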
I have a file:
Fred
Fred
Fred
Jim
Fred
Jim
Jim
If sort is run on the file listed above, shouldn't the output be:
Fred
Fred
Fred
Fred
Jim
Jim
Jim (3 Replies)
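As a quick check, the seven names can be fed through sort directly. This is a sketch, not output captured from the poster's system; LC_ALL=C is an added assumption that makes the ordering byte-wise and independent of the locale.

```shell
# Feed the seven names from the post to sort; identical lines end up adjacent.
sorted=$(printf '%s\n' Fred Fred Fred Jim Fred Jim Jim | LC_ALL=C sort)
printf '%s\n' "$sorted"

# Piping the sorted list through uniq -c adds a count per distinct name.
counts=$(printf '%s\n' "$sorted" | uniq -c)
printf '%s\n' "$counts"
```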
Using the last, uniq, sort and cut commands, determine how many times the different users have logged in.
I know how to use the last command and cut command...
I came up with last | cut -f1 -d" " | uniq
I don't know if this is right. Can someone please help me? Thanks. (1 Reply)
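uniq only collapses adjacent duplicate lines, so the names have to be sorted first, and the -c option is what produces the counts. A minimal sketch follows. The function name count_logins is hypothetical, and it assumes that the user name is the first space-separated field (as in the cut command above) and that the trailing blank line and the summary line starting with wtmp that last usually prints should be ignored.

```shell
# count_logins: read last-style output on stdin, print "count user" per user.
count_logins() {
    grep -v -e '^$' -e '^wtmp ' |   # assumption: skip the blank and summary lines
        cut -d' ' -f1 |             # keep only the user name
        sort |                      # group identical names together
        uniq -c                     # count each group
}

# Typical use:
#   last | count_logins
```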
Does anyone have a quick and dirty way of performing a sort and uniq in perl?
Given an array with data like:
this is bkupArr BOLADVICE_VN
this is bkupArr MLT6800PROD2A
this is bkupArr MLT6800PROD2A
this is bkupArr BOLADVICE_VN_7YR
this is bkupArr MLT6800PROD2A
I want to sort it... (4 Replies)
Hi,
I have data in two columns (such as below). I need to compare rows against each other: if one row matches another row except for letter case, and their values in the second column are different, then one of the rows should be printed (either is fine).
Here is an... (5 Replies)
Hello,
I have a large data file:
1234 8888 bbb
2745 8888 bbb
9489 8888 bbb
1234 8888 aaa
4838 8888 aaa
3977 8888 aaa
I need to remove duplicate lines (where the first column is the duplicate). I have been using:
sort file.txt | uniq -w4 > newfile.txt
However, it seems to keep the... (11 Replies)
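uniq -w4 is a GNU extension that compares only the first four characters, and after a plain sort it keeps whichever line of each group sorts first. One possible alternative, sketched here with a hypothetical function name, keeps the first line seen for each value of column one and leaves the input order untouched. It assumes whitespace-separated columns.

```shell
# first_per_key [FILE...]
# Keep only the first line seen for each distinct value in column 1.
first_per_key() {
    awk '!seen[$1]++' "$@"
}

# Example call with the file names from the post:
#   first_per_key file.txt > newfile.txt
```

Because the key is the whole first field rather than a fixed width, this also works when the values in column one are not all four characters long.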
Hi All,
I have a text file with the format shown below. Some of the records are duplicated, the only difference being the date (field 15). I want to compare all duplicate records by subscriber number (field 7) and keep only the records with the later date.
... (1 Reply)
Hi again,
I have files with the following contents
datetime,ip1,port1,ip2,port2,number
How would I find out how many times each ip1 value shows up in a particular file? And how would I find out how many times each combination of ip1 and port2 shows up?
Please keep in mind that the file may contain 100k lines. (8 Replies)
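One way to approach this is the usual cut, sort and uniq -c pipeline. The sketch below assumes the comma-separated layout from the post (ip1 is field 2, port2 is field 5) and that the file has no header line. Both function names are hypothetical.

```shell
# count_ip1 [FILE...]
# How often each ip1 value (field 2) occurs, most frequent first.
count_ip1() {
    cut -d, -f2 "$@" | sort | uniq -c | sort -rn
}

# count_ip1_port2 [FILE...]
# How often each ip1,port2 pair (fields 2 and 5) occurs, most frequent first.
count_ip1_port2() {
    cut -d, -f2,5 "$@" | sort | uniq -c | sort -rn
}
```

A file of 100k lines is small for sort, so no special tuning should be needed at that size.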
Hello all,
Need to pick your brains,
I have a 10 GB file where each row is a name, and I am expecting about 50 distinct names in total, so there are a lot of repetitions in clusters.
So I want to do a
sort -u file
Will it be considerably faster or slower to use a uniq before piping it to sort... (3 Replies)
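uniq reads its input in a single pass and collapses adjacent repeats, so with clustered data it should hand sort a much smaller stream. Whether that is actually faster is worth measuring with time on the real file. Two variants are sketched below with hypothetical function names; the awk one keeps only the distinct names in memory.

```shell
# distinct_names [FILE...]
# Collapse adjacent repeats first, then sort and deduplicate what is left.
distinct_names() {
    uniq "$@" | sort -u
}

# distinct_names_awk [FILE...]
# Print each name the first time it is seen, then sort the short result.
distinct_names_awk() {
    awk '!seen[$0]++' "$@" | sort
}

# Possible comparison (file is a placeholder name):
#   time sort -u file > /dev/null
#   time distinct_names file > /dev/null
```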
Hi All,
Below is the actual file that I would like to process with sort and uniq -u:
/opt/oracle/work/Antony/Shell_Script> cat emp.1st
2233|a.k. shukula |g.m. |sales |12/12/52 |6000
1006|chanchal singhvi |director |sales |03/09/38 |6700... (8 Replies)
Discussion started by: Antony Ankrose
sylseg-sk
sylseg-sk(1) USER COMMANDS sylseg-sk(1)
NAME
sylseg-sk - segments Slovak words into syllables
SYNOPSIS
sylseg-sk [--best] [--color] [--dl debug level] [--help] [--ofile <file_name>] [<input_file>]
DESCRIPTION
Syllabic segmentation is essential for some linguistic and speech recognition applications. Depending on the language, either a rule-based or
a statistical approach is used. For Slovak, the statistical approach seems to be more suitable.
sylseg-sk implements one of the statistical approaches to syllabic segmentation. Each input word is segmented into syllables. Several
possible segmentations are generated and sorted by likelihood. If no input file is specified, standard input is read.
If an input file is used, the output is also written to a file whose name is the input filename with the extension ".syllables".
The input and output code page is ISO 8859-2. To use a different code page, combine a code page converter with pipes. For example, to have
input and output in UTF-8, use (for interactive use): filterm UTF8-iso2 iso2-UTF8 sylseg-sk or (for batch processing)
iconv -f UTF-8 -t ISO_8859-2 | sylseg-sk | iconv -f ISO_8859-2 -t UTF-8
The performance of the syllabic segmentation depends on the statistics used. To improve the quality of the segmentation, it is possible to
train a better system with the sylseg-sk-training tool and replace the original file located in /usr/share/sylseg_sk/sylseg-sk.stats
The design of sylseg-sk is language independent. With retrained statistics it should theoretically work for any language.
OPTIONS
--best
Print the best result only.
--color
Enable color output.
--dl 1..5
Set the debug level, which controls the amount of information displayed. Debug level 0 displays nothing. The maximum level 5 displays
a full debugging report. The default debug level is 1.
--help
Display a short help text.
--ofile <file_name>
Also write the output to the given file.
EXAMPLES
Use standard input and debug level 3:
sylseg-sk --dl 3
Process all the words from file aaa.txt and print just the best segmentation:
sylseg-sk --best aaa.txt
EXIT STATUS
sylseg-sk returns zero if it succeeds in processing all the input words.
AUTHOR
Jozef Ivanecky (dodo (at) kanoistika.sk)
SEE ALSO
sylseg-sk-training(1), filterm(1), iconv(1), konwert(1)
version 0.5 December 1, 2006 sylseg-sk(1)