Sorting on length with identification of number of characters
Hello,
I am writing an open-source stemmer in Java for Indic languages which admit a large number of suffixes.
The Java stemmer requires that each suffix string be sorted as per its length and that all strings of the same length are arranged in a single group, sorted alphabetically. Moreover as a header I need to specify the numeric value of the string, say
Since the languages in question have over 300 and more suffixes, trying to sort on length and identifying the length of each string and counting it becomes a difficult issue.
An example will make this clear.
Input:
Expected output
Since handling such a large database is laborious, is it possible to write a script in AWK or PERL which would enable the above output.
Your help would go a long way in putting java-based stemmers in different languages in the open-source community.
Many thanks in advance for your kind help
Last edited by Scrutinizer; 01-23-2013 at 01:16 AM..
Reason: quote tags -> code tags
Hi,
I'm using AIX version 5.3 currently. I'm trying to create a user id, e.g. andyleong, which the system prompted the length is too long.
1. I would like to know is that the length of user id is limited to maximum 8 characters for AIX.
2. Is it apply to all versions of AIX?
If no... (2 Replies)
I have a fixed length file that need to be sorted according to the following rule
IF B=1 ORDER by A,B
Else ORDER by A,C
Input file is
ABC
131
112
122
231
212
222
Output needed
ABC
112
131
122
212
231
222 (1 Reply)
Hi guys, I'm new to this forum and I'm not a UNIX expert. I can't figure out this certain problem i'm having:
I need to sort some words, some of the words are annotations (enclosed within < and >). I need to have them sorted alphabetically with all non-alphanumeric characters up front. For... (2 Replies)
i need to write a bash script that recive a list of varuables
kaka pele ronaldo beckham zidane messi rivaldo gerrard platini
i need the program to print the longest word of the list.
word in the output appears on a separate line and word order in the output is in the order Llachsicografi costs.... (1 Reply)
Hello
I've question on the requirement I am working on.
We are getting a fixed length file with "33" characters long. We are processing that file loading into DB.
Now some times we are getting a file with "35" characters long. In this case I have to remove two characters (in 22,23... (14 Replies)
Hi,
I have a big file say abc.csv. And in that file, I need to find lines whose length is less than 50 characters. How can it be achieved? Thanks in advance.
Thanks (4 Replies)
Hello,
I have a very large file: a dictionary of headwords of around 40000 and would like to have the dictionary sorted by its length i.e. the largest string first and the smallest at the end.
I have hunted for a perl or awk script on the forum which can do the job but there is none available.
I... (8 Replies)
Hello,
I have a file which has the following structure
word space Frequency
The file is around 30,000 headwords each along with its frequency. The words have different lengths. What I need is a PERL or AWK script which can sort the file on length of the headword and once the file is sorted on... (12 Replies)
My question is basically as the title says. How can I check a user inputted string is only certain characters long (for example, 3 characters long) and how do I check a user inputted string only contains certain characters (for example, it should only contain the characters 'u', 'a', 'g', and 'c')... (4 Replies)
Discussion started by: Eric1
4 Replies
LEARN ABOUT SUSE
dacs.java
DACS.JAVA(7) DACS Miscellaneous Information DACS.JAVA(7)NAME
dacs.java - DACS Java support
DESCRIPTION
This document describes support for Java provided by DACS.
To enable Java support, DACS must be configured using the --enable-java flag. It may also be necessary to specify the location of the Java
commands and include files. See dacs.install(7)[1].
Note
Java support is currently very limited and should be considered experimental. It is currently only possible to call dacsauth(1)[2] and
dacscheck(1)[3] using the Java Native Interface (JNI) provided. The command line arguments to both of these commands are exactly as
documented in their manual pages. Each argument is passed as a String.
The code is not thread-safe and is probably not suitable for being called many times within a particular process due to assumptions it
makes about dynamic memory allocation.
There are no immediate plans to improve or extend Java support. Using the existing capability to build the JNI layer, however, it
should not be difficult for an experienced Java programmer to build upon the example Java code provided.
With Java support enabled, building DACS will copy the JNI shared library and a .jar file to the library directory (default:
/usr/local/dacs/lib), and the JNI include files to the include directory (default: /usr/local/dacs/include).
The tools/java/jni distribution directory contains a simple script called javarun that demonstrates how to run dacsauth or dacscheck as a
Java application. For example, to validate the Unix password for user somebody, run (as root):
./javarun dacsauth -m unix suff -user somebody -prompt
DIAGNOSTICS
The return codes are as documented in the command's manual page.
AUTHOR
Distributed Systems Software (www.dss.ca[4])
COPYING
Copyright2003-2012 Distributed Systems Software. See the LICENSE[5] file that accompanies the distribution for licensing information.
NOTES
1. dacs.install(7)
http://dacs.dss.ca/man/dacs.install.7.html
2. dacsauth(1)
http://dacs.dss.ca/man/dacsauth.1.html
3. dacscheck(1)
http://dacs.dss.ca/man/dacscheck.1.html
4. www.dss.ca
http://www.dss.ca
5. LICENSE
http://dacs.dss.ca/man/../misc/LICENSE
DACS 1.4.27b 10/22/2012 DACS.JAVA(7)