12-06-2012
Many thanks. Am out at present. Will run the perl script and get back to you.
10 More Discussions You Might Find Interesting
1. UNIX for Advanced & Expert Users
Hi - I tried to remove ^M in a delimited file using "tr -d "\r" and "sed 's/^M//g'", but it does not work quite well. While the ^M is removed, the format of the record is still cut in half, like
a,b, c
c,d,e
The delimited file is generated using sh script by outputing a SQL query result to... (7 Replies)
Discussion started by: sirahc
7 Replies
2. Shell Programming and Scripting
Hi Experts
I am very new to perl and need to make a script using perl.
I would like to remove blanks in a text tab delimited file in in a specfic column range ( colum 21 to column 43) sample input and output shown below :
Input:
117 102 650 652 654 656
117 93 95... (3 Replies)
Discussion started by: Faisal Riaz
3 Replies
3. Shell Programming and Scripting
Hey there - a bit of background on what I'm trying to accomplish, first off. I am trying to load the data from a pipe delimited file into a database. The loading tool that I use cannot handle embedded newline characters within a field, so I need to scrub them out.
Solutions that I have tried... (7 Replies)
Discussion started by: bbetteridge
7 Replies
4. Shell Programming and Scripting
I have a large flat file with variable length fields that are pipe delimited. The file has no new line or CR/LF characters to indicate a new record. I need to parse the file and after some number of fields, I need to insert a CR/LF to start the next record.
Input file ... (2 Replies)
Discussion started by: clintrpeterson
2 Replies
5. Shell Programming and Scripting
Hi All
I wanted to know how to effectively delete some columns in a large tab delimited file.
I have a file that contains 5 columns and almost 100,000 rows
3456 f g t t
3456 g h
456 f h
4567 f g h z
345 f g
567 h j k lThis is a very large data file and tab delimited.
I need... (2 Replies)
Discussion started by: Lucky Ali
2 Replies
6. Shell Programming and Scripting
Since there are approximately 75K gsfiles and hundreds of stfiles per gsfile, this script can take hours. How can I rewrite this script, so that it's much faster? I'm not as familiar with perl but I'm open to all suggestions.
ls file.list>$split
for gsfile in `cat $split`;
do
csplit... (17 Replies)
Discussion started by: verge
17 Replies
7. Shell Programming and Scripting
Hi,
I have the following command in place
nawk -F, '!a++' file > file.uniq
It has been working perfectly as per requirements, by removing duplicates by taking into consideration only first 3 fields. Recently it has started giving below error:
bash-3.2$ nawk -F, '!a++'... (17 Replies)
Discussion started by: makn
17 Replies
8. Shell Programming and Scripting
I am working on a homonym dictionary of names i.e. names which are clustered together according to their “sound-alike” pronunciation:
An example will make this clear:
Since the dictionary is manually constructed it often happens that inadvertently two sets of “homonyms” which should be grouped... (2 Replies)
Discussion started by: gimley
2 Replies
9. UNIX for Advanced & Expert Users
I have a file size is around 24 G with 14 columns, delimiter with "|"
My requirement- can anyone provide me the fastest and best to get the below results
Number of records of the file
First column and second Column- Unique counts
Thanks for your time
Karti
------ Post updated at... (3 Replies)
Discussion started by: kartikirans
3 Replies
10. Shell Programming and Scripting
I have a large file 1.5 gb and want to sort the file.
I used the following AWK script to do the job
!x++
The script works but it is very slow and takes over an hour to do the job. I suspect this is because the file is not sorted.
Any solution to speed up the AWk script or a Perl script would... (4 Replies)
Discussion started by: gimley
4 Replies
LEARN ABOUT DEBIAN
cwdreg
CWDREG(1) General Commands Manual CWDREG(1)
NAME
cwdreg - To register characters/words into the binary format
dictionary.
SYNOPSIS
cwdreg [-D server ] -n envname
-d dicno < textdic
OR
cwdreg [-D server ] -n envname
-L filename < textdic
DEFAULT PATH
/usr/local/bin/cWnn4/cwdreg
DESCRIPTION
This function allows user to register characters/words into the specified binary dictionary, with either dictionary number dicno or dictio-
nary filename filename specified.
server is the machine name of the server. If this is not specified, the default cserver indicated by the environment variable CSERVER will
be taken.
"-n envname " must be specified. envname is the environment name. You may execute "cwnnstat -E" to see the current environment name.
Either "-d dicno " or "-L filename " must be specified.
dicno is the dictionary number. filename is the filename of the dictionary. "-L" is used for when the dictionary is from the local
machine.
"<" means to pipe the textdic as an input to "cwdreg" command.
textdic is the text file which user enters the characters/words to be registered. The format of this text file must be the same as that
in the system text format dictionary. That is,
--------------------------------------------------
| Pinyin Word Cixing Frequency |
| : : : : |
--------------------------------------------------
Refer to cWnn manual for details on dictionary.
By using "cwdreg", all the characters/words in textdic will be registered into the specified binary dictionary permanently.
NOTE
1. The parts in [ ] are options. They may be omitted.
13 May 1992 CWDREG(1)