Sorry Rudic,
When I tried your code, it gives output like this
That is not surprising. In post #1 in this thread you said you had CSV input files, and your example used <tab> characters to separate the values. The code RudiC provided explicitly specified the <tab> character as the field separator.
In this post, however, there are no <tab> characters; only sequences of <space>s. And, since the number of spaces between fields is not a constant, we can't say that your field separator is a sequence of 8 <space>s or of 9 <space>s.
If you don't accurately describe your input file format, it is hard to guess at what might work with whatever random data format you decide to use when you run code that was designed to use the input format you originally specified.
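To make the mismatch concrete, here is a minimal sketch. The sample lines and the two helper function names are made up for illustration; they are not the data or the code from this thread. It only shows how many fields awk sees when <tab> is the field separator compared with awk's default separator, which treats any run of <space>s and <tab>s as a single delimiter.

```shell
# Hypothetical helpers and sample data, for illustration only.

# Count the fields awk sees when <tab> is the field separator.
count_tab_fields() {
    printf '%s\n' "$1" | awk -F'\t' '{ print NF }'
}

# Count the fields awk sees with the default separator
# (any run of <space> and/or <tab> characters).
count_blank_fields() {
    printf '%s\n' "$1" | awk '{ print NF }'
}

tab_line=$(printf 'a\tb\tc')        # three values separated by single <tab>s
space_line='a        b         c'   # same values, uneven runs of <space>s

count_tab_fields "$tab_line"        # three fields
count_tab_fields "$space_line"      # one field: no <tab>, so the whole line is $1
count_blank_fields "$space_line"    # three fields
```

Note that the default separator is only a safe choice when no value contains a <space> itself; if values can contain spaces, the input needs an unambiguous delimiter such as <tab>.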