Split rows Post: 302346293

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

split rows

Hi I wanted to split rows based on the number of 1's present in 21st field(21st field is 40 length field) so I wrote the below awk code. However, the tool that I am using to invoke the command is not recognising the command. So, could you please help me to translate this command to sed? awk...

2. Shell Programming and Scripting

Deleting specific rows in large files having rows greater than 100000

Hi Guys, I need help in modifying a large text file containing more than 1-2 lakh rows of data using unix commands. I am quite new to the unix language the text file contains data in a pipe delimited format sdfsdfs sdfsdfsd START_ROW sdfsd|sdfsdfsd|sdfsdfasdf|sdfsadf|sdfasdf...

3. Shell Programming and Scripting

Split single rows to multiple rows ..

Hi pls help me out to short out this problem rm PAB113_011.out rm: PAB113_011.out: override protection 644 (yes/no)? n If i give y it remove the file. But i added the rm command as a part of ksh file and i tried to remove the file. Its not removing and the the file prompting as...

4. Shell Programming and Scripting

split paste them in rows

Hi, I have a file as ABC 123_456_789 234_678_901 XYZ 1100_1250_1580_1680 1175_1440_1620_1890 so on What I want my output file to look is "split by underscore and then place the contents in rows" output ABC 123 234 ABC 456 678 ABC 789 901 XYZ 1100 1175 XYZ 1250 1440...

5. Shell Programming and Scripting

MySql split rows

Dear community, I have to split string in table and list all values. I'll skip the code and jump directly to mysql query. This is the table: category title ======= ======= 7,3 title 1 1,3 title 2 1,2,3 title 3 Now, what I need is split category into single...

6. Shell Programming and Scripting

awk split columns after matching on rows and summing the last column

input: chr1 1 2 3 chr1 1 2 4 chr1 2 4 5 chr2 3 6 9 chr2 3 6 10 Code: awk '{a+=$4}END{for (i in a) print i,a}' input Output: chr112 7 chr236 19 chr124 5 Desired output: chr1 1 2 7 chr2 3 6 19 chr1 2 4 5

7. Shell Programming and Scripting

Split File based on number of rows

Hi I have a requirement, where i will receive multiple files in a folder (say: /fol1/fol2/). There will be at least 14 to 16 files. The size of the files will different, some may be 80GB or 90GB, some may be less than 5 GB (and the size of the files are very unpredictable). But the names of the...

8. Shell Programming and Scripting

Split columns into rows

Any one can help me in converting columns into rows. example I have input file 10000| 10002| 10003| 10004| 10005| I want output in below format PARTY|PART_DT 10000|12080000000 10002|13075200000 10003|13939200000 10004|1347200000 10004|133600000 10004|1152000000

9. UNIX for Beginners Questions & Answers

Split column into rows

Hi, I have input dataset as below: Cl.jenn,1051,ABCD JEN.HEA,9740|1517|8119|2145,ZZZZ,REPEAT Rich.Sm, Ann.Car,3972|4051|1064|4323|4122|2394|2574|4507 Sta.for,7777,ABCD,UUUU Sm.Ric, Ch.LRD, Eh.ab, Gr.sh, Expected output: ------------------- Cl.jenn,1051,ABCD...

10. UNIX for Beginners Questions & Answers

How to split one long column into multiple rows with 3 each ?

I have a large csv dataset like this : A value1 A value2 A value3 B value1 B value2 B value3 C value1 C value2 C value3 what I expected output is :A value1 value2 value3 B value1 value2 value3 C value1 value2 value3 I'm thinking of use like awk, columns , but haven't find a proper...


Encode::Guess(3pm)					User Contributed Perl Documentation					Encode::Guess(3pm)

NAME

       Encode::Guess -- Guesses encoding from data

SYNOPSIS

	 # if you are sure $data won't contain anything bogus

	 use Encode;
	 use Encode::Guess qw/euc-jp shiftjis 7bit-jis/;
	 my $utf8 = decode("Guess", $data);
	 my $data = encode("Guess", $utf8);   # this doesn't work!

	 # more elaborate way
	 use Encode::Guess;
	 my $enc = guess_encoding($data, qw/euc-jp shiftjis 7bit-jis/);
	 ref($enc) or die "Can't guess: $enc"; # trap error this way
	 $utf8 = $enc->decode($data);
	 # or
	 $utf8 = decode($enc->name, $data);

ABSTRACT

       Encode::Guess lets you guess which encoding a given piece of data is in, or at least tries to.

DESCRIPTION

       By default, it checks only ascii, utf8 and UTF-16/32 with BOM.

	 use Encode::Guess; # ascii/utf8/BOMed UTF

       To make it more useful in practice, you have to give the names of the encodings to check (the "suspects", as shown below).  Suspect names
       can be either canonical names or aliases.

       CAVEAT: Unlike UTF-(16|32), BOM in utf8 is NOT AUTOMATICALLY STRIPPED.

	 # tries all major Japanese encodings as well
	 use Encode::Guess qw/euc-jp shiftjis 7bit-jis/;

       If the $Encode::Guess::NoUTFAutoGuess variable is set to a true value, no heuristics will be applied to UTF8/16/32, and the result will be
       limited to the suspects and "ascii".
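
       The default behaviour described above can be sketched as follows.  This is a minimal illustration rather than part of the original
       documentation: the sample strings and variable names are made up for the example, and it assumes no extra suspects have been
       registered.

```perl
use strict;
use warnings;
use Encode qw(encode);
use Encode::Guess;    # default suspects only: ascii, utf8, BOMed UTF-16/32

# Hypothetical sample data, chosen only for illustration.
my $ascii_bytes = "plain text";
my $utf8_bytes  = encode('UTF-8', "\x{3042}\x{3044}");    # two hiragana as UTF-8 octets
my $bom_bytes   = "\xEF\xBB\xBFabc";                      # UTF-8 BOM followed by "abc"

my $ascii_enc = guess_encoding($ascii_bytes);
my $utf8_enc  = guess_encoding($utf8_bytes);
my $bom_enc   = guess_encoding($bom_bytes);

# On failure guess_encoding returns an error string, not an object.
for my $enc ($ascii_enc, $utf8_enc, $bom_enc) {
    ref($enc) or die "Can't guess: $enc";
}

print $ascii_enc->name, "\n";    # ascii
print $utf8_enc->name,  "\n";    # utf8

# The CAVEAT in practice: the BOM survives decoding as U+FEFF.
my $with_bom = $bom_enc->decode($bom_bytes);
printf "first code point: U+%04X, length: %d\n",
    ord(substr($with_bom, 0, 1)), length($with_bom);
```

       If the leading U+FEFF is unwanted, the caller has to remove it after decoding, for example with s/^\x{FEFF}//.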

       Encode::Guess->set_suspects
	   You can also change the internal suspects list via the "set_suspects" method.

	     use Encode::Guess;
	     Encode::Guess->set_suspects(qw/euc-jp shiftjis 7bit-jis/);

       Encode::Guess->add_suspects
	   Or you can use the "add_suspects" method.  The difference is that "set_suspects" flushes the current suspects list, while
	   "add_suspects" appends to it.

	     use Encode::Guess;
	     Encode::Guess->add_suspects(qw/euc-jp shiftjis 7bit-jis/);
	     # now the suspects are euc-jp,shiftjis,7bit-jis, AND
	     # euc-kr,euc-cn, and big5-eten
	     Encode::Guess->add_suspects(qw/euc-kr euc-cn big5-eten/);

       Encode::decode("Guess" ...)
	   When you are content with the suspects list, you can now decode:

	     my $utf8 = Encode::decode("Guess", $data);

       Encode::Guess->guess($data)
	   But decode("Guess", ...) will croak if:

	   o   Two or more suspects remain

	   o   No suspects are left

	   So you should try this instead:

	     my $decoder = Encode::Guess->guess($data);

	   On success, $decoder is an object that is documented in Encode::Encoding, so you can now do this:

	     my $utf8 = $decoder->decode($data);

	   On failure, $decoder contains an error message instead, so the whole thing would look like this:

	     my $decoder = Encode::Guess->guess($data);
	     die $decoder unless ref($decoder);
	     my $utf8 = $decoder->decode($data);

       guess_encoding($data [, list of suspects])
	   You can also use the "guess_encoding" function, which is exported by default.  It takes the $data to check and, optionally, a list
	   of suspects.  The optional suspect list is not added to the internal suspects list.

	     my $decoder = guess_encoding($data, qw/euc-jp euc-kr euc-cn/);
	     die $decoder unless ref($decoder);
	     my $utf8 = $decoder->decode($data);
	     # check only ascii, utf8 and UTF-(16|32) with BOM
	     $decoder = guess_encoding($data);
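
       As a worked illustration of the calls above, here is a small round trip.  It is only a sketch: the sample text and variable names
       are invented for the example, and it assumes the Japanese encodings shipped with Encode are installed.  It traps the error string,
       then shows that suspects passed to "guess_encoding" apply to that call only.

```perl
use strict;
use warnings;
use Encode qw(encode);
use Encode::Guess;

# Hypothetical sample: three kanji, encoded as EUC-JP octets.
my $text  = "\x{65e5}\x{672c}\x{8a9e}";
my $bytes = encode('euc-jp', $text);

# Suspects passed here are used for this call only.
my $decoder = guess_encoding($bytes, qw/euc-jp shiftjis 7bit-jis/);
die "Can't guess: $decoder" unless ref($decoder);    # a plain string means failure

my $decoded = $decoder->decode($bytes);
print "guessed: ", $decoder->name, "\n";
print "round trip ok\n" if $decoded eq $text;

# The internal suspects list is unchanged, so the default check
# (ascii, utf8, BOMed UTF-16/32) cannot identify the same octets.
my $default = guess_encoding($bytes);
print "default guess failed: $default\n" unless ref($default);
```

       Checking ref() before calling any method matters because the failure value is a message, not an object.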

CAVEATS

       o   Because of the algorithm used, the ISO-8859 series and other single-byte encodings do not work well unless a single ISO-8859
	   encoding is the only suspect (besides ascii and utf8).

	     use Encode::Guess;
	     # perhaps ok
	     my $decoder = guess_encoding($data, 'latin1');
	     # definitely NOT ok
	     $decoder = guess_encoding($data, qw/latin1 greek/);

	   The reason is that Encode::Guess guesses the encoding by trial and error.  It first splits $data into lines and tries to decode
	   each line with each suspect.  It keeps going until all but one encoding has been eliminated from the suspects list.  The ISO-8859
	   series is just too successful in most cases (because it fills almost all code points in \x00-\xff).

       o   Do not mix national standard encodings and the corresponding vendor encodings.

	     # a very bad idea
	     my $decoder
		= guess_encoding($data, qw/shiftjis MacJapanese cp932/);

	   The reason is that a vendor encoding is usually a superset of the national standard, so the guess becomes too ambiguous in most cases.

       o   On the other hand, mixing various national standard encodings automagically works unless $data is too short to allow for guessing.

	    # This is ok if $data is long enough
	    my $decoder =
	     guess_encoding($data, qw/euc-cn
				      euc-jp shiftjis 7bit-jis
				      euc-kr
				      big5-eten/);

       o   DO NOT PUT TOO MANY SUSPECTS!  Don't you try something like this!

	     my $decoder = guess_encoding($data,
					  Encode->encodings(":all"));

       It is, after all, just a guess.  You should always be explicit when it comes to encodings.  But there are some environments,
       especially Japanese ones, where guessing the encoding is a must.  Use this module with care.
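
       The first caveat can be seen with a short experiment.  This is a sketch under stated assumptions: the sample octets are a made-up
       Latin-1 phrase, and the variable names are hypothetical.

```perl
use strict;
use warnings;
use Encode::Guess;

# Hypothetical sample: "cafe au lait" with byte 0xE9 standing in for e-acute.
my $bytes = "caf\xe9 au lait";

# One ISO-8859 suspect: ascii and utf8 both fail on 0xE9, so one candidate remains.
my $single = guess_encoding($bytes, 'latin1');
print ref($single) ? "guessed: " . $single->name . "\n" : "failed: $single\n";

# Two ISO-8859 suspects: both decode every byte, so the guess stays ambiguous
# and an error string comes back instead of an encoding object.
my $double = guess_encoding($bytes, qw/latin1 greek/);
print ref($double) ? "guessed: " . $double->name . "\n" : "failed: $double\n";
```

       The second call fails not because the data is invalid but because nothing in it can rule either suspect out.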

TO DO

       Encode::Guess does not work on EBCDIC platforms.

SEE ALSO

       Encode, Encode::Encoding

perl v5.14.2							    2011-08-09							Encode::Guess(3pm)
