using sed to get rid of duplicated columns... Post: 302184154

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

remove duplicated columns

hi all, i have a file contain multicolumns, this file is sorted by col2 and col3. i want to remove the duplicated columns if the col2 and col3 are the same in another line. example fileA AA BB CC DD CC XX CC DD BB CC ZZ FF DD FF HH HH the output is AA BB CC DD BB CC ZZ FF...

2. Shell Programming and Scripting

Help removing lines with duplicated columns

Hi Guys... Please Could you help me with the following ? aaaa bbbb cccc sdsd aaaa bbbb cccc qwer as you can see, the 2 lines are matched in three fields... how can I delete this pupicate ? I mean to delete the second one if 3 fields were duplicated ? Thanks

3. Shell Programming and Scripting

How to get rid of double quote in sed.

Hi, i am using sed command to grep just a valuable data for my report generating. Thanks to the person who assists me on before thread. the problem that i encounter now is when i executed below command The output will give me like below output in between the data, there is a double quote. How...

4. Shell Programming and Scripting

get rid of xml comment by grep or sed

Hi, I would like to get rid of all comment in an xml file by grep or sed command: The content seem like this:  Anyone can help? Thanks and Regards

5. UNIX for Dummies Questions & Answers

Getting rid of selected columns

Hi All, I've got a file like this: a 1 0 0 0 1 0 0 1 1 3 3 1 4 4 4 b 1 0 0 0 1 4 4 1 3 1 1 4 4 2 2 c 1 0 0 0 2 0 0 3 3 1 3 1 1 2 4 d 1 0 0 0 2 0 0 1 1 0 0 4 4 2 4 The file has ~4200 entries. I need to exclude those columns that are zeros for all those rows that have 2 in column 6. For...

6. Shell Programming and Scripting

Manipulate columns using sed

Hello, I would like to remove the first column of lines beginning by a character (in my case is an open square bracket) and finishing by a space (or any other delimiter). For example: string1 string2 string3 to string2 string3 I found this previous topic: ...

7. UNIX for Dummies Questions & Answers

Find duplicated values in two columns out of three

hi! could u help in the following? I have the data (long list!) that looks like (three coumns white space separated): rs3094315 0.0665173 742429 rs12562034 0.0738998 758311 rs3934834 0.396449 995669 rs9442372 0.402693 1008567 rs3737728 0.406271 1011278 rs6687776 0.435429 1020428 rs9651273...

8. Shell Programming and Scripting

sed to get rid of unwanted characters

so i have strings such as this: 'postfix/local#2,5#|CRON.*12062.*root.*CMD#2,5#|roice.*NQN1#1,2#|toysprc#1,4#' i need to get rid of the "#" and the numbers between them for each of the strings above. so the desired output should be: ...

9. UNIX for Dummies Questions & Answers

sed for all columns

Hi, I would like to know how can I use sed in all columns of a file tab separated. Example of input file: 0/0:0:1,0,0 0/2:0:0,2,0 Desired output file: 1,0 0,2

10. Shell Programming and Scripting

Deleting duplicated chunks in a file using awk/sed

Hi all, I'd always appreciate all helps from this site. I would like to delete duplicated chunks of strings on the same row(?). One chunk is comprised of four lines such as: path name starting point ending point voltage number I would like to delete duplicated chunks on the same...

LEARN ABOUT DEBIAN

sort::fields

Fields(3pm)						User Contributed Perl Documentation					       Fields(3pm)

NAME

       Sort::Fields - Sort lines containing delimited fields

SYNOPSIS

	 use Sort::Fields;
	 @sorted = fieldsort [3, '2n'], @lines;
	 @sorted = fieldsort '+', [-1, -3, 0], @lines;

	 $sort_3_2n = make_fieldsort [3, '2n'], @lines;
	 @sorted = $sort_3_2n->(@lines);

DESCRIPTION

       Sort::Fields provides a general purpose technique for efficiently sorting lists of lines that contain data separated into fields.

       Sort::Fields automatically imports two subroutines, "fieldsort" and "make_fieldsort", and two variants, "stable_fieldsort" and "make_sta-
       ble_fieldsort".	"make_fieldsort" generates a sorting subroutine and returns a reference to it.	"fieldsort" is a wrapper for the
       "make_fieldsort" subroutine.

       The first argument to make_fieldsort is a delimiter string, which is used as a regular expression argument for a "split" operator.  The
       delimiter string is optional.  If it is not supplied, make_fieldsort splits each line using "/s+/".

       The second argument is an array reference containing one or more field specifiers.  The specifiers indicate what fields in the strings will
       be used to sort the data.  The specifier "1" indicates the first field, "2" indicates the second, and so on.  A negative specifier like
       "-2" means to sort on the second field in reverse (descending) order.  To indicate a numeric rather than alphabetic comparison, append "n"
       to the specifier.  A specifier of "0" means the entire string ("-0" means the entire string, in reverse order).

       The order in which the specifiers appear is the order in which they will be used to sort the data.  The primary key is first, the secondary
       key is second, and so on.

       "fieldsort [1, 2], @data" is roughly equivalent to "make_fieldsort([1, 2])->(@data)".  Avoid calling fieldsort repeatedly with the same
       sort specifiers.  If you need to use a particular sort more than once, it is more efficient to call "make_fieldsort" once and reuse the
       subroutine it returns.

       "stable_fieldsort" and "make_stable_fieldsort" are like their "unstable" counterparts, except that the items that compare the same are
       maintained in their original order.

EXAMPLES

       Some sample data (in array @data):

	 123   asd   1.22   asdd
	 32    ewq   2.32   asdd
	 43    rewq  2.12   ewet
	 51    erwt  34.2   ewet
	 23    erww  4.21   ewet
	 91    fdgs  3.43   ewet
	 123   refs  3.22   asdd
	 123   refs  4.32   asdd

	 # alpha sort on column 1
	 print fieldsort [1], @data;

	 123   asd   1.22   asdd
	 123   refs  3.22   asdd
	 123   refs  4.32   asdd
	 23    erww  4.21   ewet
	 32    ewq   2.32   asdd
	 43    rewq  2.12   ewet
	 51    erwt  34.2   ewet
	 91    fdgs  3.43   ewet

	 # numeric sort on column 1
	 print fieldsort ['1n'], @data;

	 23    erww  4.21   ewet
	 32    ewq   2.32   asdd
	 43    rewq  2.12   ewet
	 51    erwt  34.2   ewet
	 91    fdgs  3.43   ewet
	 123   asd   1.22   asdd
	 123   refs  3.22   asdd
	 123   refs  4.32   asdd

	 # reverse numeric sort on column 1
	 print fieldsort ['-1n'], @data;

	 123   asd   1.22   asdd
	 123   refs  3.22   asdd
	 123   refs  4.32   asdd
	 91    fdgs  3.43   ewet
	 51    erwt  34.2   ewet
	 43    rewq  2.12   ewet
	 32    ewq   2.32   asdd
	 23    erww  4.21   ewet

	 # alpha sort on column 2, then alpha on entire line
	 print fieldsort [2, 0], @data;

	 123   asd   1.22   asdd
	 51    erwt  34.2   ewet
	 23    erww  4.21   ewet
	 32    ewq   2.32   asdd
	 91    fdgs  3.43   ewet
	 123   refs  3.22   asdd
	 123   refs  4.32   asdd
	 43    rewq  2.12   ewet

	 # alpha sort on column 4, then numeric on column 1, then reverse
	 # numeric on column 3
	 print fieldsort [4, '1n', '-3n'], @data;

	 32    ewq   2.32   asdd
	 123   refs  4.32   asdd
	 123   refs  3.22   asdd
	 123   asd   1.22   asdd
	 23    erww  4.21   ewet
	 43    rewq  2.12   ewet
	 51    erwt  34.2   ewet
	 91    fdgs  3.43   ewet

	 # now, splitting on either literal period or whitespace
	 # sort numeric on column 4 (fractional part of decimals) then
	 # numeric on column 3 (whole part of decimals)
	 print fieldsort '(?:.|s+)', ['4n', '3n'], @data;

	 51    erwt  34.2   ewet
	 43    rewq  2.12   ewet
	 23    erww  4.21   ewet
	 123   asd   1.22   asdd
	 123   refs  3.22   asdd
	 32    ewq   2.32   asdd
	 123   refs  4.32   asdd
	 91    fdgs  3.43   ewet

	 # alpha sort on column 4, then numeric on the entire line
	 # NOTE: produces warnings under -w
	 print fieldsort [4, '0n'], @data;

	 32    ewq   2.32   asdd
	 123   asd   1.22   asdd
	 123   refs  3.22   asdd
	 123   refs  4.32   asdd
	 23    erww  4.21   ewet
	 43    rewq  2.12   ewet
	 51    erwt  34.2   ewet
	 91    fdgs  3.43   ewet

	 # stable alpha sort on column 4 (maintains original relative order
	 # among items that compare the same)
	 print stable_fieldsort [4], @data;

	 123   asd   1.22   asdd
	 32    ewq   2.32   asdd
	 123   refs  3.22   asdd
	 123   refs  4.32   asdd
	 43    rewq  2.12   ewet
	 51    erwt  34.2   ewet
	 23    erww  4.21   ewet
	 91    fdgs  3.43   ewet

BUGS

       Some rudimentary tests now.

       Perhaps something should be done to catch things like:

	 fieldsort '.', [1, 2], @lines;

       '.' translates to "split /./" -- probably not what you want.

       Passing blank lines and/or lines containing the wrong kind of data (alphas instead of numbers) can result in copious warning messages under
       "-w".

       If the regexp contains memory parentheses ("(...)" rather than "(?:...)"), split will function in "delimiter retention" mode, capturing the
       contents of the parentheses as well as the stuff between the delimiters.  I could imagine how this could be useful, but on the other hand I
       could also imagine how it could be confusing if encountered unexpectedly.  Caveat sortor.

       Not really a bug, but if you are planning to sort a large text file, consider using sort(1).  Unless, of course, your operating system
       doesn't have sort(1).

AUTHOR

       Joseph N. Hall, joseph@5sigma.com

SEE ALSO

       perl(1).

perl v5.8.8							    2008-03-25							       Fields(3pm)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

remove duplicated columns

Discussion started by: kamel.seg

2. Shell Programming and Scripting

Help removing lines with duplicated columns

Discussion started by: yahyaaa

3. Shell Programming and Scripting

How to get rid of double quote in sed.

Discussion started by: anakiar

4. Shell Programming and Scripting

get rid of xml comment by grep or sed

Discussion started by: RonLii