help to identify duplicate columns adjacent value Post: 302512929

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Identify duplicate words in a line using command

Hi, Let me explain the problem clearly: Let the entries in my file be: lion,tiger,bear apple,mango,orange,apple,grape unix,windows,solaris,windows,linux red,blue,green,yellow orange,maroon,pink,violet,orange,pink Can we detect the lines in which one of the words(separated by field...

2. Shell Programming and Scripting

how to identify duplicate columns in a row

Hi, How to identify duplicate columns in a row? Input data: may have 30 columns 9211480750 LK 120070417 920091030 9211480893 AZ 120070607 9205323621 O7 120090914 120090914 1420090914 2020090914 2020090914 9211479568 AZ 120070327 320090730 9211479571 MM 120070326 9211480892 MM 120070324...

3. UNIX for Dummies Questions & Answers

Duplicate columns and lines

Hi all, I have a tab-delimited file and want to remove identical lines, i.e. all of line 1,2,4 because the columns are the same as the columns in other lines. Any input is appreciated. abc gi4597 9997 cgcgtgcg $%^&*()()* abc gi4597 9997 cgcgtgcg $%^&*()()* ttt ...

4. Shell Programming and Scripting

How to calculate the difference between two adjacent columns?

Dear All, I need to find the difference between two adjacent columns. The file is having 'i' columns and i need to find the difference between two adjacent columns (like $1 difference $2; $2 difference $3; .... and $(i-1) difference $i). I have used the following coding awk '{ for (i=1; i<NF;...

5. Shell Programming and Scripting

Remove Duplicate by considering multiple columns

hi friends, my input chr1 exon 35204 35266 gene_id "GOLGB1"; transcript_id "GOLGB1"; chr1 exon 42357 42473 gene_id "GOLGB1"; transcript_id "GOLGB1"; chr1 exon 45261 45404 gene_id "GOLGB1"; transcript_id "GOLGB1"; chr1 exon 50701 50778 gene_id "GOLGB1"; transcript_id "GOLGB1";...

6. Shell Programming and Scripting

Check to identify duplicate values at first column in csv file

Hello experts, I have a requirement where I have to implement two checks on a csv file: 1. Check to see if the value in first column is duplicate, if any value is duplicate script should exit. 2. Check to verify if the value at second column is between "yes" or "no", if it is anything else...

7. Shell Programming and Scripting

Identify max value in diff columns for same row

Hi, I have a file with 1M records ABC 200 400 2.4 5.6 ABC 410 299 12 1.5 XYZ 4 5 6 7 MNO 22 40 30 70 MNO 47 55 80 150 What I want is for all the rows it should take the max value where there are duplicates output ABC 410 400 12 5.6 XYZ 4 5 6 7 MNO 47 55 80 150 How can i...

8. Shell Programming and Scripting

Count duplicate lines ignoring certain columns

I have this structure: col1 col2 col3 col4 col5 27 xxx 38 aaa ttt 2 xxx 38 aaa yyy 1 xxx 38 aaa yyy I need to collapse duplicate lines ignoring column 1 and add values of duplicate lines (col1) so it will look like this: col1 col2 col3 col4 col5 27 xxx 38 aaa ttt ...

9. Shell Programming and Scripting

Remove columns with duplicate entries

I have a 13gb file. It has the following columns: The 3rd column is basically correlation values. I want to delete those rows which are repeated between the columns: A B 0.04 B C 0.56 B B 1 A A 1 C D 1 C C 1 Desired Output: (preferably in a .csv format A,B,0.04 B,C,0.56 C,D,1...

10. Shell Programming and Scripting

Identify duplicate values at first column in csv file

Input 1,ABCD,no 2,system,yes 3,ABCD,yes 4,XYZ,no 5,XYZ,yes 6,pc,no Code used to find duplicate with regard to 2nd column awk 'NR == 1 {p=$2; next} p == $2 { print "Line" NR "$2 is duplicated"} {p=$2}' FS="," ./input.csv Now is there a wise way to de-duplicate the entire line (remove...

column

COLUMN(1)						    BSD General Commands Manual 						 COLUMN(1)

NAME

     column -- columnate lists

SYNOPSIS

     column [-ntx] [-c columns] [-s sep] [file ...]

DESCRIPTION

     The column utility formats its input into multiple columns.  Rows are filled before columns.  Input is taken from file operands, or, by
     default, from the standard input.	Empty lines are ignored.

     The options are as follows:

     -c      Output is formatted for a display columns wide, where columns is the width given as the argument to -c.

     -s      Specify a set of characters to be used to delimit columns for the -t option.

     -t      Determine the number of columns the input contains and create a table.  Columns are delimited with whitespace, by default, or with
	     the characters supplied using the -s option.  Useful for pretty-printing displays.

     -x      Fill columns before filling rows.

     -n      By default, the column command will merge multiple adjacent delimiters into a single delimiter when using the -t option; this option
	     disables that behavior. This option is a Debian GNU/Linux extension.

ENVIRONMENT

     The COLUMNS, LANG, LC_ALL and LC_CTYPE environment variables affect the execution of column as described in environ(7).

EXIT STATUS

     The column utility exits 0 on success, and >0 if an error occurs.

EXAMPLES

	   (printf "PERM LINKS OWNER GROUP SIZE MONTH DAY " ;
	   printf "HH:MM/YEAR NAME\n" ;
	   ls -l | sed 1d) | column -t
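
     As a further illustration of the -t and -s options described above, the following is a minimal sketch that turns comma-separated input into an aligned table. The sample records and the variable names (records, table) are hypothetical and serve only as an example; exact padding may differ between column implementations.

```shell
# Hypothetical sample data: three comma-separated records.
records='name,shell,uid
root,/bin/sh,0
alice,/bin/bash,1000'

# -s ',' names the delimiter character; -t builds an aligned table from it.
table=$(printf '%s\n' "$records" | column -t -s ',')

# Show the result: one row per input line, fields padded to a common width.
printf '%s\n' "$table"
```

     Without -s, the same command would split on whitespace only and leave each comma-separated line as a single column.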

SEE ALSO

     colrm(1), ls(1), paste(1), sort(1)

HISTORY

     The column command appeared in 4.3BSD-Reno.

BUGS

     Input lines are limited to LINE_MAX (2048) bytes in length.

BSD
								   July 29, 2004							       BSD

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Identify duplicate words in a line using command

Discussion started by: srinivasan_85

2. Shell Programming and Scripting

how to identify duplicate columns in a row

Discussion started by: suresh3566

3. UNIX for Dummies Questions & Answers

Duplicate columns and lines

Discussion started by: dr_sabz

4. Shell Programming and Scripting

How to calculate the difference between two adjacent columns?

Discussion started by: Fredrick