awk solution to duplicate lines based on column Post: 302862731

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

duplicate row based on single column

I am a newbie to shell scripting. I have a .csv file with about 1000 rows and about 7 columns, but before I insert this data into a table I have to parse and clean it based on the value of the first column, which is a phone-number-type string. Example below: column 1 ...

2. Shell Programming and Scripting

AWK Duplicate lines multiple times based on a calculated value

Hi, I'm trying to create an XML sitemap of our dynamic ecommerce site's SEO-friendly URLs and am trying to create the initial page listing. I have a CSV file that looks like the following and need to duplicate the lines based on a value which needs calculating. ...

3. Shell Programming and Scripting

awk print non matching lines based on column

My question was not answered in the previous thread, as the code given did not work. I want to print records from file2 by comparing columns 1 and 16 in both files and finding rows where column 16 in file 1 does not match column 16 in file 2. Here is the code that was given for the issue: ~/unix.com$ cat f1...

4. Shell Programming and Scripting

Perl: filtering lines based on duplicate values in a column

Hi I have a file like this. I need to eliminate lines with first column having the same value 10 times. 13 18 1 + chromosome 1, 122638287 AGAGTATGGTCGCGGTTG 13 18 1 + chromosome 1, 128904080 AGAGTATGGTCGCGGTTG 13 18 1 - chromosome 14, 13627938 CAACCGCGACCATACTCT 13 18 1 + chromosome 1,...

5. UNIX for Dummies Questions & Answers

awk to sum column field from duplicate row/lines

Hello, I am new to the Linux environment. I am working on a Linux script that should send an automatic email based on a specific condition in a log file. Below is the sample log file: Name m/c usage abc xxx 10 abc xxx 20 abc xxx 5 xyz ...

6. Shell Programming and Scripting

awk to sum a column based on duplicate strings in another column and show split totals

Hi, I have a similar input format: A_1 2 B_0 4 A_1 1 B_2 5 A_4 1, and I am looking to print it in this output format with headers. Can you suggest a way in awk? I ask for awk because I am already using awk to do some pattern matching from the parent file to print column 1 of my input. Thanks! letter number_of_letters...

7. Shell Programming and Scripting

Remove duplicate rows based on one column

Dear members, I need to filter a file based on the 8th column (the id); the other columns do not matter. I want just one line per id, removing the duplicate lines based on this id (8th column), and it does not matter which duplicate is removed. Example of my file...

8. Shell Programming and Scripting

Removing duplicate lines on first column based with pipe delimiter

Hi, I have tried to remove duplicate lines based on the first column with a pipe delimiter, but I am not able to get the unique lines. Command: sort -t'|' -nuk1 file.txt Input: 38376KZ|09/25/15|1.057 38376KZ|09/25/15|1.057 02006YB|09/25/15|0.859 12593PS|09/25/15|2.803...

9. Shell Programming and Scripting

Solution for replacement of 4th column with 3rd column in a file using awk/sed preserving delimiters

Input: "A","B","C,D","E","F" "S","T","U,V","W","X" "AA","BB","CC,DD","EEEE","FFF" Required output: "A","B","C,D","C,D","F" "S","T","U,V","U,V","X" "AA","BB","CC,DD","CC,DD","FFF" I tried using awk, but the double quotes are not preserved for every field. Any help to solve this is much...

10. Shell Programming and Scripting

awk to select lines with maximum value of each record based on column value

Hello, I want to get the maximum value of each record separated by empty line based on the 3rd column of each row within each record? Input: A1 chr5D 634 7 82 707 A2 chr5D 637 6 82 713 A3 chr5D 637 5 82 713 A4 chr5D 626 1 82 704...

LEARN ABOUT OPENDARWIN

cut

CUT(1)							    BSD General Commands Manual 						    CUT(1)

NAME

     cut -- select portions of each line of a file

SYNOPSIS

     cut -b list [-n] [file ...]
     cut -c list [file ...]
     cut -f list [-d delim] [-s] [file ...]

DESCRIPTION

     The cut utility selects portions of each line (as specified by list) from each file and writes them to the standard output.  If no file
     arguments are specified, or a file argument is a single dash ('-'), cut reads from the standard input.  The items specified by list can be
     in terms of column position or in terms of fields delimited by a special character.  Column numbering starts from 1.
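
A minimal sketch of the input handling described above, assuming a POSIX shell and a made-up tab-separated sample line (the variable names are illustrative only):

```shell
# Made-up sample: two tab-separated fields on one line.
sample=$(printf 'alpha\tbeta')

# No file argument: cut reads standard input.
from_stdin=$(printf '%s\n' "$sample" | cut -f 2)

# A single dash as the file argument also means standard input.
from_dash=$(printf '%s\n' "$sample" | cut -f 2 -)

# Numbering starts at 1, so -c 1 selects the first character of the line.
first_char=$(printf '%s\n' "$sample" | cut -c 1)

printf '%s\n' "$from_stdin" "$from_dash" "$first_char"
```

Both ways of reading standard input should select the same second field.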

     The list option argument is a comma or whitespace separated set of increasing numbers and/or number ranges.  Number ranges consist of a
     number, a dash ('-'), and a second number and select the fields or columns from the first number to the second, inclusive.  Numbers or
     number ranges may be preceded by a dash, which selects all fields or columns from 1 to the first number.  Numbers or number ranges may be
     followed by a dash, which selects all fields or columns from the last number to the end of the line.  Numbers and number ranges may be
     repeated, overlapping, and in any order.  It is not an error to select fields or columns not present in the input line.
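
The list forms described above can be sketched as follows. The sample line and the variable names are made up for illustration, and -d : is used only so the fields are easy to see:

```shell
# Made-up sample line with five colon-separated fields.
line='a:b:c:d:e'

# Single field plus a closed range: field 1 and fields 3 through 4.
r_range=$(printf '%s\n' "$line" | cut -d : -f 1,3-4)

# Leading dash: everything from field 1 up to field 2.
r_head=$(printf '%s\n' "$line" | cut -d : -f -2)

# Trailing dash: everything from field 4 to the end of the line.
r_tail=$(printf '%s\n' "$line" | cut -d : -f 4-)

# Field 9 does not exist; that is not an error, it is simply not printed.
r_missing=$(printf '%s\n' "$line" | cut -d : -f 2,9)

# The list may be given in any order, but fields come out in input order.
r_order=$(printf '%s\n' "$line" | cut -d : -f 3,1)

printf '%s\n' "$r_range" "$r_head" "$r_tail" "$r_missing" "$r_order"
```

Note that cut never reorders fields; when the output order matters, a tool such as awk is the usual alternative.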

     The options are as follows:

     -b list
	     The list specifies byte positions.

     -c list
	     The list specifies character positions.

     -d delim
	     Use the first character of delim as the field delimiter character instead of the tab character.

     -f list
	     The list specifies fields, delimited in the input by a single tab character.  Output fields are separated by a single tab character.

     -n      Do not split multi-byte characters.

     -s      Suppress lines with no field delimiter characters.  Unless specified, lines with no delimiters are passed through unmodified.
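
As a rough illustration of -d and -s together, the sketch below uses made-up pipe-delimited records plus one line that contains no delimiter at all (all names and data are assumptions for the example):

```shell
# Made-up input: two pipe-delimited records around a line with no delimiter.
input='38376KZ|09/25/15|1.057
no delimiter here
02006YB|09/25/15|0.859'

# Without -s, the delimiter-free line is passed through unmodified.
passed=$(printf '%s\n' "$input" | cut -d '|' -f 1)

# With -s, lines that contain no delimiter are suppressed.
suppressed=$(printf '%s\n' "$input" | cut -d '|' -f 1 -s)

printf '%s\n' "--- without -s ---" "$passed" "--- with -s ---" "$suppressed"
```

The delimiter is quoted because an unquoted pipe character would be interpreted by the shell.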

ENVIRONMENT

     The LANG, LC_ALL and LC_CTYPE environment variables affect the execution of cut if the -n option is specified.  Their effect is described in
     environ(7).

EXAMPLES

     Extract users' login names and shells from the system passwd(5) file as ``name:shell'' pairs:

	   cut -d : -f 1,7 /etc/passwd

     Show the names and login times of the currently logged in users:

	   who | cut -c 1-16,26-38
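
To try the first example without depending on the local /etc/passwd, here is a sketch that runs on two made-up passwd-style lines and compares the result with an awk equivalent. The sample data and variable names are assumptions; the character positions in the who example depend on the local who output format, so it is not reproduced here.

```shell
# Made-up passwd(5)-style sample: seven colon-separated fields per line.
passwd_sample='root:x:0:0:root:/root:/bin/sh
daemon:x:1:1:daemon:/usr/sbin:/usr/sbin/nologin'

# Same selection as the man page example: login name and shell.
pairs=$(printf '%s\n' "$passwd_sample" | cut -d : -f 1,7)

# An awk equivalent, which also allows the fields to be reordered if needed.
awk_pairs=$(printf '%s\n' "$passwd_sample" | awk -F: 'BEGIN { OFS = ":" } { print $1, $7 }')

printf '%s\n' "$pairs"
```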

DIAGNOSTICS

     The cut utility exits 0 on success, and >0 if an error occurs.
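
A small sketch of checking the exit status in a script, assuming the hypothetical path below does not exist on the system:

```shell
# Hypothetical path that is assumed not to exist.
missing='/nonexistent/path/for-cut-demo'

# Successful run: status should be 0.
ok_status=0
printf 'a\tb\n' | cut -f 1 >/dev/null || ok_status=$?

# Failing run (unreadable file): status should be greater than 0.
err_status=0
cut -f 1 "$missing" >/dev/null 2>&1 || err_status=$?

printf 'ok=%s err=%s\n' "$ok_status" "$err_status"
```

The exact non-zero value is not specified by the text above, so scripts should only test for zero versus non-zero.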

SEE ALSO

     paste(1)

STANDARDS

     The cut utility conforms to IEEE Std 1003.2-1992 (``POSIX.2'').

HISTORY

     A cut command appeared in AT&T System III UNIX.

BUGS

     The -c option is a synonym for the -b option, which causes incorrect behaviour in locales that support multibyte characters.

     When operating on fields (-f option is specified), cut does not recognise multibyte characters, and the delim character is recognised in the
     middle of multibyte sequences.

BSD
								   June 6, 1993 							       BSD
