awk solution to duplicate lines based on column Post: 302862463

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

duplicate row based on single column

I am a newbie to shell scripting .. I have a .csv file. It has 1000 some rows and about 7 columns... but before I insert this data to a table I have to parse it and clean it ..basing on the value of the first column..which a string of phone number type... example below.. column 1 ...

2. Shell Programming and Scripting

AWK Duplicate lines multiple times based on a calculated value

Hi, I'm trying to create an XML sitemap of our dynamic ecommerce sites SEO Friendly URLs and am trying to create the initial page listing. I have a CSV file that looks like the following and need duplicate the lines based on a value which needs calculating. ...

3. Shell Programming and Scripting

awk print non matching lines based on column

My item was not answered on previous thread as code given did not work I wanted to print records from file2 where comparing column 1 and 16 for both files find rows where column 16 in file 1 does not match column 16 in file 2 Here was CODE give to issue ~/unix.com$ cat f1...

4. Shell Programming and Scripting

Perl: filtering lines based on duplicate values in a column

Hi I have a file like this. I need to eliminate lines with first column having the same value 10 times. 13 18 1 + chromosome 1, 122638287 AGAGTATGGTCGCGGTTG 13 18 1 + chromosome 1, 128904080 AGAGTATGGTCGCGGTTG 13 18 1 - chromosome 14, 13627938 CAACCGCGACCATACTCT 13 18 1 + chromosome 1,...

5. UNIX for Dummies Questions & Answers

awk to sum column field from duplicate row/lines

Hello, I am new to Linux environment , I working on Linux script which should send auto email based on the specific condition from log file. Below is the sample log file Name m/c usage abc xxx 10 abc xxx 20 abc xxx 5 xyz ...

6. Shell Programming and Scripting

awk to sum a column based on duplicate strings in another column and show split totals

Hi, I have a similar input format- A_1 2 B_0 4 A_1 1 B_2 5 A_4 1 and looking to print in this output format with headers. can you suggest in awk?awk because i am doing some pattern matching from parent file to print column 1 of my input using awk already.Thanks! letter number_of_letters...

7. Shell Programming and Scripting

Remove duplicate rows based on one column

Dear members, I need to filter a file based on the 8th column (that is id), and does not mather the other columns, because I want just one id (1 line of each id) and remove the duplicates lines based on this id (8th column), and does not matter wich duplicate will be removed. example of my file...

8. Shell Programming and Scripting

Removing duplicate lines on first column based with pipe delimiter

Hi, I have tried to remove dublicate lines based on first column with pipe delimiter . but i ma not able to get some uniqu lines Command : sort -t'|' -nuk1 file.txt Input : 38376KZ|09/25/15|1.057 38376KZ|09/25/15|1.057 02006YB|09/25/15|0.859 12593PS|09/25/15|2.803...

9. Shell Programming and Scripting

Solution for replacement of 4th column with 3rd column in a file using awk/sed preserving delimters

input "A","B","C,D","E","F" "S","T","U,V","W","X" "AA","BB","CC,DD","EEEE","FFF" required output: "A","B","C,D","C,D","F" "S", T","U,V","U,V","X" "AA","BB","CC,DD","CC,DD","FFF" tried using awk but double quotes not preserving for every field. any help to solve this is much...

10. Shell Programming and Scripting

awk to select lines with maximum value of each record based on column value

Hello, I want to get the maximum value of each record separated by empty line based on the 3rd column of each row within each record? Input: A1 chr5D 634 7 82 707 A2 chr5D 637 6 82 713 A3 chr5D 637 5 82 713 A4 chr5D 626 1 82 704...

LEARN ABOUT MOJAVE

perltrap5.18

PERLTRAP(1)						 Perl Programmers Reference Guide					       PERLTRAP(1)

NAME

       perltrap - Perl traps for the unwary

DESCRIPTION

       The biggest trap of all is forgetting to "use warnings" or use the -w switch; see perllexwarn and perlrun. The second biggest trap is not
       making your entire program runnable under "use strict".	The third biggest trap is not reading the list of changes in this version of Perl;
       see perldelta.

   Awk Traps
       Accustomed awk users should take special note of the following:

       o   A Perl program executes only once, not once for each input line.  You can do an implicit loop with "-n" or "-p".

       o   The English module, loaded via

	       use English;

	   allows you to refer to special variables (like $/) with names (like $RS), as though they were in awk; see perlvar for details.

       o   Semicolons are required after all simple statements in Perl (except at the end of a block).	Newline is not a statement delimiter.

       o   Curly brackets are required on "if"s and "while"s.

       o   Variables begin with "$", "@" or "%" in Perl.

       o   Arrays index from 0.  Likewise string positions in substr() and index().

       o   You have to decide whether your array has numeric or string indices.

       o   Hash values do not spring into existence upon mere reference.

       o   You have to decide whether you want to use string or numeric comparisons.

       o   Reading an input line does not split it for you.  You get to split it to an array yourself.	And the split() operator has different
	   arguments than awk's.

       o   The current input line is normally in $_, not $0.  It generally does not have the newline stripped.	($0 is the name of the program
	   executed.)  See perlvar.

       o   $<digit> does not refer to fields--it refers to substrings matched by the last match pattern.

       o   The print() statement does not add field and record separators unless you set $, and "$".  You can set $OFS and $ORS if you're using
	   the English module.

       o   You must open your files before you print to them.

       o   The range operator is "..", not comma.  The comma operator works as in C.

       o   The match operator is "=~", not "~".  ("~" is the one's complement operator, as in C.)

       o   The exponentiation operator is "**", not "^".  "^" is the XOR operator, as in C.  (You know, one could get the feeling that awk is
	   basically incompatible with C.)

       o   The concatenation operator is ".", not the null string.  (Using the null string would render "/pat/ /pat/" unparsable, because the
	   third slash would be interpreted as a division operator--the tokenizer is in fact slightly context sensitive for operators like "/",
	   "?", and ">".  And in fact, "." itself can be the beginning of a number.)

       o   The "next", "exit", and "continue" keywords work differently.

       o   The following variables work differently:

		 Awk	   Perl
		 ARGC	   scalar @ARGV (compare with $#ARGV)
		 ARGV[0]   $0
		 FILENAME  $ARGV
		 FNR	   $. - something
		 FS	   (whatever you like)
		 NF	   $#Fld, or some such
		 NR	   $.
		 OFMT	   $#
		 OFS	   $,
		 ORS	   $
		 RLENGTH   length($&)
		 RS	   $/
		 RSTART    length($`)
		 SUBSEP    $;

       o   You cannot set $RS to a pattern, only a string.

       o   When in doubt, run the awk construct through a2p and see what it gives you.

   C/C++ Traps
       Cerebral C and C++ programmers should take note of the following:

       o   Curly brackets are required on "if"'s and "while"'s.

       o   You must use "elsif" rather than "else if".

       o   The "break" and "continue" keywords from C become in Perl "last" and "next", respectively.  Unlike in C, these do not work within a "do
	   { } while" construct.  See "Loop Control" in perlsyn.

       o   The switch statement is called "given/when" and only available in perl 5.10 or newer.  See "Switch Statements" in perlsyn.

       o   Variables begin with "$", "@" or "%" in Perl.

       o   Comments begin with "#", not "/*" or "//".  Perl may interpret C/C++ comments as division operators, unterminated regular expressions
	   or the defined-or operator.

       o   You can't take the address of anything, although a similar operator in Perl is the backslash, which creates a reference.

       o   "ARGV" must be capitalized.	$ARGV[0] is C's "argv[1]", and "argv[0]" ends up in $0.

       o   System calls such as link(), unlink(), rename(), etc. return nonzero for success, not 0. (system(), however, returns zero for success.)

       o   Signal handlers deal with signal names, not numbers.  Use "kill -l" to find their names on your system.

   Sed Traps
       Seasoned sed programmers should take note of the following:

       o   A Perl program executes only once, not once for each input line.  You can do an implicit loop with "-n" or "-p".

       o   Backreferences in substitutions use "$" rather than "".

       o   The pattern matching metacharacters "(", ")", and "|" do not have backslashes in front.

       o   The range operator is "...", rather than comma.

   Shell Traps
       Sharp shell programmers should take note of the following:

       o   The backtick operator does variable interpolation without regard to the presence of single quotes in the command.

       o   The backtick operator does no translation of the return value, unlike csh.

       o   Shells (especially csh) do several levels of substitution on each command line.  Perl does substitution in only certain constructs such
	   as double quotes, backticks, angle brackets, and search patterns.

       o   Shells interpret scripts a little bit at a time.  Perl compiles the entire program before executing it (except for "BEGIN" blocks,
	   which execute at compile time).

       o   The arguments are available via @ARGV, not $1, $2, etc.

       o   The environment is not automatically made available as separate scalar variables.

       o   The shell's "test" uses "=", "!=", "<" etc for string comparisons and "-eq", "-ne", "-lt" etc for numeric comparisons. This is the
	   reverse of Perl, which uses "eq", "ne", "lt" for string comparisons, and "==", "!=" "<" etc for numeric comparisons.

   Perl Traps
       Practicing Perl Programmers should take note of the following:

       o   Remember that many operations behave differently in a list context than they do in a scalar one.  See perldata for details.

       o   Avoid barewords if you can, especially all lowercase ones.  You can't tell by just looking at it whether a bareword is a function or a
	   string.  By using quotes on strings and parentheses on function calls, you won't ever get them confused.

       o   You cannot discern from mere inspection which builtins are unary operators (like chop() and chdir()) and which are list operators (like
	   print() and unlink()).  (Unless prototyped, user-defined subroutines can only be list operators, never unary ones.)	See perlop and
	   perlsub.

       o   People have a hard time remembering that some functions default to $_, or @ARGV, or whatever, but that others which you might expect to
	   do not.

       o   The <FH> construct is not the name of the filehandle, it is a readline operation on that handle.  The data read is assigned to $_ only
	   if the file read is the sole condition in a while loop:

	       while (<FH>)	 { }
	       while (defined($_ = <FH>)) { }..
	       <FH>;  # data discarded!

       o   Remember not to use "=" when you need "=~"; these two constructs are quite different:

	       $x =  /foo/;
	       $x =~ /foo/;

       o   The "do {}" construct isn't a real loop that you can use loop control on.

       o   Use "my()" for local variables whenever you can get away with it (but see perlform for where you can't).  Using "local()" actually
	   gives a local value to a global variable, which leaves you open to unforeseen side-effects of dynamic scoping.

       o   If you localize an exported variable in a module, its exported value will not change.  The local name becomes an alias to a new value
	   but the external name is still an alias for the original.

       As always, if any of these are ever officially declared as bugs, they'll be fixed and removed.

perl v5.18.2							    2014-01-06							       PERLTRAP(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

duplicate row based on single column

Discussion started by: mitr

2. Shell Programming and Scripting

AWK Duplicate lines multiple times based on a calculated value

Discussion started by: jamesfx

3. Shell Programming and Scripting

awk print non matching lines based on column

Discussion started by: sigh2010

4. Shell Programming and Scripting

Perl: filtering lines based on duplicate values in a column

Discussion started by: polsum