awk joining multiple lines based on field count Post: 302980525

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Joining files based on multiple keys

I need a script (Perl or awk; anything is fine) to join 3 files based on three key columns. The number of non-key columns can vary in each file. The columns are delimited by semicolons. For example, File1: Dim1;Dim2;Dim3;Fact1;Fact2;Fact3;Fact4;Fact5 ---- data delimited by semicolon --- ...

2. Shell Programming and Scripting

Multiple pattern matching using awk and getting count of lines

Hi, I have a file with multiple rows of data. I want to match a pattern in two columns and, if both conditions are satisfied, increment a counter by 1 and finally print the count value. How to proceed... I tried it this way... awk -F, 'BEGIN {cnt = 0} {if $6 == "VLY278" &&...

3. Windows & DOS: Issues & Discussions

Gawk on Windows: Joining lines only if 1st field matches

Hi, I have two files. file_1: mOnkey huMAn file_2: Human:hates:banana i:like:*** Monkey:loves:banana dogs:kill:cats Desired output: Monkey:loves:banana Human:hates:banana So only when the 1st field matches in both files, print the line from file_2 (case-sensitive). I would also like...

4. Shell Programming and Scripting

Combine multiple lines in file based on specific field

Hi, I am having trouble combining multiple lines of a file. I have records as below. Fields are delimited by TAB. Each line ends with a newline character (\n). Input -------- ABC 123456 abcde 987 890456 7890 xyz ght gtuv ABC 5tyin 1234 789 ghty kuio ABC ghty jind 1234 678 ght ...

5. Shell Programming and Scripting

Joining lines in TXT file based on first character

Hi, I have a pipe-delimited text file where lines have been split over 2 lines and I need to join them back together. For example, the file I have is similar to the following: aaa|bbb |ccc ddd|eee fff|ggg |hhh Ideally I need it to look like the following: aaa|bbb|ccc ddd|eee...

6. Shell Programming and Scripting

Awk: print lines with one of multiple pattern in the same field (column)

Hi all, I am new to using awk and am quickly discovering what a powerful pattern-recognition tool it is. However, I have what seems like a fairly basic task that I just can't figure out how to perform in one line. I want awk to find and print all the lines in which one of multiple patterns (e.g....

7. Shell Programming and Scripting

awk Parse And Create Multiple Files Based on Field Value

Hello: I am working on parsing a large input file, which will be broken down into multiple files based on the second field in the file, in this case: STORE. The idea is to create each file with the corresponding store number, for example: Report_$STORENUM_$DATETIMESTAMP , and obtaining the...

8. Shell Programming and Scripting

awk to remove lines where field count is greater than 1 in two fields

I am trying to remove all the lines and spaces where the count in $4 or $5 is greater than 1 (more than 1 letter). The file and the output are tab-delimited. Thank you :). file X 5811530 . G C NLGN4X 17 10544696 . GA G MYH3 9 96439004 . C ...

9. Shell Programming and Scripting

awk to print lines based on text in field and value in two additional fields

In the awk below I am trying to print the entire line, along with the header row, if $2 is SNV or MNV or INDEL. If that condition is met or is true, and $3 is less than or equal to 0.05, then in $7 the sub pattern :GMAF= is found and the value after the = sign is checked. If that value is less than...

10. Shell Programming and Scripting

awk to adjust text and count based on value in field

The awk below executes as is and produces the current output. It is very close, but what I cannot seem to do is add the -exon...; the ... portion comes from $1 and the _exon is static and will never change. If there is a + sign in $4, then the ... is in ascending order or sequential. If there is a - in...

LEARN ABOUT HPUX

join

join(1) 						      General Commands Manual							   join(1)

NAME

       join - relational database operator

SYNOPSIS

       join [options] file1 file2

DESCRIPTION

       join forms, on the standard output, a join of the two relations specified by the lines of file1 and file2.  If file1 or file2 is -, the
       standard input is used.

       file1 and file2 must be sorted in increasing collating sequence (see Environment Variables below) on the fields on which they are to be
       joined; normally the first in each line.

       The output contains one line for each pair of lines in file1 and file2 that have identical join fields.  The output line normally consists
       of the common field followed by the rest of the line from file1, then the rest of the line from file2.

       The default input field separators are space, tab, or new-line.  In this case, multiple separators count as one field separator, and
       leading separators are ignored.  The default output field separator is a space.
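
       The default behavior described above can be tried with a minimal sketch like the one below.  The file names and their contents are
       hypothetical sample data, not part of the manual page; both files are already sorted on their first field.

```shell
# Hypothetical sample data in a scratch directory; both files are
# already sorted on field 1, which is the default join field.
tmp=$(mktemp -d)
printf '%s\n' 'apple red' 'banana yellow' 'cherry dark' > "$tmp/file1"
printf '%s\n' 'apple 3' 'cherry 7' 'date 9' > "$tmp/file2"

# One output line per matching pair: the common field, then the rest of
# the file1 line, then the rest of the file2 line, separated by spaces.
basic=$(join "$tmp/file1" "$tmp/file2")
printf '%s\n' "$basic"

rm -rf "$tmp"
```

       Only apple and cherry appear in both files, so banana and date produce no output line here.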

       Some of the options below use the argument n.  This argument should be a 1 or a 2, referring to either file1 or file2, respectively.

   Options
       -a n        In addition to the normal output, produce a line for each unpairable line in file n, where n is 1 or 2.

       -e s        Replace empty output fields by string s.

       -j m        Join on field m of both files.  The argument m must be delimited by space characters.  This option and the following two
                   are provided for backward compatibility.  Use of the -1 and -2 options (see below) is recommended for portability.

       -j1 m       Join on field m of file1.

       -j2 m       Join on field m of file2.

       -o list     Each output line comprises the fields specified in list, each element of which has the form n.m, where n is a file
                   number and m is a field number.  The common field is not printed unless specifically requested.

       -t c        Use character c as a separator (tab character).  Every appearance of c in a line is significant.  The character c is
                   used as the field separator for both input and output.

       -v file_number
                   Instead of the default output, produce a line only for each unpairable line in file_number, where file_number is 1 or 2.

       -1 f        Join on field f of file 1.  Fields are numbered starting with 1.

       -2 f        Join on field f of file 2.  Fields are numbered starting with 1.
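
       The handling of unpairable lines can be illustrated with a short sketch.  It assumes the conventional option letters -a, -v, -e and
       -o for the options described above; the file names left and right and the filler string NONE are hypothetical.

```shell
# Hypothetical sample data; both files are sorted on field 1.
tmp=$(mktemp -d)
printf '%s\n' 'a 1' 'b 2' 'c 3' > "$tmp/left"
printf '%s\n' 'a x' 'c z' 'd w' > "$tmp/right"

# -v 1: print only the lines of file 1 that have no partner in file 2.
only_left=$(join -v 1 "$tmp/left" "$tmp/right")
printf '%s\n' "$only_left"

# -a 1 keeps unpaired file-1 lines in addition to the normal output;
# -o selects the output fields and -e fills the ones that are empty.
padded=$(join -a 1 -e NONE -o 1.1,1.2,2.2 "$tmp/left" "$tmp/right")
printf '%s\n' "$padded"

rm -rf "$tmp"
```

       The key b exists only in left, so it is the single line reported by -v 1 and the only line padded with NONE.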

EXTERNAL INFLUENCES

   Environment Variables
       LC_COLLATE determines the collating sequence join expects from input files.

       LC_CTYPE determines the alternative blank character as an input field separator, and the interpretation of data within files as single
       and/or multi-byte characters.  LC_CTYPE also determines whether the separator defined through the -t option is a single- or multi-byte
       character.

       If LC_COLLATE or LC_CTYPE is not specified in the environment or is set to the empty string, the value of LANG is used as a default for
       each unspecified or empty variable.  If LANG is not specified or is set to the empty string, a default of ``C'' (see lang(5)) is used
       instead of LANG.  If any internationalization variable contains an invalid setting, join behaves as if all internationalization
       variables are set to ``C'' (see environ(5)).

   International Code Set Support
       Single- and multi-byte character code sets are supported with the exception that multi-byte-character file names are not supported.

EXAMPLES

       The following command line joins the password file and the group file, matching on the numeric group ID, and outputting the login name, the
       group name, and the login directory.  It is assumed that the files have been sorted in the collating sequence defined by the LC_COLLATE or
       LANG environment variable on the group ID fields.
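
       The command line itself is missing from this copy of the page.  A sketch of a command matching that description is shown below.  It runs
       against small hypothetical stand-ins for the password and group files (real ones are rarely sorted by group ID), and the field numbers
       assume the usual layouts: group ID in field 4 of the password file and field 3 of the group file, login directory in field 6.

```shell
# Hypothetical passwd- and group-style files, sorted on the group ID
# (field 4 of the passwd file, field 3 of the group file).
tmp=$(mktemp -d)
printf '%s\n' \
  'alice:x:1001:100:Alice:/home/alice:/bin/sh' \
  'bob:x:1002:100:Bob:/home/bob:/bin/sh' \
  'carol:x:1003:200:Carol:/home/carol:/bin/sh' > "$tmp/passwd"
printf '%s\n' 'users:x:100:' 'staff:x:200:carol' > "$tmp/group"

# -t: sets the separator, -1 4 and -2 3 pick the join fields, and -o
# selects login name (1.1), group name (2.1) and login directory (1.6).
report=$(join -t: -1 4 -2 3 -o 1.1,2.1,1.6 "$tmp/passwd" "$tmp/group")
printf '%s\n' "$report"

rm -rf "$tmp"
```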

       The following command produces an output consisting of all possible combinations of lines that have identical first fields in the two
       sorted files sf1 and sf2, with each line consisting of the first and third fields from sf1 and the second and fourth fields from sf2.
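
       This command is also missing from this copy.  One possible form is sketched below, using the -1, -2 and -o options; the contents of
       sf1 and sf2 are hypothetical sample data, and the order of the fields in the -o list is an assumption.

```shell
# Hypothetical contents for sf1 and sf2, both sorted on field 1.
tmp=$(mktemp -d)
printf '%s\n' 'k1 a b c' 'k1 d e f' 'k2 g h i' > "$tmp/sf1"
printf '%s\n' 'k1 p q r' 'k1 s t u' 'k3 v w x' > "$tmp/sf2"

# Every sf1 line with key k1 pairs with every sf2 line with key k1.
# -o keeps fields 1 and 3 from sf1 and fields 2 and 4 from sf2.
combos=$(join -1 1 -2 1 -o 1.1,1.3,2.2,2.4 "$tmp/sf1" "$tmp/sf2")
printf '%s\n' "$combos"

rm -rf "$tmp"
```

       Two lines on each side share the key k1, so the output has four lines; k2 and k3 have no partner and are dropped.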

WARNINGS

       With default field separation, the collating sequence is that of sort -b; with -t, the sequence is that of a plain sort.

       The conventions of join, sort, comm, uniq, and awk are incongruous.

       Numeric filenames may cause conflict when the -o option is used immediately before listing filenames.

AUTHOR

       join was developed by OSF and HP.

SEE ALSO

       awk(1), comm(1), sort(1), uniq(1).

STANDARDS CONFORMANCE

																	   join(1)
