Joining fixed width files

01-31-2018

Registered User

70, 0

Join Date: Jan 2007

Last Activity: 12 April 2020, 6:06 AM EDT

Posts: 70

Thanks Given: 16

Thanked 0 Times in 0 Posts

Joining fixed width files

Hi All,

I need to join fixed width files on a column which is position 1 to 3 and need to have all the records from file1

file1.txt

Code:

Cu1nullL1L2
Cu2nullL1L2
Cu3nullL1L2

file2.txt

Code:

Cu1B1B2
Cu3B1B2

output.txt

Code:

Cu1L1B1L2B2
Cu2L1L2
Cu3L1B1L2B3

I tried but not getting the expected resuls.

Any inputs please.

Thanks
Shashi

shash

View Public Profile for shash

Find all posts by shash

01-31-2018

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

What did you try?

What operating system are you using?

What shell are you using?

Is output.txt supposed to be a fixed-width file too? If so, what fill character is supposed to be used to fill the "empty" space at the end of the Cu2 record?

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

01-31-2018

Registered User

70, 0

Join Date: Jan 2007

Last Activity: 12 April 2020, 6:06 AM EDT

Posts: 70

Thanks Given: 16

Thanked 0 Times in 0 Posts

Hi Don,

I tried to print but unable to join the files and getting error.

Code:

awk '{print substr($0,1,3),substr($0,8,2),substr($0,10,2)}' file1.txt
awk '{print substr($0,1,3),substr($0,4,2),substr($0,6,2)}' file2.txt

I 'm using bash.
output.txt should be fixed width file and if there is no matching record then it should have white spaces at the end.

Thanks
Shash

shash

View Public Profile for shash

Find all posts by shash

01-31-2018

Registered User

15,129, 5,008

Join Date: Jul 2012

Last Activity: 4 May 2020, 4:31 PM EDT

Location: Aachen, Germany

Posts: 15,129

Thanks Given: 735

Thanked 5,008 Times in 4,483 Posts

How about - for exactly your sample files given in post#1 -

Code:

awk '
                {$0 = substr($0,1,3) " " substr($0,length-3,2) " " substr($0,length-1,2)
                }

NR == FNR       {T1[$1] = $2
                 T2[$1] = $3
                 next
                }

                {$0 = $1 $2 T1[$1] $3 T2[$1]
                 $0 = sprintf ("%s%*s", $0, 11-length, "")
                }

1

' file2 file1
Cu1L1B1L2B2
Cu2L1L2    
Cu3L1B1L2B2

This User Gave Thanks to RudiC For This Post:

RudiC

View Public Profile for RudiC

Find all posts by RudiC

01-31-2018

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

Hi Shash,
Your desired output doesn't make any sense to me. When you find lines in both files for a given key, the data from the fields in both files are intermixed. When data is only found in one file, why isn't the output supposed to have the data from each field in that input file in the output columns related to those input fields. In other words, with your sample input data, why isn't the output:

Code:

Cu1L1B1L2B2
Cu2L1  L2  
Cu3L1B1L2B2

instead of the output you said you want:

Code:

Cu1L1B1L2B2
Cu2L1L2    
Cu3L1B1L2B3

especially since there is no B3 anywhere in either of your sample input files?

To get the output shown above, you could use something like:

Code:

awk '
FNR == NR {
	data1[key = substr($0, 1, 3), 1] = substr($0, 8, 2)
	data1[key, 2] = substr($0, 10, 2)
	keys[key]
	next
}
{	data2[key = substr($0, 1, 3), 1] = substr($0, 4, 2)
	data2[key, 2] = substr($0, 6, 2)
	keys[key]
}
END {	for(key in keys)
		printf("%s%2.2s%2.2s%2.2s%2.2s\n", key, data1[key, 1],
		    data2[key, 1], data1[key, 2], data2[key, 2])
}' file1.txt file2.txt > output.txt

but note that the order of the output lines may vary. If the output order matters, you need to clearly state how the output order should be determined when:

file1.txt contains keys that do not appear in file2.txt (as in your example),
file2.txt contains keys that do not appear in file1.txt, and
both files contain keys that do not appear in the other file.

Note that I asked what operating system you're using and you didn't answer.

If you're using a Solaris/SunOS system and want to try the above code, change awk to /usr/xpg4/bin/awk or nawk.

The missing 2nd components of the 2nd data1[] and data2[] assignments have been fixed as noted by RudiC in post #5.

Last edited by Don Cragun; 01-31-2018 at 06:40 PM.. Reason: Fix typos.

This User Gave Thanks to Don Cragun For This Post:

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

01-31-2018

Registered User

15,129, 5,008

Join Date: Jul 2012

Last Activity: 4 May 2020, 4:31 PM EDT

Location: Aachen, Germany

Posts: 15,129

Thanks Given: 735

Thanked 5,008 Times in 4,483 Posts

Hmmm - I'm afraid the second index is missing in the

Quote:

data1[key] = substr($0, 10, 2)

and

Quote:

data2[key] = substr($0, 6, 2)

assignments, or do I get this wrong?

This User Gave Thanks to RudiC For This Post:

RudiC

View Public Profile for RudiC

Find all posts by RudiC

01-31-2018

Registered User

12,315, 4,560

Join Date: Jul 2012

Last Activity: 22 November 2019, 4:29 PM EST

Location: San Jose, CA, USA

Posts: 12,315

Thanks Given: 952

Thanked 4,560 Times in 3,818 Posts

Quote:

Originally Posted by RudiC

Hmmm - I'm afraid the second index is missing in the and assignments, or do I get this wrong?

Yes, indeed. You got it right.

Post #5 has been fixed.

Thanks,
Don

Don Cragun

View Public Profile for Don Cragun

Find all posts by Don Cragun

Shell Programming and Scripting

Joining fixed width files

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Alter Fixed Width File

Discussion started by: vinus

2. UNIX for Dummies Questions & Answers

Length of a fixed width file

Discussion started by: Amrutha24

3. Shell Programming and Scripting

variable fixed-width fields

Discussion started by: gray380

4. Shell Programming and Scripting

Comparing two fixed width file

Discussion started by: anshul_er

5. Shell Programming and Scripting

Fixed-Width file from Oracle

Discussion started by: Amit.Sagpariya

6. Shell Programming and Scripting

Printing Fixed Width Columns

Discussion started by: cixelsyd

7. Shell Programming and Scripting

awk: creating a fixed-width single file from 2 different files

Discussion started by: tamahomekarasu

8. Shell Programming and Scripting

Removing \n within a fixed width record

Discussion started by: CKT_newbie88

9. UNIX Desktop Questions & Answers

Help with Fixed width File Parsing

Discussion started by: sate911

10. UNIX for Dummies Questions & Answers

Fixed Width file using AWK

Discussion started by: alok.benjwal