how to add duplicate lines

08-20-2010

Registered User

25, 0

Join Date: Aug 2009

Last Activity: 9 December 2016, 5:54 AM EST

Posts: 25

Thanks Given: 15

Thanked 0 Times in 0 Posts

and what i meant by horrify was about my horrible newbie script!!!

mikey11415

View Public Profile for mikey11415

Find all posts by mikey11415

08-20-2010

Registered User

1,466, 512

Join Date: Jul 2010

Last Activity: 7 April 2014, 3:02 PM EDT

Location: earth>US>UTC-5

Posts: 1,466

Thanks Given: 110

Thanked 512 Times in 491 Posts

Quote:

Originally Posted by mikey11415

and what i meant by horrify was about my horrible newbie script!!!

What is important is that you are trying, and learning. We all had to start somewhere and every once in a while I come across some of my old code and wonder why I ever did it that way!!

This User Gave Thanks to agama For This Post:

agama

View Public Profile for agama

Find all posts by agama

08-20-2010

Registered User

25, 0

Join Date: Aug 2009

Last Activity: 9 December 2016, 5:54 AM EST

Posts: 25

Thanks Given: 15

Thanked 0 Times in 0 Posts

Thanks!
I just can't believe how much you are helping me solve a major issue!!!
I have another couple of questions.
I needed to rearrange some of the columns in the datafile.
I was able to do this using awk (i am proud of myself!)
But, there are a coiple of places where I am stumped
so for example,
if I have two files

fileA
12 test

and fileB
44 junk

and I want a line in my output that is

12 44

how do I go about that with awk?

I know that i want to do something like this

awk '{print $1}' fileA > outputA
awk '{print $1}' fileB > outputB

but then how do i get the outputA and outputB onto the same line?

you have been amazingly helpful thus far...i hope that you don't mind another couple of questions!

mikey

Last edited by mikey11415; 08-20-2010 at 11:33 PM.. Reason: i figured out the second part

mikey11415

View Public Profile for mikey11415

Find all posts by mikey11415

08-21-2010

Registered User

48, 10

Join Date: Aug 2010

Last Activity: 30 August 2014, 6:29 AM EDT

Posts: 48

Thanks Given: 0

Thanked 10 Times in 9 Posts

Hello. I hope you don't mind another suggestion. Try to use getline instead:

Code:

awk '
    BEGIN {
        while (getline < ARGV[1]) {
            a = $1

            if (!(getline < ARGV[2])) {
                break
            }

            b = $1

            print a " " b
        }

        exit(0)
    }
' fileA fileB

These 2 Users Gave Thanks to konsolebox For This Post:

konsolebox

View Public Profile for konsolebox

Find all posts by konsolebox

08-21-2010

Registered User

25, 0

Join Date: Aug 2009

Last Activity: 9 December 2016, 5:54 AM EST

Posts: 25

Thanks Given: 15

Thanked 0 Times in 0 Posts

well i did this

cat ZZlinecount ZZcharcount > ZZline_char #make one file from two
awk '{printf $1 " "}' ZZline_char > ZZline_char1 #get the first column

and that WORKED

but when I cat this output with anther output, it puts them on the same line (I think because ZZline_char1 does not have an end of line)
I would like ZZline_char1 to have an end of line so that the think that i cat after this is on the next line

i am so close!

thanks for the help!!!

mikey

mikey11415

View Public Profile for mikey11415

Find all posts by mikey11415

08-21-2010

Registered User

1,466, 512

Join Date: Jul 2010

Last Activity: 7 April 2014, 3:02 PM EDT

Location: earth>US>UTC-5

Posts: 1,466

Thanks Given: 110

Thanked 512 Times in 491 Posts

Quote:

Originally Posted by mikey11415

but then how do i get the outputA and outputB onto the same line?

There are a couple of ways that you could go about this. What makes the difference is what is in the two files. A single line in each, or wanting every line processed, is pretty straight forward. If there are some lines that aren't needed it gets a bit more complicated.

Here's an example that assumes you want the first field of all lines mashed together and written to stdout. You should be able to add some extra pattern matching if you don't need all of the lines.

Code:

( sed 's/^/file1 /' file1; cat file2 ) | awk '
        /^file1/ {
                save[i++] = $2;         # we added a field, so it is $2
                next;
        }

        {
                printf( "%s %s\n", save[j++], $1 );
                if( j >= i )
                        exit( 0 );              # bail if file2 has more lines
        }
'

I think you can figure it out, but I will point out that the parenthesis round the sed and cat commands are very important. Kshell executes the commands placed the parens in a subprocess and all output from that process is piped into the awk.

If the files are largish, then there are better ways of doing this -- stuffing everything from the first file into an array isn't the best form, but is easier to understand and for a few hundred lines it is better to keep it simple.

There are other posts round this forum that do this kind of thing by putting both filenames on the awk command line, and then test FILENAME within the awk programme to determine what to do. Nothing wrong with that, but I prefer this method as it lets you dynamically supply the filenames without having to hard code them in the awk programme (they could be passed into the script and instead of file1/file2 referenced as $1/$2 or somesuch. A small amount of extra overhead in the sed processes, but a big win in terms of flexibility.

Happy to help!

---------- Post updated at 23:26 ---------- Previous update was at 23:17 ----------

Quote:

Originally Posted by mikey11415

well i did this

cat ZZlinecount ZZcharcount > ZZline_char #make one file from two
awk '{printf $1 " "}' ZZline_char > ZZline_char1 #get the first column

and that WORKED

but when I cat this output with anther output, it puts them on the same line (I think because ZZline_char1 does not have an end of line)
I would like ZZline_char1 to have an end of line so that the think that i cat after this is on the next line

i am so close!

thanks for the help!!!

mikey

Puts them on the same line because printf() is different than print. The print command automatically prints a newline while printf() does not.

A small change will solve this:

Code:

awk '{printf( "%s\n", $1 ) }' ZZline_char > ZZline_char1

And a bit of wisdom passed down by one of the original authors of awk was to always use parens with printf().

@konsolebox -- using getline() was what I referred to as the better way, but not as straight forward. You beat me to the punch

This User Gave Thanks to agama For This Post:

agama

View Public Profile for agama

Find all posts by agama

08-21-2010

Registered User

25, 0

Join Date: Aug 2009

Last Activity: 9 December 2016, 5:54 AM EST

Posts: 25

Thanks Given: 15

Thanked 0 Times in 0 Posts

hm
when I use
awk '{printf( "%s\n", $1 ) }' ZZline_char > ZZline_char1

it prints each number onto a separate line. the thing is that i want the first column from two rows from ZZline_char on the same line.
Then, on the next line, I want to cat another file

So ZZline_char looks like this
11 test
22 junk

i want a file that looks like this
11 22

then i have a second file that only has this
100

i want the final output to be
11 22
100

what i was getting was
11 22 100

now i am getting
11
22
100

you are all AMAZINGLY helpful!!!

mikey

---------- Post updated at 11:58 PM ---------- Previous update was at 11:36 PM ----------

OMG I did it!

awk '{printf $1 " "}' ZZline_char > ZZline_char1
echo " " > ZZnewline

then

cat ZZline_char1 ZZnewline ZZthird > dataset

that ZZnewline just put a line in there for me!!!

THANK YOU ALL SO MUCH!!!!

Now I can sleep!!!

All the best

Mikey

---------- Post updated 08-21-10 at 12:16 AM ---------- Previous update was 08-20-10 at 11:58 PM ----------

thanks so much everyone, that solved everything!!!

best

mikey

mikey11415

View Public Profile for mikey11415

Find all posts by mikey11415

UNIX for Dummies Questions & Answers

how to add duplicate lines

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

How to remove duplicate lines?

Discussion started by: nalu

2. Shell Programming and Scripting

Duplicate lines

Discussion started by: sxiong

3. UNIX for Dummies Questions & Answers

Duplicate lines in a file

Discussion started by: nsuresh316

4. Shell Programming and Scripting

Script to duplicate lines

Discussion started by: clinisbud

5. UNIX for Advanced & Expert Users

In a huge file, Delete duplicate lines leaving unique lines

Discussion started by: krishnix

6. Shell Programming and Scripting

Print duplicate lines

Discussion started by: locoroco

7. Shell Programming and Scripting

Duplicate lines in a file

Discussion started by: faiz1985

8. UNIX for Dummies Questions & Answers

Duplicate columns and lines

Discussion started by: dr_sabz

9. Shell Programming and Scripting

Duplicate Lines x 4

Discussion started by: serm

10. UNIX for Advanced & Expert Users

Duplicate lines in the file

Discussion started by: guptan