08-07-2011
Combining two single column files side-by-side
Hi,
I am looking for a sed/awk script to join two large (~300 M) single column files (one is sorted and the other is not sorted) side-by-side. I have a shell script but its taking ages to do the task so looking for an optimized fast solution.
The two files look like:
File1 (sorted)
a1
a2
a3
a4
File2 (unsorted)
b1
b2
b3
b4
Desired output:
a1 b1
a2 b2
a3 b3
a4 b4
PS: the two input file are massive.
Cheers,
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hi all,
please help me out in below requirement.
I have two .xls files say abc.xls and bbc.xls , i want to merge these two files into a single file ..say xyz.xls side by side
Ex:
abc.xls
Eno Ename
101 hello
102 hero
103 india
bbc.xls
Eno Ename
201 delhi
202 Hyd
203... (0 Replies)
Discussion started by: jagadish99
0 Replies
2. Shell Programming and Scripting
Hi all,
please help me out in below requirement.
I have two .xls files say abc.xls and bbc.xls , i want to merge these two files into a single file ..say xyz.xls side by side
Ex:
abc.xls bbc.xls
Eno Ename Eno Ename
101 hello ... (2 Replies)
Discussion started by: jagadish99
2 Replies
3. Shell Programming and Scripting
Hi Everyone,
I need a shell/perl script to bring selected columns from all the files located in a directory and place them in a new file side by side.
File1:
a b c d
2 3 4 5
f g h i
..........
File2:
I II III IV
w x y z
..............
and so on many files are there...... (8 Replies)
Discussion started by: ks_reddy
8 Replies
4. Shell Programming and Scripting
I have 2 text files, both have one simple, single column. The 2 files might be the same length, or might not, and if not, it's unknown which one would be longer.
For this example, file1 is longer:
---file1
Joe
Bob
Mary
Sally
Fred
Elmer
David
---file2
Tomato
House
Car... (3 Replies)
Discussion started by: cajunfries
3 Replies
5. Shell Programming and Scripting
Input file_1:
P78811
P40108
O17861
Q6NTW1
P40986
Q6PBK1
P38264
Q6PBK1
Q9CZ49
Q1GZI0
Input file_2: (6 Replies)
Discussion started by: patrick87
6 Replies
6. Shell Programming and Scripting
Hi I'm trying to compare 3 or more files based on similar values and outputting them into 3 columns.
For example:
file1
ABC
DEF
GHI
file2
DEF
DER
file3
ABC
DER
The output should come out like this
file1 file2 file3
ABC ABC (4 Replies)
Discussion started by: zerofire123
4 Replies
7. Shell Programming and Scripting
I have about 100s of files of type text in a known directory. I want to merge all files side by side. Number of lines in all the files will remain same.
For example file1 contains
cat
dog
File 2 contains
rat
mat
Output file should be
cat rat
dog mat
Using awk I was able to... (5 Replies)
Discussion started by: kanthrajgowda
5 Replies
8. UNIX for Dummies Questions & Answers
Hi,
Thank you for your help in advance :D.
I have thousands of files like...
file01
file02
file03
.
.
.
file6324
I would like to paste every other file side by side so it looks like
file01 file02
file03 file04
...e.t.c
file6323 file6324 (3 Replies)
Discussion started by: shaz1990
3 Replies
9. Shell Programming and Scripting
Hi everyone,
I need to merge two files side by side
The files look something like this:
HOSTNAME
fishtornado-K52F 127.0.1.1
UPTIME
20:17:01 up 2:19, 3 users,
load average: 0.00, 0.04, 0.05
DISK USAGE
(Size/Used/Avail/Use%)
29G 6.5G 21G 25%
RUN QUEUE
PID COMMAND USER ... (4 Replies)
Discussion started by: FishTornado
4 Replies
10. Shell Programming and Scripting
I have 8 .csv files with 16 columns and "n" rows with no Header. I want to parse each of these .csv and get column and put the data into a new.csv. Once this is done, the new.csv should have 16 columns (2 from each input.csv) and "n" rows.
Now, I want to just take the average of Column from... (3 Replies)
Discussion started by: Zam_1234
3 Replies
JOIN(1) General Commands Manual JOIN(1)
NAME
join - relational database operator
SYNOPSIS
join [ options ] file1 file2
DESCRIPTION
Join forms, on the standard output, a join of the two relations specified by the lines of file1 and file2. If one of the file names is the
standard input is used.
File1 and file2 must be sorted in increasing ASCII collating sequence on the fields on which they are to be joined, normally the first in
each line.
There is one line in the output for each pair of lines in file1 and file2 that have identical join fields. The output line normally con-
sists of the common field, then the rest of the line from file1, then the rest of the line from file2.
Input fields are normally separated spaces or tabs; output fields by space. In this case, multiple separators count as one, and leading
separators are discarded.
The following options are recognized, with POSIX syntax.
-a n In addition to the normal output, produce a line for each unpairable line in file n, where n is 1 or 2.
-v n Like -a, omitting output for paired lines.
-e s Replace empty output fields by string s.
-1 m
-2 m Join on the mth field of file1 or file2.
-jn m Archaic equivalent for -n m.
-ofields
Each output line comprises the designated fields. The comma-separated field designators are either 0, meaning the join field, or
have the form n.m, where n is a file number and m is a field number. Archaic usage allows separate arguments for field designators.
-tc Use character c as the only separator (tab character) on input and output. Every appearance of c in a line is significant.
EXAMPLES
sort /adm/users | join -t: -a 1 -e "" - bdays
Add birthdays to password information, leaving unknown birthdays empty. The layout of is given in users(6); bdays contains sorted
lines like
tr : ' ' </adm/users | sort -k 3 3 >temp
join -1 3 -2 3 -o 1.1,2.1 temp temp | awk '$1 < $2'
Print all pairs of users with identical userids.
SOURCE
/sys/src/cmd/join.c
SEE ALSO
sort(1), comm(1), awk(1)
BUGS
With default field separation, the collating sequence is that of sort -b -ky,y; with -t, the sequence is that of sort -tx -ky,y.
One of the files must be randomly accessible.
JOIN(1)