08-07-2011
Combining two single column files side-by-side
Hi,
I am looking for a sed/awk script to join two large (~300 M) single-column files (one is sorted and the other is not) side by side. I have a shell script, but it's taking ages to do the task, so I am looking for a fast, optimized solution.
The two files look like:
File1 (sorted)
a1
a2
a3
a4
File2 (unsorted)
b1
b2
b3
b4
Desired output:
a1 b1
a2 b2
a3 b3
a4 b4
PS: the two input files are massive.
Cheers,
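
For a line-by-line join like the one described above, the standard `paste` utility is usually enough, with no sed or awk needed. It reads both files in lockstep, so memory use stays small however large the inputs are, and it does not matter whether either file is sorted. This is only a minimal sketch: the temporary directory, the `combined` output name, and the sample data are assumptions made for illustration, with File1 and File2 standing in for the real inputs.

```shell
# Hypothetical working directory and sample data, for illustration only.
workdir=$(mktemp -d)
printf '%s\n' a1 a2 a3 a4 > "$workdir/File1"
printf '%s\n' b1 b2 b3 b4 > "$workdir/File2"

# paste joins line N of File1 with line N of File2.
# -d ' ' separates the columns with one space instead of the default tab.
paste -d ' ' "$workdir/File1" "$workdir/File2" > "$workdir/combined"

cat "$workdir/combined"
```

If the two files differ in length, `paste` keeps going and leaves the missing side empty, so it may be worth comparing `wc -l` on both inputs first.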
sdiff(1) User Commands sdiff(1)
NAME
sdiff - print differences between two files side-by-side
SYNOPSIS
sdiff [-l] [-s] [-o output] [-w n] filename1 filename2
DESCRIPTION
sdiff uses the output of the diff command to produce a side-by-side listing of two files indicating lines that are different. Lines of the
two files are printed with a blank gutter between them if the lines are identical, a < in the gutter if the line appears only in filename1,
a > in the gutter if the line appears only in filename2, and a | for lines that are different. (See the EXAMPLES section below.)
OPTIONS
-l Print only the left side of any lines that are identical.
-s Do not print identical lines.
-o output Use the argument output as the name of a third file that is created as a user-controlled merge of filename1 and filename2.
Identical lines of filename1 and filename2 are copied to output. Sets of differences, as produced by diff, are printed,
where each set of differences shares a common gutter character. After printing each set of differences, sdiff prompts the user
with a % and waits for one of the following user-typed commands:
l Append the left column to the output file.
r Append the right column to the output file.
s Turn on silent mode; do not print identical lines.
v Turn off silent mode.
e l Call the editor with the left column.
e r Call the editor with the right column.
e b Call the editor with the concatenation of left and right.
e Call the editor with a zero length file.
q Exit from the program.
On exit from the editor, the resulting file is concatenated to the end of the output file.
-w n Use the argument n as the width of the output line. The default line length is 130 characters.
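
The effect of -s and -w can be tried on two small files. This is a minimal sketch, assuming an sdiff that accepts the options as listed above; the temporary directory, the file names left and right, and their contents are hypothetical.

```shell
# Hypothetical sample files, created only for this illustration.
tmpdir=$(mktemp -d)
printf '%s\n' a b c > "$tmpdir/left"
printf '%s\n' a c d > "$tmpdir/right"

# sdiff exits non-zero when the files differ, so "|| true" keeps a
# script running under "set -e" from stopping here.
full=$(sdiff -w 40 "$tmpdir/left" "$tmpdir/right" || true)
changed=$(sdiff -s -w 40 "$tmpdir/left" "$tmpdir/right" || true)

# Full side-by-side listing, limited to 40 columns.
printf '%s\n' "$full"

# Same comparison with identical lines suppressed by -s.
printf '%s\n' '--- differing lines only ---'
printf '%s\n' "$changed"
```

The first listing shows every line with its gutter character; the second keeps only the line found solely in left and the line found solely in right.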
USAGE
See largefile(5) for the description of the behavior of sdiff when encountering files greater than or equal to 2 Gbyte (2**31 bytes).
EXAMPLES
Example 1: An example of the sdiff command.
A sample output of sdiff follows.
x       |       y
a               a
b       <
c       <
d               d
        >       c
ENVIRONMENT VARIABLES
If any of the LC_* variables (LC_CTYPE, LC_MESSAGES, LC_TIME, LC_COLLATE, LC_NUMERIC, and LC_MONETARY) (see environ(5)) are not set in the
environment, the operational behavior of sdiff for each corresponding locale category is determined by the value of the LANG environment
variable. If LC_ALL is set, its contents are used to override both the LANG and the other LC_* variables. If none of the above variables
is set in the environment, the "C" locale determines how sdiff behaves.
LC_CTYPE Determines how sdiff handles characters. When LC_CTYPE is set to a valid value, sdiff can display and handle text and
filenames containing valid characters for that locale.
ATTRIBUTES
See attributes(5) for descriptions of the following attributes:
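+-----------------------------+-----------------------------+
|       ATTRIBUTE TYPE        |       ATTRIBUTE VALUE       |
+-----------------------------+-----------------------------+
|Availability                 |SUNWesu                      |
+-----------------------------+-----------------------------+
|CSI                          |Enabled                      |
+-----------------------------+-----------------------------+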
SEE ALSO
diff(1), ed(1), attributes(5), environ(5), largefile(5)
SunOS 5.10 20 Dec 1996 sdiff(1)