09-13-2011
Join txt files with diff cols and rows
I am a new user of Unix/Linux, so this question might be a bit simple!
I am trying to join two (very large) files that both have different # of cols and rows in each file.
I want to keep 'all' rows and 'all' cols from both files in the joint file, and the primary key variables are in the rows.
I need all rows that exist in both files to be matched up and joined. However, those rows not in one file or the other should also be kept and their data maintained in the joint file. Basically, all possible max data to be included in joint file.
Hope this makes sense!
small example of files:
file 1 =
A 1 2 3 4
B 1 2
C 1 2 3 4 5
file 2 =
A 1 2 3 4 5
B 1 2 3
C 1 2 3
D 1 2 3 4 5 6
E 1
Joint file should have =
A 1 2 3 4 5
B 1 2 3
C 1 2 3 4 5
D 1 2 3 4 5 6
E 1
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hello,
I want to compare two files. All records in file 2 that are not in file 1 should be output to file 3.
For example:
file 1
123
1234
123456
file 2
123
2345
23456
file 3 should have
2345
23456
I have looked at diff, bdiff, cmp, comm, diff3 without any luck! (2 Replies)
Discussion started by: blt123
2 Replies
2. Shell Programming and Scripting
Hi
Fields in Files 1,2,3,4 are pipe"|" separated.
Say I want to grep
col1 from File1
col3 from File2
col4 from File3
and print to File4 in the following order:
col3|col1|col4
what is the best way of doing this?
Thanks (2 Replies)
Discussion started by: vbshuru
2 Replies
3. UNIX for Dummies Questions & Answers
Hello,
My apologies if this has been posted elsewhere, I have had a look at several threads but I am still confused how to use these functions. I have two files, each with 5 columns:
File A: (tab-delimited)
PDB CHAIN Start End Fragment
1avq A 171 176 awyfan
1avq A 172 177 wyfany
1c7k A 2 7... (3 Replies)
Discussion started by: InfoSeeker
3 Replies
4. UNIX for Dummies Questions & Answers
I am a new user of Unix/Linux, so this question might be a bit simple!
I am trying to join two (very large) files that both have different # of cols and rows in each file.
I want to keep 'all' rows and 'all' cols from both files in the joint file, and the primary key variables are in the rows.... (1 Reply)
Discussion started by: BNasir
1 Replies
5. Shell Programming and Scripting
Hi,
Please help me to convert the input file to a new one.
input file:
--------
1231231231 3 A
4561223343 0 D
1231231231 1 A
1231231231 2 A
1231231231 4 D
7654343444 2 A
4561223343 1 D
4561223343 2 D
the output should be:
--------------------
1231231231 3#1#2 A
4561223343 0 D... (3 Replies)
Discussion started by: vsachan
3 Replies
6. UNIX for Dummies Questions & Answers
Hi,
I have 20 tab delimited text files that have a common column (column 1). The files are named GSM1.txt through GSM20.txt. Each file has 3 columns (2 other columns in addition to the first common column).
I want to write a script to join the files by the first common column so that in the... (5 Replies)
Discussion started by: evelibertine
5 Replies
7. Shell Programming and Scripting
Hi,
I have file with values as below
1~ab~456~ac:bd:de:ef~yyyy-mm-dd
2~cd~458~af:fg:ty:er:ty:uj:io:~yyyy-mm-dd
I want the o/p as for frist row
1~ab~456~ac~yyyy-mm-dd
1~ab~456~bd~yyyy-mm-dd
1~ab~456~de~yyyy-mm-dd
1~ab~456~ef~yyyy-mm-dd
and for the second row
2~cd~458~af~yyyy-mm-dd... (4 Replies)
Discussion started by: rithushri
4 Replies
8. UNIX for Dummies Questions & Answers
Hi all,
I'm trying to join two .txt file tab delimitated based on a common column.
File 1
transcript_id gene_id length effective_length expected_count TPM FPKM IsoPct
comp1000201_c0_seq1 comp1000201_c0 337 183.51 0.00 0.00 0.00 0.00
comp1000297_c0_seq1 ... (1 Reply)
Discussion started by: alisrpp
1 Replies
9. Shell Programming and Scripting
Guys i have 3 files,
but i want to compare and diff only the 2nd column
path=`/home/whois/doms`
for i in `cat domain.tx`
do
whois $i| sed -n '/Registry Registrant ID:/,/Registrant Email:/p' > $path/$i.registrant
whois $i| sed -n '/Registry Admin ID:/,/Admin Email:/p' > $path/$i.admin... (10 Replies)
Discussion started by: kenshinhimura
10 Replies
10. Shell Programming and Scripting
Hi all,
Is there any idea on how to automate convert the paragraph in one line in a file, this will happen after OCR the documents, OCR split every paragraph. I need to join all the paragraph in one line.
#cat file.txtThe Commission on Higher Education (CHED) was created through Republic Act... (7 Replies)
Discussion started by: lxdorney
7 Replies
LEARN ABOUT NETBSD
field_info
FORMS(3) BSD Library Functions Manual FORMS(3)
NAME
dynamic_field_info, field_info -- form library
LIBRARY
Curses Form Library (libform, -lform)
SYNOPSIS
#include <form.h>
int
dynamic_field_info(FIELD *field, int *drows, int *dcols, int *max);
int
field_info(FIELD *field, int *rows, int *cols, int *frow, int *fcol, int *nrow, int *nbuf);
DESCRIPTION
The function dynamic_field_info() returns the sizing information for the field given. The function will return the number of rows, columns
and the maximum growth of the field in the storage pointed to by the drows, dcols and max parameters respectively. Dynamic field information
cannot be requested for the default field. If the field given is not dynamic then dynamic_field_info() will simply return the size of the
actual field. The field_info() will return the number or rows, columns, field starting row, field starting column, number of off screen rows
and number of buffers in rows, cols, frow, fcol, nrow and nbuf respectively.
RETURN VALUES
The functions will return one of the following error values:
E_OK The function was successful.
E_BAD_ARGUMENT A bad argument was passed to the function.
SEE ALSO
curses(3), forms(3)
NOTES
The header <form.h> automatically includes both <curses.h> and <eti.h>.
BSD
January 1, 2001 BSD