Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Filling a tab-separated file with known missing entries in columns Post 302621361 by TheTransporter on Tuesday 10th of April 2012 09:42:31 AM
Old 04-10-2012
Filling a tab-separated file with known missing entries in columns

Hello all,

I have a file which is tab separated like that:


Code:
PHE_205_A    TIP_127_W    ARG_150_B
MET_1150_A    TIP_12_W    VAL_11_B
GLU_60_A    TIP_130_W    ARG_143_B
LEU_1033_A    TIP_203_W    ARG_14_B
SER_1092_A    TIP_203_W    
THR_1090_A    TIP_203_W    
SER_1092_A    TIP_25_W    SER_104_B
TYR_15_B    TIP_25_W    
ASP_61_A    TIP_34_W    THR_134_B
SER_1204_A    TIP_46_W    ASP_8_B
ASP_61_A    TIP_63_W    ARG_131_B
THR_90_A    TIP_76_W    TYR_49_B
THR_1090_A    TIP_91_W    SER_100_B


and I want it to be like that:
Code:
PHE_205_A    TIP_127_W    ARG_150_B
MET_1150_A    TIP_12_W    VAL_11_B
GLU_60_A    TIP_130_W    ARG_143_B
LEU_1033_A    TIP_203_W    ARG_14_B
SER_1092_A    TIP_203_W    ARG_14_B
THR_1090_A    TIP_203_W    ARG_14_B
SER_1092_A    TIP_25_W    SER_104_B
SER_1092_A    TIP_25_W    TYR_15_B   
ASP_61_A    TIP_34_W    THR_134_B
SER_1204_A    TIP_46_W    ASP_8_B
ASP_61_A    TIP_63_W    ARG_131_B
THR_90_A    TIP_76_W    TYR_49_B
THR_1090_A    TIP_91_W    SER_100_B

The missing entry is always the one found in the line before it (if it is _B, then it is always _B, similar is for _A)

I would really appreciate if somebody would give me an idea on how to re-format it....

Thanks!

Moderator's Comments:
Mod Comment Please use code tags instead of quote tags (one button to the right)

Last edited by Scrutinizer; 04-10-2012 at 11:00 AM..
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

parse file into tab separated columns

Hello, I am trying to parse a file that resembles the last three groupings into something looking like the first two lines. I've fiddled with sed and awk a bit, but can't get anything to work properly. I need them separated by some delimiter. The file is some 23,000 lines of the stuff.... ... (9 Replies)
Discussion started by: dkozel
9 Replies

2. Shell Programming and Scripting

Filling in missing columns

Hi all, I have a file that contains about 1000 rows and 800 columns. Nearly every row has 800 columns but some DONT. I want to extend the rows that dont have values with NA's. Here is an example: my file bob 2 4 5 6 8 9 4 5 tar 2 4 5 4 3 2 9 1 bro 3 5 3 4 yar 2 ... (7 Replies)
Discussion started by: gisele_l
7 Replies

3. UNIX for Dummies Questions & Answers

Sum up a decimal column in a tab separated text file and error handling

Hi, I have a small requirement where i need to sum up a column in a text file. Input file 66ab 000000 534385 -00000106350.00 66cd 000000 534485 -00013364511.00 66ad 000000 534485 -00000426548.00 672a 000000 534485 000000650339.82... (5 Replies)
Discussion started by: pssandeep
5 Replies

4. Shell Programming and Scripting

Compare two columns separated by a tab

witam potrzebuje polecenia porownujacego koumny na podstawie n-ostatnich znakow danej linnijki tj mam 2 koumny AiB zawierajace ciag dowolnych znakow (dlugosci w kazdej linijce mga byc rozne wiec uzycie substra odpada) A B ewewewabc nbgujnnabc... (3 Replies)
Discussion started by: Toudi
3 Replies

5. Shell Programming and Scripting

Convert a tab separated file using bash

Dear all, I have a file in this format (like a matrix) - A B C .. X A 1 4 2 .. 2 B 2 6 4 .. 8 C 3 5 5 .. 4 . . . ... . X . . ... . and want to convert it into a file with this format: A A = 1 A B = 4 A C = 2 ... A X = 2 B A = 2 B B = 6 etc (2 Replies)
Discussion started by: TheTransporter
2 Replies

6. UNIX for Dummies Questions & Answers

tab-separated file to matrix conversion

hello all, i have an input file like that A A X0 A B X1 A C X2 ... A Z Xx B A X1 B B X3 .... Z A Xx Z B X4 and i want to have an output like that A B C D A X0 X1 X2 Xy B X1 X3 X4 (4 Replies)
Discussion started by: TheTransporter
4 Replies

7. UNIX for Dummies Questions & Answers

Filling the empty columns in a fixed column file

Hi, I have a file with fixed number of columns (total 58 columns) delimeted by pipe (|). Due to a bug in the application the export file does not come with fixed number of columns. The missing data columns are being replaced by blank in the output file. In one line I can have 25 columns (33... (1 Reply)
Discussion started by: yale_work
1 Replies

8. Shell Programming and Scripting

Problem with a tab separated file

Hi, I have created a tab separated file from the following input file. ADDRESS1 CITY STATE POSTAL COUNTRY LON LAT 32 PRINZREGENTENSTRASSE ROSENHEIM BAYERN 83022 DEU 1212182 4785699 263 VIA DANTE ALIGHIERI BARI PUGLIA 70122 ITA 1686233 4112154 30 VIA MILANO ... (1 Reply)
Discussion started by: ramky79
1 Replies

9. Shell Programming and Scripting

How to replace & with and in tab separated file?

Hi, I have a tab separated. I want to replace all the "&" in 8th column of the file with "and" .I am trying with awk -F, -vOFS=\\t '{$8=($8=="&")?"and":$8}1' test> test1.txt My file is abc def ghk hjk lkm hgb jkluy acvf & bhj hihuhu fgg me mine he her go went has has & had hgf hgy ... (1 Reply)
Discussion started by: jagdishrout
1 Replies

10. Shell Programming and Scripting

Read a tab separated file with empty column

Hi all, I'm trying to read a tab separated file and apply some functions on each column. I have an issue with empty column. Exemple: $ #cat with the sed to allow you to see my tab $ cat foo.txt| sed 's/\t/;/g' a;1;x b;;yI wanted to something like that: while read col1 col2 col3 do ... (4 Replies)
Discussion started by: maturix
4 Replies
tabs(1) 						      General Commands Manual							   tabs(1)

NAME
tabs - set tabs on a terminal SYNOPSIS
tabs [-v[n]] [-ahuUV] file... DESCRIPTION
The tabs program clears and sets tab-stops on the terminal. This uses the terminfo clear_all_tabs and set_tab capabilities. If either is absent, tabs is unable to clear/set tab-stops. The terminal should be configured to use hard tabs, e.g., stty tab0 OPTIONS
General Options -Tname Tell tabs which terminal type to use. If this option is not given, tabs will use the $TERM environment variable. If that is not set, it will use the ansi+tabs entry. -d The debugging option shows a ruler line, followed by two data lines. The first data line shows the expected tab-stops marked with asterisks. The second data line shows the actual tab-stops, marked with asterisks. -n This option tells tabs to check the options and run any debugging option, but not to modify the terminal settings. The tabs program processes a single list of tab stops. The last option to be processed which defines a list is the one that determines the list to be processed. Implicit Lists Use a single number as an option, e.g., "-5" to set tabs at the given interval (in this case 1, 6, 11, 16, 21, etc.). Tabs are repeated up to the right margin of the screen. Explicit Lists An explicit list can be defined after the options (this does not use a "-"). The values in the list must be in increasing numeric order, and greater than zero. They are separated by a comma or a blank, for example, tabs 1,6,11,16,21 tabs 1 6 11 16 21 Use a '+' to treat a number as an increment relative to the previous value, e.g., tabs 1,+5,+5,+5,+5 which is equivalent to the 1,6,11,16,21 example. Predefined Tab-Stops X/Open defines several predefined lists of tab stops. -a Assembler, IBM S/370, first format -a2 Assembler, IBM S/370, second format -c COBOL, normal format -c2 COBOL compact format -c3 COBOL compact format extended -f FORTRAN -p PL/I -s SNOBOL -u UNIVAC 1100 Assembler PORTABILITY
X/Open describes a +m option, to set a terminal's left-margin. None of the entries in the terminal database provide this capability. The -d (debug) and -n (no-op) options are extensions not provided by other implementations. Documentation for other implementations states that there is a limit on the number of tab stops. While some terminals may not accept an arbitrary number of tab stops, this implementation will attempt to set tab stops up to the right margin of the screen, if the given list happens to be that long. SEE ALSO
tset(1), infocmp(1), ncurses(3NCURSES), terminfo(5). This describes ncurses version 5.7 (patch 20100109). tabs(1)
All times are GMT -4. The time now is 04:44 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy