Make copy of text file with columns removed (based on header) Post: 302931578

Sponsored Content

Top Forums Shell Programming and Scripting Make copy of text file with columns removed (based on header) Post 302931578 by LMHmedchem on Wednesday 14th of January 2015 01:31:21 AM

01-14-2015

Registered User

Make copy of text file with columns removed (based on header)

Hello,

I have some tab delimited text files with a three header rows. The headers look like, (sorry the tabs look so messy).

Code:

index	group	Name	input	input	input	input	input	input	input	input	input	input	input
int	char	string	double	double	double	double	double	double	double	double	double	double	double
id	group	Name	AtR_Ptb_L	flatness	inv_dx2	rvalHyd	sumLip	xv0	dxv1	Gmax	k2	Spyridin_N	Salph_N

The files could have any number of columns. What I need to do is simple. I just need to copy the file with one or more columns removed. The columns to be removed would be specified by the value in the third row. For example, I could want the files with columns "dxv1" and "k2" removed.

Code:

index	group	Name	input	input	input	input	input	input	input	input	input
int	char	string	double	double	double	double	double	double	double	double	double
id	group	Name	AtR_Ptb_L	flatness	inv_dx2	rvalHyd	sumLip	xv0	Gmax	Spyridin_N	Salph_N

The order of the remaining columns should be the same. It doesn't matter how the list of columns to be removed is formatted. It can be any kind of list.

I have read posts about how to copy specific columns with cut or awk, but not how to skip specific cols and copy everything else. One thing to do would be to find the position of the cols to be removed and use cut, but how to set that up to work in a general implementation is a bit unclear to me. I also suspect that awk would be more efficient.

Any suggestions?

LMHmedchem

LMHmedchem

View Public Profile for LMHmedchem

Find all posts by LMHmedchem

9 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Merging two files based on two columns to make a third file

Hi there, I'm trying to merge two files and make a third file. However, two of the columns need to match exactly in both files AND I want everything from both files in the output if the two columns match in that row. First file looks like this: chr1 10001980 T A Second...

2. Shell Programming and Scripting

Copy and Paste Columns in a Tab-Limited Text file

I have this text file with a very large number of columns (10,000+) and I want to move the first column to the position of the six column so that the text file looks like this: Before cutting and pasting ID Family Mother Father Trait Phenotype aaa bbb ...

3. UNIX for Dummies Questions & Answers

Extracting rows from a text file based on the values of two columns (given ranges)

Hi, I have a tab delimited text file with multiple columns. The second and third columns include numbers that have not been sorted. I want to extract rows where the second column includes a value between -0.01 and 0.01 (including both numbers) and the first third column includes a value between...

4. Shell Programming and Scripting

Reading columns from a text file and to make an array for each column

Hi, I am not so familiar with bash scripting and would appreciate your help here. I have a text file 'input.txt' like this: 2 3 4 5 6 7 8 9 10 I want to store each column in an array like this a ={2 5 8}, b={3 6 9}, c={4 7 10} so that i can access any element, e.g b=6 for the later use.

5. Shell Programming and Scripting

Extract columns based on header

Hi to all, I have two files. File1 has no header, two columns: sample1 A sample2 B sample3 B sample4 C sample5 A sample6 D sample7 D File2 has a header, except for the first 3 columns (chr,start,end). "sample1" is the header for the 4th ,5th ,6th columns, "sample2" is the header...

6. Emergency UNIX and Linux Support

Average columns based on header name

Hi Friends, I have files with columns like this. This sample input below is partial. Please check below for main file link. Each file will have only two rows. ...

7. UNIX for Beginners Questions & Answers

Keep only columns in first two rows based on partial header pattern.

I have this code below that only prints out certain columns from the first two rows (doesn't affect rows 3 and beyond). How can I do the same on a partial header pattern �G_TP� instead of having to know specific column numbers (e.g. 374-479)? I've tried many other commands within this pipe with no...

8. Shell Programming and Scripting

Find columns in a file based on header and print to new file

Hello, I have to fish out some specific columns from a file based on the header value. I have the list of columns I need in a different file. I thought I could read in the list of headers I need, # file with header names of required columns in required order headers_file=$2 # read contents...

9. Shell Programming and Scripting

Find header in a text file and prepend it to all lines until another header is found

I've been struggling with this one for quite a while and cannot seem to find a solution for this find/replace scenario. Perhaps I'm getting rusty. I have a file that contains a number of metrics (exactly 3 fields per line) from a few appliances that are collected in parallel. To identify the...

LEARN ABOUT SUNOS

fspec

fspec(4)							   File Formats 							  fspec(4)

NAME

       fspec - format specification in text files

DESCRIPTION

       It  is  sometimes  convenient  to maintain text files on the system with non-standard tabs, (tabs that are not set at every eighth column).
       Such files must generally be converted to a standard format, frequently by replacing all tabs with the appropriate number of spaces, before
       they  can  be  processed by system commands. A format specification occurring in the first line of a text file specifies how tabs are to be
       expanded in the remainder of the file.

       A format specification consists of a sequence of parameters separated by blanks and surrounded by the brackets <: and  :>.  Each  parameter
       consists of a keyletter, possibly followed immediately by a value. The following parameters are recognized:

       ttabs	       The t parameter specifies the tab settings for the file. The value of tabs must be one of the following:

			 o  A list of column numbers separated by commas, indicating tabs set at the specified columns.

			 o  A '-' followed immediately by an integer n, indicating tabs at intervals of n columns.

			 o  A '-' followed by the name of a ``canned'' tab specification.

		       Standard  tabs  are  specified by t-8, or equivalently, t1,9,17,25, etc. The canned tabs that are recognized are defined by
		       the tabs(1) command.

       ssize	       The s parameter specifies a maximum line size. The value of size must be an integer. Size checking is performed after  tabs
		       have been expanded, but before the margin is prepended.

       mmargin	       The m parameter specifies a number of spaces to be prepended to each line. The value of margin must be an integer.

       d	       The  d  parameter takes no value. Its presence indicates that the line containing the format specification is to be deleted
		       from the converted file.

       e	       The e parameter takes no value. Its presence indicates that the current format is to  prevail  only  until  another  format
		       specification is encountered in the file.

       Default	values,  which	are  assumed for parameters not supplied, are t-8 and m0. If the s parameter is not specified, no size checking is
       performed. If the first line of a file does not contain a format specification, the above defaults are assumed for  the	entire	file.  The
       following is an example of a line containing a format specification:

	      * <:t5,10,15 s72:> *

       If a format specification can be disguised as a comment, it is not necessary to code the d parameter.

SEE ALSO

       ed(1), newform(1), tabs(1)

SunOS 5.10							    3 Jul 1990								  fspec(4)