Sponsored Content
Top Forums Shell Programming and Scripting compare files by numerical value Post 302366502 by s-layer on Thursday 29th of October 2009 05:25:41 PM
Old 10-29-2009
compare files by numerical value

Hi everyone,

I would love to have a script that does the following:

I have one file that looks like that:

Code:
ATOM      1  BB  SER 1   1     -31.958 -25.125 -11.061  1.00  0.00      
ATOM      3  BB  GLY 1   2     -32.079 -26.085 -14.466  1.00  0.00      
ATOM      4  BB  VAL 1   3     -36.455 -21.265 -15.792  1.00  0.00      
ATOM      6  BB  SER 1   4     -37.401 -20.877 -19.029  1.00  0.00     
ATOM      8  BB  ALA 1   5     -42.701 -21.232 -18.584  1.00  0.00     
ATOM     10  BB  VAL 1   6     -47.498 -23.718 -18.979  1.00  0.00

Then I have a second file that looks like that:

Code:
1
3
6

What I want to do is: In those lines of file1 where $6 has one of the values of file2, I add an additional column in file1 $12=="cg". The output should look like this:

Code:
ATOM      1  BB  SER 1   1     -31.958 -25.125 -11.061  1.00  0.00  cg   
ATOM      3  BB  GLY 1   2     -32.079 -26.085 -14.466  1.00  0.00      
ATOM      4  BB  VAL 1   3     -36.455 -21.265 -15.792  1.00  0.00  cg    
ATOM      6  BB  SER 1   4     -37.401 -20.877 -19.029  1.00  0.00     
ATOM      8  BB  ALA 1   5     -42.701 -21.232 -18.584  1.00  0.00     
ATOM     10  BB  VAL 1   6     -47.498 -23.718 -18.979  1.00  0.00  cg

Could anyone help me please? That would be great! Normally I am using awk or perl :-)

Thank you so much,
Christine

Last edited by vgersh99; 10-29-2009 at 06:49 PM.. Reason: code tags, please!
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Listing files in numerical order

Hi, I'm trying to write a ksh script to copy a specified number of files from one directory to another. The files are named in the convention <switchname>_log.<num> and the numbers are sequential single digit onwards. I figured I could find some parameter for ls which would list the files in... (3 Replies)
Discussion started by: Steve_H
3 Replies

2. Programming

Adding files of numerical data

Hi I was hoping that maybe someone could help me with a small piece of C code. I have a number of files, which are all of similar layout ie. three lines of text and 5-6 columns of numerical data. I need to add each of the elements of the second column in one file to their counterparts in the second... (17 Replies)
Discussion started by: Boucho
17 Replies

3. UNIX for Dummies Questions & Answers

Moving files out of multiple directories and renaming them in numerical order

Hi, I have 500 directories each with multiple data files inside them. The names are sort of random. For example, one directory has files named e_1.dat, e_5.dat, e_8.dat, etc. I need to move the files to a single directory and rename them all in numerical order, from 1.dat to 1000(or some... (1 Reply)
Discussion started by: renthead720
1 Replies

4. UNIX for Dummies Questions & Answers

moving/copying files in a numerical order

Hi I am newbie to unix scripting, but i have enough knowledge to understand. I have a specific questions like, I use to collect like 3500 files per experiment, each one named like data_001.img.. data_002.img data_003.img .... data_3500.img I would like to move every 12 files in the 3500... (3 Replies)
Discussion started by: wpat
3 Replies

5. UNIX for Dummies Questions & Answers

finding and moving files based on the last three numerical characters in the filename

Hi, I have a series of files (upwards of 500) the filename format is as follows CC10-1234P1999.WGS84.p190, all in one directory. Now the last three numeric characters, in this case 999, can be anything from 001 to 999. I need to move some of them to a seperate directory, the ones I need to... (5 Replies)
Discussion started by: roche.j.mike
5 Replies

6. UNIX for Dummies Questions & Answers

List files according to the numerical value

Hi, I have a large number of files which are named as follows. VF_50, VF_100, VF_150, VF_250, VF_300, VF_350, VF_400, VF_450, VF_500. When I do an 'ls' it arranges the files in the following way VF_100, VF_150, VF_250, VF_300, VF_350, VF_400, VF_450, VF_50, VF_500. Is there a way to... (2 Replies)
Discussion started by: lost.identity
2 Replies

7. Shell Programming and Scripting

Deleting particular files with a numerical suffix

Hello I have a directory with a list of files which have a particular numerical suffix. E.g filename_0 filename_1 filename_18500 filename_10000 I want to delete all files from this directory which have a filename which have a numerical suffix greater than 10540. So any files... (5 Replies)
Discussion started by: kamal_p_99
5 Replies

8. Shell Programming and Scripting

how to extract data from numbered files using linux in the numerical order-

Hi experts, I have a list of files containing forces as the only number as follows. Force1.txt Force2.txt Force3.txt Force4.txt Force5.txt . . . . . . . . . Force100.txt I want to put all the data(only a number ) in these forces files in the file with the same order like 1,2,3 ..100 .... (2 Replies)
Discussion started by: hamnsan
2 Replies

9. Shell Programming and Scripting

Match and store numerical prefix to update files

In the bash below the unique headers of each vcf.gz are stored in a text file with the same name. That is if 16-0000-file.vcf.gz was used the header text file would be 16-0000-file_header.txt. There can be multiple vcf.gz in a directory, usually 3, that I need to fix the header in each file before... (6 Replies)
Discussion started by: cmccabe
6 Replies

10. Shell Programming and Scripting

Bash to add portion of text to files in directory using numerical match

In the below bash I am trying to rename eachof the 3 text files in /home/cmccabe/Desktop/percent by matching the numerical portion of each file to lines 3,4, or 5 in /home/cmccabe/Desktop/analysis.txt. There will always be a match between the files. When a match is found each text file in... (2 Replies)
Discussion started by: cmccabe
2 Replies
RODS(1) 						      General Commands Manual							   RODS(1)

NAME
rods - Raster3D preprocessor for ball-and-stick models SYNOPSIS
rods [-h] [-b] [-radius R] [-Bcolor Bmin Bmax] Rods reads a file describing atom colours and/or a PDB coordinate file and produces a file containing Raster3D descriptor records. The file produced by rods may be fed directly to render or it may be combined with descriptor files produced by other Raster3D utilities. EXAMPLES
To describe a simple bonds-only model coloured by residue type: cat mycolours.pdb protein.pdb | rods | render > mypicture.png To render the same molecule as ball-and-stick: cat mycolours.pdb protein.pdb | rods -b | render > mypicture.png OPTIONS
-h Suppress header records in output. By default rods will produce an output file which starts with header records containing a default set of scaling and processing options. The -h flag will suppress these header records so that the output file contains only object descrip- tors. This option is useful for producing files which describe only part of a scene, and which are to be later combined with descriptor files produced by other programs. -b By default rods will describe bonds only; the -b flag will cause it to include spheres at the atom positions also, yielding a ball-and- stick representation. -radius R By default rods will draw bonds as cylinders with a 0.2A radius. The radius option allows you to change this cylindrical radius. -Bcolor Bmin Bmax Assign colors based on B values rather than from atom or residue types. Atoms with B <= Bmin will be colored dark blue; atoms with B >= Bmax will be colored light red; atoms with Bmin < B < Bmax will be assigned colors shading smoothly through the spectrum from blue to red. DESCRIPTION
The input to rods consists of a single text file containing colour information and atomic coordinates in PDB data bank format. Coordinates are output as Raster3D descriptor records with colours and sphere radii assigned according to the COLO records described below. Ball-and- stick figures have atoms drawn at 0.2 * VanderWaals radius, connected by rods with a default 0.2A cylindrical radius. Bonds are drawn for atoms which lie closer to each other than 0.6 * (sum of VanderWaals radii). By default the output file contains a set of header records as required by the render program. The header is constructed to include a TMAT matrix corresponding to the transformation matrix contained in file setup.matrix (if it exists), or to the Eulerian angles contained in file setup.angles (if it exists). Colours are assigned to atoms using a matching process, using COLOUR records prepended to the input PDB file. If no COLOUR records are present in the input file, atoms will receive default CPK colors (C=grey, O=red, N=blue, S=yellow, P=green, other=magenta). Raster3D uses a pseudo-PDB record type with the same basic layout as the above but with COLO in the first 4 columns: Columns 1 - 4 COLO 7 - 30 Mask (described below) 31 - 38 Red component 39 - 46 Green component 47 - 54 Blue component 55 - 60 van der Waals radius in Angstroms 61 - 80 Comments Note that the Red, Green, and Blue components are in the same positions as the X, Y, and Z components of an ATOM or HETA record, and the van der Waals radius goes in place of the Occupancy. The Red, Green, and Blue components must all be in the range 0 to 1. The Mask field is used in the matching process as follows. First the program reads in and stores all the ATOM, HETA, and COLO records in input order. Then it goes through each stored ATOM/HETA record in turn, and searches for a COLO record that matches the ATOM/HETA record in all of columns 7 through 30. The first such COLO record to be found determines the colour and radius of the atom. In order that one COLO record can provide colour and radius specifications for more than one atom (e.g., based on residue or atom type, or any other criterion for which labels can be given somewhere in columns 7 through 30), the "#" symbol is used as a wildcard. I.e. a # in a COLO record matches any character in the corresponding column in an ATOM or HETA record. All other characters must match literally to count as a match. Note that the very last COLO record in the input should have # symbols in all of columns 7 through 30 in order to pro- vide a colour for any atom whose ATOM/HETA record fails to match any previous COLO record. This idea of matching masks for colour specifi- cations is due to Colin Broughton. ENVIRONMENT
The files setup.matrix and setup.angles, if they exist, affect the header records produced by rods. SOURCE
web URL: http://www.bmsc.washington.edu/raster3d/raster3d.html contact: Ethan A Merritt University of Washington, Seattle WA 98195 merritt@u.washington.edu SEE ALSO
render(l), ribbon(l), balls(l) AUTHORS
Ethan A Merritt Raster3D 8 May 1999 RODS(1)
All times are GMT -4. The time now is 08:23 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy