find strings at the start of a particular column


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting find strings at the start of a particular column
# 1  
Old 12-09-2010
find strings at the start of a particular column

Hey,

I have not posted here in a while. I have a biological data file that has 4 columns and I am interested in column 2. Column 5 contains a series of letters AGCT.

Basically the file looks something like this

Code:
Name   AGGTTTTCCCCCCC  L  Q
Name1  AGGTTTTAAAAAACC  L  Q
Name2  ATTGGGGGGGGGGG  L  Q

The file is tab delimited and if it column 2 begins with AGG then I want it to go into another file.

Thanks
Kylle
# 2  
Old 12-10-2010
awk ' { if (substr($2,0,3) == "AGG") print $0; } ' data.txt > sorted_data.txt

data.txt contains your regular data sorted_data.txt is the new file with lines that have AGG

Last edited by codecaine; 12-10-2010 at 01:52 AM..
# 3  
Old 12-10-2010
Code:
grep "^IAGG[A-Z]^I" file > newfile            # where ^I represents tab

grep "^.*^IAGG[A-Z]^I.*^I.*$" file > newfile   # for an exact match

R0H0N
# 4  
Old 12-10-2010
could try..
Code:
awk '$2 ~ /^AGG/' inputfile > outfile

This User Gave Thanks to michaelrozar17 For This Post:
# 5  
Old 12-10-2010
Quote:
Originally Posted by michaelrozar17
could try..
Code:
awk '$2 ~ /^AGG/' inputfile > outfile

I like this way. Thank you for the share.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk to find maximum and minimum from column and store in other column

Need your support for below. Please help to get required output If column 5 is INV then only consider column1 and take out duplicates/identical rows/values from column1 and then put minimum value of column6 in column7 and put maximum value in column 8 and then need to do subtract values of... (7 Replies)
Discussion started by: as7951
7 Replies

2. Shell Programming and Scripting

Grepping multiple strings from one column

I have 3-column tab separated data that looks like the following: act of+n-a-large+vn-tell-v 0.067427 act_com of+n+n-a-large-manufacturer-n 0.129922 act-act_com-com in+n-j+vn-pass-aux-restate-v 0.364499666667 com nmod+n-j+ns-invader-n 0.527521 act_com-com obj+n-a-j+vd-contribute-v 0.091413... (2 Replies)
Discussion started by: owwow14
2 Replies

3. Shell Programming and Scripting

awk to sum a column based on duplicate strings in another column and show split totals

Hi, I have a similar input format- A_1 2 B_0 4 A_1 1 B_2 5 A_4 1 and looking to print in this output format with headers. can you suggest in awk?awk because i am doing some pattern matching from parent file to print column 1 of my input using awk already.Thanks! letter number_of_letters... (5 Replies)
Discussion started by: prashob123
5 Replies

4. Shell Programming and Scripting

Converting Single Column into Multiple rows, but with strings to specific tab column

Dear fellows, I need your help. I'm trying to write a script to convert a single column into multiple rows. But it need to recognize the beginning of the string and set it to its specific Column number. Each Line (loop) begins with digit (RANGE). At this moment it's kind of working, but it... (6 Replies)
Discussion started by: AK47
6 Replies

5. Shell Programming and Scripting

Reading first column of file which start with space also

Hi All, I am trying to read first column of my file using command cat temp2_sample.cir|cut -d' ' -f1 The content of my file is as follow R1 pin23I pin27I R2 pin23G pin27G R3 pin27F pin27D RWire10 pin15Y pin23J VCC1 pin27W pin13Y ... (6 Replies)
Discussion started by: diehard
6 Replies

6. Shell Programming and Scripting

Find lines with matching column 1 value, retain only the one with highest value in column 2

I have a file like: I would like to find lines lines with duplicate values in column 1, and retain only one based on two conditions: 1) keep line with highest value in column 3, 2) if column 3 values are equal, retain the line with the highest value in column 4. Desired output: I was able to... (3 Replies)
Discussion started by: pathunkathunk
3 Replies

7. Shell Programming and Scripting

Count no of occurrence of the strings based on column value

Can anyone help me to count number of occurrence of the strings based on column value. Say i have 300 files with 1000 record length from which i need to count the number of occurrence string which is existing from 213 to 219. Some may be unique and some may be repeated. (8 Replies)
Discussion started by: zooby
8 Replies

8. Shell Programming and Scripting

find expression with awk in only one column, and if it fits, print whole column

Hi. How do I find an expression with awk in only one column, and if it fits, then print that whole column. 1 apple oranges 2 bannanas pears 3 cats dogs 4 hesaid shesaid echo "which number:" read NUMBER (user inputs number 2 for this example) awk " /$NUMBER/ {field to search is field... (2 Replies)
Discussion started by: glev2005
2 Replies

9. Shell Programming and Scripting

Deleting repeated strings in column 2

Hi to all, I have a file where the subject could contain "Summarized Availability Report" or only "Summarized Report" If the subject is "Summarized Availability Report" I want to apply it Scrip1 and if the subject is "Summarized Report" I want to apply it Scrip2. 1-) I would like you... (5 Replies)
Discussion started by: cgkmal
5 Replies

10. Shell Programming and Scripting

Grep strings from file and put in Column

Dear Experts, My file contains below- GET:SUB:ISI,432350414557432; RESP:0:MD,019352020633:ISI,432350414557432:T11,1:T21,1:T22,1:B16,1:T62,1:BAIC,0:BAOC,1:BOIC,0:BIRO,0:BORO,0:PAID,1; GET:SUB:ISI,432350414581060;... (2 Replies)
Discussion started by: thepurple
2 Replies
Login or Register to Ask a Question