subsetting data


 
# 15  
Old 04-03-2010
Quote:
Originally Posted by drl
Hi.

Observations:

1) there was no "chr10", "chr11" in your sample data set.

2) Your pattern file includes the 3 characters "chr1" on one of the lines. Where do you think a space should be placed?

Best wishes ... cheers, drl
Hi drl

I used a different file in which the lines start like this:
chr1_1 strand.....
chr17_498 strand.....
chr2_0 strand.....
chr17_48 strand.....

The space is before 'strand'.

I really appreciate your help.
# 16  
Old 04-03-2010
Hi.

I don't understand your response. Are you using a completely different format for the data file, is the problem solved, or what? ... cheers, drl
# 17  
Old 04-03-2010
Quote:
Originally Posted by drl
Hi.

I don't understand your response. Are you using a completely different format for the data file, is the problem solved, or what? ... cheers, drl
I guess it is a different format. Here are a few lines of the data:

data1

>chr1_12 strand:+ excise_beg:16047184 excise_end:16047293
ACTTACCCGAGAACGTGCGGGAAGAGAAGATCATCGAGCATTTCAAACGGTGAGTGACACG
>chr13_124 strand:+ excise_beg:16047141 excise_end:16047250
CCGCCCAGCATGGTCCGGGAAACCAGGCATCTCTGGGTGGGCAACTTACCCGAGAACGTGCG
>chr9_2 strand:+ excise_beg:44109537 excise_end:44109646
AGGGGACCAGAAGAACCCTGGTAGAGAACTCAGGAGAAGGAGGCTAGGAA


data2
chr1_12
chr13_124
# 18  
Old 04-03-2010
Hi.

As far as I can tell, the problem is then solved.
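
For anyone finding this thread later, here is a minimal sketch of one way to do this kind of subsetting. It assumes the layout shown in the previous post: data1 holds records that start with a ">" header line followed by one or more sequence lines, and data2 holds one ID per line. It is written in Python rather than shell, and the names read_ids and subset_records are made up for this illustration. The ID is compared as a whole token (the text between ">" and the first space), so chr1_1 does not also select chr1_12.

```python
import io
import sys


def read_ids(handle):
    """Return the set of IDs in handle, one per non-empty line."""
    return {line.strip() for line in handle if line.strip()}


def subset_records(handle, ids):
    """Yield the header and sequence lines of every record whose ID is in ids."""
    keep = False
    for line in handle:
        if line.startswith(">"):
            # The ID is the text between '>' and the first whitespace.
            fields = line[1:].split(None, 1)
            keep = bool(fields) and fields[0] in ids
        if keep:
            yield line


if __name__ == "__main__":
    # Small in-memory stand-ins for data1 and data2 (sequences shortened).
    data1 = io.StringIO(
        ">chr1_12 strand:+ excise_beg:16047184 excise_end:16047293\n"
        "ACTTACCCGAGAACGTGCGG\n"
        ">chr13_124 strand:+ excise_beg:16047141 excise_end:16047250\n"
        "CCGCCCAGCATGGTCCGGGA\n"
        ">chr9_2 strand:+ excise_beg:44109537 excise_end:44109646\n"
        "AGGGGACCAGAAGAACCCTG\n"
    )
    data2 = io.StringIO("chr1_12\nchr13_124\n")

    # Only the chr1_12 and chr13_124 records are written out.
    sys.stdout.writelines(subset_records(data1, read_ids(data2)))
```

To run it on real files, open data2 and data1 with open() and pass the file objects in place of the in-memory samples.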

If you ask questions in the future, I suggest:

1) post a representative data set,

2) place CODE tags around data and program (script) text. You can do this by dragging your selection pointer across the text of interest, and, while it remains selected, click the # just above the forum editing window. You will then end up with text
Code:
like this

which makes it much easier for responders to read.

Best wishes ... cheers, drl
 