find if a position is between a given start and end position

# 1

Hi,

I am a newbie in unix programming, so maybe this is a simple question.
I would like to know how I can write a script that outputs only the values that do not fall between any of the given start and end positions.

Example

file1:
2 30
40 80
82 100

file2:
ID1 1
ID2 35
ID3 80
ID4 81
ID6 160

So, I would like to save to a file the lines from file2 whose number in column two does not fall between the start and end positions of any line in file1. Therefore, the desired output would be:

Desired output file:
ID4 81
ID6 160
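
The test described above can be sketched in Python as follows. This is only a minimal illustration, assuming whitespace-separated columns and inclusive start and end positions; the function names `parse_ranges`, `outside_all`, and `filter_lines` are made up for this example. Under that inclusive reading, ID1 (1) and ID2 (35) also lie outside every range in file1, so the sketch keeps four lines rather than the two listed above; the intended rule may differ.

```python
def parse_ranges(text):
    """Parse 'start end' lines into a list of (start, end) integer pairs."""
    return [tuple(map(int, line.split()))
            for line in text.splitlines() if line.strip()]


def outside_all(position, ranges):
    """True if position lies in none of the ranges (inclusive bounds, an assumption)."""
    return not any(start <= position <= end for start, end in ranges)


def filter_lines(file2_text, ranges):
    """Keep the 'ID position' lines whose position is outside every range."""
    kept = []
    for line in file2_text.splitlines():
        if not line.strip():
            continue
        _ident, pos = line.split()
        if outside_all(int(pos), ranges):
            kept.append(line)
    return kept


# Sample data copied from the post.
FILE1 = "2 30\n40 80\n82 100\n"
FILE2 = "ID1 1\nID2 35\nID3 80\nID4 81\nID6 160\n"

result = filter_lines(FILE2, parse_ranges(FILE1))
print("\n".join(result))
```

This version compares every position with every range, which is fine for a handful of lines but slow for large files.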

# 2
Is your example a true example?

I am trying to figure out whether it is better to create an intermediate file or to load the data from your first file into arrays.
How big could the numbers get? Your example has a maximum value of 160.
joeyg
# 3
Hi,

Thanks for the reply. My files are big (about 355000 lines each), and the maximum value is 161000000.
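
One possible reading of the array idea from post #2, given the stated maximum of 161000000, is a lookup table with one byte per position. The sketch below is an assumption about how that could look, not something proposed in the thread; `build_mask` and `is_covered` are hypothetical names, bounds are treated as inclusive, and positions are assumed to be non-negative integers. At full size the table would need roughly 161 MB of memory, so the demo uses the small sample values instead.

```python
def build_mask(ranges, max_position):
    """Return a bytearray where mask[p] == 1 if p lies inside some range."""
    mask = bytearray(max_position + 1)  # one zero byte per position 0..max_position
    for start, end in ranges:
        end = min(end, max_position)    # ignore anything past the end of the table
        if start <= end:
            # Mark start..end inclusive (inclusive bounds are an assumption).
            mask[start:end + 1] = b"\x01" * (end - start + 1)
    return mask


def is_covered(mask, position):
    """True if position is inside the table and marked as covered."""
    return 0 <= position < len(mask) and mask[position] == 1


# Sample values from the first post; real data would use max_position=161000000.
ranges = [(2, 30), (40, 80), (82, 100)]
positions = [1, 35, 80, 81, 160]

mask = build_mask(ranges, max_position=160)
uncovered = [p for p in positions if not is_covered(mask, p)]
print(uncovered)  # [1, 35, 81, 160]
```

The table trades memory for speed: after it is built, each lookup is a single index operation regardless of how many ranges there are.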
# 4

How many conditions (entries) in that first file would there be?

How many records in total do you expect the ranges in your first file to cover?
[Your example has three ranges, and they cover 29 recs + 41 recs + 19 recs = 89 recs]
joeyg
# 5
There would be 355968 entries in the first file and 178845 in the second one.
The ranges of each entry in the first file are always 36.

Best regards
# 6
The ranges of each entry in the first file are always 36?

But, are they also multiples of 36?

(Lots of data to read/process and to validate against.)
joeyg
# 7
No, they are not always multiples of 36.
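
With 355968 ranges and 178845 positions, comparing every position against every range would take about 64 billion comparisons, and because the ranges are not aligned to multiples of 36, a range cannot be located by simple division. One common alternative, sketched here under assumptions, is to sort the ranges, merge any that overlap, and use a binary search per position. The names `merge_ranges` and `make_checker` are hypothetical, bounds are assumed inclusive, and the thread itself does not settle on a method.

```python
from bisect import bisect_right


def merge_ranges(ranges):
    """Sort ranges by start and merge overlapping ones into disjoint intervals."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous interval: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return merged


def make_checker(ranges):
    """Return a function telling whether a position lies inside any range."""
    merged = merge_ranges(ranges)
    starts = [s for s, _ in merged]
    ends = [e for _, e in merged]

    def is_covered(position):
        # Index of the last interval whose start is <= position, or -1 if none.
        i = bisect_right(starts, position) - 1
        return i >= 0 and position <= ends[i]  # inclusive end, an assumption

    return is_covered


# Sample data from the first post.
ranges = [(2, 30), (40, 80), (82, 100)]
records = [("ID1", 1), ("ID2", 35), ("ID3", 80), ("ID4", 81), ("ID6", 160)]

is_covered = make_checker(ranges)
kept = [f"{ident} {pos}" for ident, pos in records if not is_covered(pos)]
print("\n".join(kept))
```

Each lookup costs on the order of log2(355968), about 18 steps, so the whole second file needs only a few million comparisons after the one-time sort. For the real files, the same checker could be applied line by line while reading file2 and writing the kept lines to the output file.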
