Sponsored Content
Top Forums Shell Programming and Scripting Help required on Length based lookup Post 302929145 by rramkrishnas on Friday 19th of December 2014 01:07:53 AM
Old 12-19-2014
Dear RudiC,

With some wild guessing I presume that you want to match entries based on the smallest common substring. But some questions remain:

Below are my comments against your Query.
Will abc always have the smallest substring or could that be in bcd as well?

abc.txt Records will have smalest substring as well as the same string will apear in bcd.txt.

Will the smallest substring always precede the longer ones?
Yes, and will be present at bcd.txt

Where is the 121 entry from bcd in the outputs? Where 123456 from abc?
121 entry is an extra entry in bcd.txt, and 123456 is presnt at bcd.txt however in abc.txt 123 record is present hence 123456 should be a match case.

---------- Post updated 12-19-14 at 11:37 AM ---------- Previous update was 12-18-14 at 05:40 PM ----------

Can any one please help me on this
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Need find a file based length

Can some please help me? Want to find files over 35 characters in length? I am running HPUX. Would it be possible with find? Thanks in advance (8 Replies)
Discussion started by: J_ang
8 Replies

2. UNIX for Advanced & Expert Users

Clueless about how to lookup and reverse lookup IP addresses under a file!!.pls help

Write a quick shell snippet to find all of the IPV4 IP addresses in any and all of the files under /var/lib/output/*, ignoring whatever else may be in those files. Perform a reverse lookup on each, and format the output neatly, like "IP=192.168.0.1, ... (0 Replies)
Discussion started by: choco4202002
0 Replies

3. Shell Programming and Scripting

SED based on file lookup

Newb here trying to figure this one out. :confused: I am trying to create a SED (or some other idea) line that will replace the data field if the original text is seen in a separate text file. The lookup file would be line delimted. For example: sed 's/<if in file>/YES/' File structure:... (3 Replies)
Discussion started by: sdlennon
3 Replies

4. Shell Programming and Scripting

Split strings based on length

Hi All I am very much in need of help splitting strings based on length in Perl. e.g., Input text is : International NOUN Corp. NOUN 's POS Tulsa NOUN Output I want is : International I In Int Inte l al nal onal NOUN Corp. C Co Cor Corp . p. rp. orp. NOUN... (2 Replies)
Discussion started by: my_Perl
2 Replies

5. UNIX for Dummies Questions & Answers

Sorting words based on length

i need to write a bash script that recive a list of varuables kaka pele ronaldo beckham zidane messi rivaldo gerrard platini i need the program to print the longest word of the list. word in the output appears on a separate line and word order in the output is in the order Llachsicografi costs.... (1 Reply)
Discussion started by: yairpg
1 Replies

6. UNIX for Dummies Questions & Answers

Length of a segment based on coordinates

Hi, I would like to have the length of a segment based on coordinates of its parts. Example input file: chr11 genes_good3.gtf aggregate_gene 1 100 gene1 chr11 genes_good3.gtf exonic_part 1 60 chr11 genes_good3.gtf exonic_part 70 100 chr11 genes_good3.gtf aggregate_gene 200 1000 gene2... (2 Replies)
Discussion started by: fadista
2 Replies

7. Shell Programming and Scripting

Append spaces the rows to make it into a required fixed length file

I want to make a script to read row by row and find its length. If the length is less than my required length then i hav to append spaces to that paritucular row. Each row contains special characters, spaces, etc. For example my file contains , 12345 abcdef 234 abcde 89012 abcdefgh ... (10 Replies)
Discussion started by: Amrutha24
10 Replies

8. Shell Programming and Scripting

Filtering duplicates based on lookup table and rules

please help solving the following. I have access to redhat linux cluster having 32gigs of ram. I have duplicate ids for variable names, in the file 1,2 are duplicates;3,4 and 5 are duplicates;6 and 7 are duplicates. My objective is to use only the first occurrence of these duplicates. Lookup... (4 Replies)
Discussion started by: ritakadm
4 Replies

9. Shell Programming and Scripting

Append 0's based on length

I'm having data like this, "8955719","186497034","0001","M","3" "8955719","186497034","0002","M","10" "8955719","186497034","0003","M","10" "8955719","186497034","0004","M","3" "8955723","186499034","0001","M","3" "8955723","186499034","0002","M","10" "8955723","186499034","0003","M","10"... (3 Replies)
Discussion started by: Artlk
3 Replies

10. Shell Programming and Scripting

Outputting sequences based on length with sed

I have this file: >ID1 AA >ID2 TTTTTT >ID-3 AAAAAAAAA >ID4 TTTTTTGGAGATCAGTAGCAGATGACAG-GGGGG-TGCACCCC Add I am trying to use this script to output sequences longer than 15 characters: sed -r '/^>/N;{/^.{,15}$/d}' The desire output would be this: >ID4... (8 Replies)
Discussion started by: Xterra
8 Replies
lsearch(n)						       Tcl Built-In Commands							lsearch(n)

__________________________________________________________________________________________________________________________________________________

NAME
lsearch - See if a list contains a particular element SYNOPSIS
lsearch ?options? list pattern _________________________________________________________________ DESCRIPTION
This command searches the elements of list to see if one of them matches pattern. If so, the command returns the index of the first match- ing element (unless the options -all or -inline are specified.) If not, the command returns -1. The option arguments indicates how the elements of the list are to be matched against pattern and must have one of the values below: MATCHING STYLE OPTIONS If all matching style options are omitted, the default matching style is -glob. If more than one matching style is specified, the last matching style given takes precedence. -exact Pattern is a literal string that is compared for exact equality against each list element. -glob Pattern is a glob-style pattern which is matched against each list element using the same rules as the string match command. -regexp Pattern is treated as a regular expression and matched against each list element using the rules described in the re_syntax refer- ence page. -sorted The list elements are in sorted order. If this option is specified, lsearch will use a more efficient searching algorithm to search list. If no other options are specified, list is assumed to be sorted in increasing order, and to contain ASCII strings. This option is mutually exclusive with -glob and -regexp, and is treated exactly like -exact when either -all or -not are specified. GENERAL MODIFIER OPTIONS These options may be given with all matching styles. -all Changes the result to be the list of all matching indices (or all matching values if -inline is specified as well.) If indices are returned, the indices will be in numeric order. If values are returned, the order of the values will be the order of those values within the input list. -inline The matching value is returned instead of its index (or an empty string if no value matches.) If -all is also specified, then the result of the command is the list of all values that matched. -not This negates the sense of the match, returning the index of the first non-matching value in the list. -start index The list is searched starting at position index. The interpretation of the index value is the same as for the command string index, | supporting simple index arithmetic and indices relative to the end of the list. CONTENTS DESCRIPTION OPTIONS These options describe how to interpret the items in the list being searched. They are only meaningful when used with the -exact and -sorted options. If more than one is specified, the last one takes precedence. The default is -ascii. -ascii The list elements are to be examined as Unicode strings (the name is for backward-compatibility reasons.) -dictionary The list elements are to be compared using dictionary-style comparisons (see lsort for a fuller description). Note that this only makes a meaningful difference from the -ascii option when the -sorted option is given, because values are only dictionary-equal when exactly equal. -integer The list elements are to be compared as integers. | -nocase | Causes comparisons to be handled in a case-insensitive manner. Has no effect if combined with the -dictionary, -integer, or -real | options. -real The list elements are to be compared as floating-point values. SORTED LIST OPTIONS These options (only meaningful with the -sorted option) specify how the list is sorted. If more than one is given, the last one takes precedence. The default option is -increasing. -decreasing The list elements are sorted in decreasing order. This option is only meaningful when used with -sorted. -increasing The list elements are sorted in increasing order. This option is only meaningful when used with -sorted. NESTED LIST OPTIONS These options are used to search lists of lists. They may be used with any other options. | -index indexList | This option is designed for use when searching within nested lists. The indexList argument gives a path of indices (much as might | be used with the lindex or lset commands) within each element to allow the location of the term being matched against. | -subindices | If this option is given, the index result from this command (or every index result when -all is also specified) will be a complete | path (suitable for use with lindex or lset) within the overall list to the term found. This option has no effect unless the -index | is also specified, and is just a convenience short-cut. EXAMPLES
Basic searching: lsearch {a b c d e} c -> 2 lsearch -all {a b c a b c} c -> 2 5 Using lsearch to filter lists: lsearch -inline {a20 b35 c47} b* -> b35 lsearch -inline -not {a20 b35 c47} b* -> a20 lsearch -all -inline -not {a20 b35 c47} b* -> a20 c47 lsearch -all -not {a20 b35 c47} b* -> 0 2 This can even do a "set-like" removal operation: lsearch -all -inline -not -exact {a b c a d e a f g a} a -> b c d e f g Searching may start part-way through the list: lsearch -start 3 {a b c a b c} c -> 5 It is also possible to search inside elements: lsearch -index 1 -all -inline {{a abc} {b bcd} {c cde}} *bc* -> {a abc} {b bcd} SEE ALSO
foreach(n), list(n), lappend(n), lindex(n), linsert(n), llength(n), lset(n), lsort(n), lrange(n), lreplace(n), string(n) | KEYWORDS
list, match, pattern, regular expression, search, string Tcl 8.5 lsearch(n)
All times are GMT -4. The time now is 09:44 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy