hii,
i want to know the shell command for finding the last occurance of a substring in string..
i can use grep command or sed to find out the occurance of a substring in a string but how do i find out the last occurance.shud i use grep amd and cut the string everytime and store it in a new... (7 Replies)
Hi,
I have a file with 3 columns in it that are comma separated and it has about 5000 lines. What I want to do is find the most common value in column 3 using awk or a shell script or whatever works! I'm totally stuck on how to do this.
e.g.
value1,value2,bob
value1,value2,bob... (12 Replies)
I have million's of records each containing exactly 50 characters and have to check the uniqueness of 4 character substring of 50 character (postion known prior) and report if any duplicates are found.
Eg. data...
AAAA00000000000000XXXX0000 0000000000... upto50 chars... (2 Replies)
Good Morning/Afternoon All,
I am using the nawk utility in korn shell to find the longest field and display that result.
My Data is as follows:
The cat ran
The elephant ran
Milly ran too
We all ran
I have tried nawk '{ if (length($1) > len) len=length($1); print $1}' filename
The... (5 Replies)
Hi,
I am trying to figure out how to get the length of the longest column in the entire file (because the length varies from one row to the other)
I was doing this at first to check how many fields I have for the first row:
awk '{print NF; exit}' file
Now, I can do this:
awk '{ if... (4 Replies)
Hello, I would like to know what is the three most abundant substrings of length 6 from col2. The file is quite large and looks like this
col1 col2
EN03 typehellobyedogcatcatdog
EN09 typehellobyebyebyebye
EN08 dogcatcatdogbyebyebyebye
EN09 catcattypehellobyebyebyebye... (9 Replies)
Hello gurus,
I have a database of possible primary signal strings
pp22
pt22dx
pp22dx
jty2234
Also I have a list of scrambled signals which has a shorter string and a longer string separated by // (double slash ). Always the shorter string of a scrambled signal will have the primary... (6 Replies)
Hello, I need to find the intersection across 10 columns. Kindly help.
my file (INPUT.csv) looks like this
4_R 4_S 8_R 8_S 12_R 12_S 24_R 24_S
LOC_Os01g01010 LOC_Os01g01010 LOC_Os01g01010 LOC_Os04g48290 LOC_Os01g01010 LOC_Os01g01010... (1 Reply)
I have a text file that has some data like:
PADHOGOA1 IOP055_VINREG5_1 ( .IO(VINREG5_1), .MONI(), .MON_D(px_IOP055_VINREG5_1_MON_D), .R0T(px_IOP054_VINREG5_0_R0T), .IO1() );
PADV30MA0 IOP056_VOUT3_IN ( .IO(VOUT3_IN), .V30M(px_IOP056_VOUT3_IN_V30M));
PADV30MA0 IOP057_VOUT3_OUT (... (2 Replies)
Hi,
Let's say I have a pipe-separated input like so:
name_10|A|BCCC|cat_1
name_11|B|DE|cat_2
name_10|A|BC|cat_3
name_11|B|DEEEEEE|cat_4
Using awk, for records with common field 2, I am trying to replace all the shortest substrings by the longest string in field 3.
In order to get the... (5 Replies)
Discussion started by: beca123456
5 Replies
LEARN ABOUT DEBIAN
lid
LID(1) User Commands LID(1)NAME
lid - Query ID database and report results
SYNOPSIS
lid [OPTION]... PATTERN...
DESCRIPTION
Query ID database and report results. By default, output consists of multiple lines, each line containing the matched identifier followed
by the list of file names in which it occurs.
-f, --file=FILE
file name of ID database
-i, --ignore-case
match PATTERN case insensitively
-l, --literal
match PATTERN as a literal string
-r, --regexp
match PATTERN as a regular expression
-w, --word
match PATTERN as a delimited word
-s, --substring
match PATTERN as a substring
Note: If PATTERN contains extended regular expression metacharacters, it is interpreted as a regular expression substring. Other-
wise, PATTERN is interpreted as a literal word.
-k, --key=STYLE
STYLE is one of `token', `pattern' or `none'
-R, --result=STYLE
STYLE is one of `filenames', `grep', `edit' or `none'
-S, --separator=STYLE
STYLE is one of `braces', `space' or `newline' and only applies to file names when `--result=filenames'
The above STYLE options control how query results are presented. Defaults are --key=token --result=filenames --separator=space
-F, --frequency=FREQ
find tokens that occur FREQ times, where FREQ is a range expressed as `N..M'. If N is omitted, it defaults to 1, if M is omitted it
defaults to MAX_USHRT
-a, --ambiguous=LEN
find tokens whose names are ambiguous for LEN chars
-x, --hex
only find numbers expressed as hexadecimal
-d, --decimal
only find numbers expressed as decimal
-o, --octal
only find numbers expressed as octal
By default, searches match numbers of any radix.
--help display this help and exit
--version
output version information and exit
REPORTING BUGS
Report bugs to bug-idutils@gnu.org
lid - 4.5 August 2012 LID(1)