Extracting 22-character strings from text using sed/awk?
Here is my task, I feel sure this can be accomplished with see/awk but can't seem to figure out how.
I have large flat file from which I need to extract every case of a pairing of characters (GG) in this case PLUS the previous 20 characters. The output should be a list (which I plan to make non-redunatant using uniq) of every 22-character string that ends in these specific 2 characters.
The input is a just flat file of just a long string of characters without line breaks (usually around 2000-10000 characters each):
The desired output would look like this:
Any help is appreciated!
Last edited by Twinklefingers; 09-15-2013 at 02:57 PM..
i have textfiles that contain a series of lines that look like this:
string0 .................................................... column3a column4a
string1**384y0439 ..................................... column3b column4b... (2 Replies)
There are a lot of ways to extract text from between two strings, but what if those strings occur multiple times and you only want the text from the first two strings? I can't seem to find anything to work here. I'm using sed to process the text after it's extracted, so I prefer a sed answer, but... (4 Replies)
I have a text wich looks like this:
clid=2 cid=6 client_database_id=35 client_nickname=Peter client_type=0|clid=3 cid=22 client_database_id=57 client_nickname=Paul client_type=0|clid=5 cid=22 client_database_id=7 client_nickname=Mary client_type=0|clid=6 cid=22 client_database_id=6... (3 Replies)
Hello,
I want to writte a script that replace two character strings by two variables with the command sed butmy solution doesn't work. I'm written this: sed "s/TTFactivevent/$TTFav/g && s/switchSLL/$SLL/g" templatefile.
I want to replace TTFactivevent by the variable $TTFav, that is a... (4 Replies)
Hi,
I've looked at a few existing posts on this, but they don't seem to work for my inputs.
I have a text file where I want to extract all the text between two strings, every time that occurs.
Eg my input file is
Anna said that she would fetch the bucket.
Anna and Ben moved the bucket.... (9 Replies)
I'd like to remove (do a pattern or precise replacement - this I can handle in SED using Regex )
---AFTER THE 1ST Occurrence ( i.e. on the 2nd occurrence - from the 2nd to fourth occurance ) of a specific string : type 1
-- After the 1st occurrence of 1 string1 till the 1st occurrence of... (4 Replies)
Hi All,
I have a file whose common patter is like this:
.I 1
.U
87049087
.S
Some text here too
.M
This is a text
.T
Some another text here
.P
Name of the book
.W
Some lines of more text. This text needs to be extracted.
.A
more text goes here too
.I 2 (2 Replies)
Hi experts,
Ive got a text file which has the following text which will occur in this format at least one time:
+=========================>>
Some stuff that evreryone should knnow
other stufsjdokajkajokajda
aijhjajcdjajcisajcqsqdqwdqad
<<=========================+
It is likely that... (8 Replies)
Hi Team -
I hope everyone has been well!
I export a file from one of our source systems that gives me more information than I need. The way the file outputs, I need to extract certain strings at different positions on the file and echo them to another file.
I can do this in batch easily,... (2 Replies)
Discussion started by: SIMMS7400
2 Replies
LEARN ABOUT ULTRIX
tr
tr(1) General Commands Manual tr(1)Name
tr - translate characters
Syntax
tr [-cds] [string1[string2]]
Description
The command copies the standard input to the standard output with substitution or deletion of selected characters. Input characters found
in string1 are mapped into the corresponding characters of string2. When string2 is short it is padded to the length of string1 by dupli-
cating its last character. Any combination of the options -cds may be used: -c complements the set of characters in string1 with respect
to the universe of characters whose ASCII codes are 0 through 0377 octal; -d deletes all input characters in string1; -s squeezes all
strings of repeated output characters that are in string2 to single characters.
In either string the notation a-b means a range of characters from a to b in increasing ASCII order. The backslash character () followed
by 1, 2 or 3 octal digits stands for the character whose ASCII code is given by those digits. A followed by any other character stands
for that character.
The following example creates a list of all the words in `file1' one per line in `file2', where a word is taken to be a maximal string of
alphabetics. The second string is quoted to protect from the Shell. 012 is the ASCII code for newline.
tr -cs A-Za-z '