Post 302359965 by patrick87 on Wednesday 7th of October 2009 08:59:21 PM
Shell script or command help to extract specific contents from a long list of content

Hi,

I have a long list of contents:
>sequence_1
ASSSSSSSSSSSDDDDDDDDDDDCCCCCCC
ASDSFDFFDFDFFWERERERERFSDFESFSFD
>sequence_2
ASDFDFDFFDDFFDFDSFDSFDFSDFSDFDSFASDSADSADASD
ASDFFDFDFASFASFASFAFSFFSDASFASFASFAFS
>sequence_3
VEDFGSDGSDGSDGSDGSDGSDGSDG
dDFSDFSDFSDFSDFSDFSDFSDFSDF
SDGFDGSFDGSGSDGSDGSDGSDGSDG
>sequence_4
VEDFGSDGSDGSDGSDGSDGSDGSDG
dDFSDFSDFSDFSDFSDFSDFSDFSDF
>sequence_5
ASSSSSSSSSSSDDDDDDDDDDDCCCCCCC
>sequence_6
ASDFDFDFFDDFFDFDSFDSFDFSDFSDFDSFASDSADSADASD
.
.
.
.

My desired output contains only sequence_2, sequence_5 and sequence_6:
>sequence_2
ASDFDFDFFDDFFDFDSFDSFDFSDFSDFDSFASDSADSADASD
ASDFFDFDFASFASFASFAFSFFSDASFASFASFAFS
>sequence_5
ASSSSSSSSSSSDDDDDDDDDDDCCCCCCC
>sequence_6
ASDFDFDFFDDFFDFDSFDSFDFSDFSDFDSFASDSADSADASD

Does anybody know how I can extract specific contents from a long list of content by using a shell script or a command such as sed or awk?
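
One possible approach, sketched with awk: treat every line starting with `>` as a record header, switch printing on when the header is in a wanted list, and off otherwise. The function name `extract_records` and the file name `sequences.txt` are made up for this illustration, and the sketch assumes each header line contains nothing but `>` plus the name.

```shell
#!/bin/sh
# Sample input copied from the post; the file name is hypothetical.
workdir=$(mktemp -d)
cat > "$workdir/sequences.txt" <<'EOF'
>sequence_1
ASSSSSSSSSSSDDDDDDDDDDDCCCCCCC
ASDSFDFFDFDFFWERERERERFSDFESFSFD
>sequence_2
ASDFDFDFFDDFFDFDSFDSFDFSDFSDFDSFASDSADSADASD
ASDFFDFDFASFASFASFAFSFFSDASFASFASFAFS
>sequence_3
VEDFGSDGSDGSDGSDGSDGSDGSDG
dDFSDFSDFSDFSDFSDFSDFSDFSDF
SDGFDGSFDGSGSDGSDGSDGSDGSDG
>sequence_4
VEDFGSDGSDGSDGSDGSDGSDGSDG
dDFSDFSDFSDFSDFSDFSDFSDFSDF
>sequence_5
ASSSSSSSSSSSDDDDDDDDDDDCCCCCCC
>sequence_6
ASDFDFDFFDDFFDFDSFDSFDFSDFSDFDSFASDSADSADASD
EOF

# extract_records NAMES FILE
#   NAMES: comma-separated header names without the leading ">"
#   FILE:  input file in the header/sequence layout shown above
extract_records() {
    awk -v wanted="$1" '
        BEGIN {
            # Build a lookup table of the exact header lines to keep.
            n = split(wanted, names, ",")
            for (i = 1; i <= n; i++) keep[">" names[i]] = 1
        }
        /^>/ { printing = ($0 in keep) }  # header line: switch printing on or off
        printing                          # print header and sequence lines while on
    ' "$2"
}

result=$(extract_records "sequence_2,sequence_5,sequence_6" "$workdir/sequences.txt")
printf '%s\n' "$result"
```

Because the header is compared as a whole line, `sequence_2` does not also select a record such as `sequence_20`.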
 

sed(1)							      General Commands Manual							    sed(1)

Name
       sed - stream text editor

Syntax
       sed [-n] [-e script] [-f sfile] [file...]

Description
       The sed command copies the named files (standard input default) to the standard output, edited according to a script of commands.  The -f
       option causes the script to be taken from file sfile; these options accumulate.  If there is just one -e option and no -f's, the flag -e
       may be omitted.  The -n option suppresses the default output; inclusion in the script of a comment command of the form #n also suppresses the
       default output.  (See the description of the `#' command.)

       A script consists of editing commands of the following form:

	      [address [, address] ] function [arguments]

       Nominally, there is one command per line; but commands can be concatenated on a line by being separated with semicolons.

       In normal operation sed cyclically copies a line of input into a pattern space (unless there is something left after a `D' command), applies in
       sequence all commands whose addresses select that pattern space, and at the end of the script copies the pattern space to the standard
       output (except under -n) and deletes the pattern space.

       An address is either a decimal number that counts input lines cumulatively across files, a `$' that addresses the last line of input, or  a
       context address, `/regular expression/', in the style of ed(1) modified thus:

	  o    In a context address, the construction \?regular expression?, where ? is any character, is identical to /regular expression/.  Note
	       that in the context address \xabc\xdefx, the second x stands for itself, so that the regular expression is abcxdef.

	  o    The escape sequence `\n' matches a new line embedded in the pattern space.

	  o    A command line with no addresses selects every pattern space.

	  o    A command line with one address selects each pattern space that matches the address.

	  o    A command line with two addresses selects the inclusive range from the first pattern space that matches the first  address  through
	       the  next  pattern  space  that matches the second.  (If the second address is a number less than or equal to the line number first
	       selected, only one line is selected.)  Thereafter the process is repeated, looking again for the first address.

       Editing commands can be applied only to non-selected pattern spaces by use of the negation function `!' (below).
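
       The address forms described above can be tried out as follows.  This is a minimal sketch: the file name lines.txt and the variable
       names are invented for the illustration, and -n is used so that only lines selected by `p' are written.

```shell
#!/bin/sh
# Hypothetical five-line input file.
workdir=$(mktemp -d)
printf 'one\ntwo\nthree\nfour\nfive\n' > "$workdir/lines.txt"

second=$(sed -n '2p' "$workdir/lines.txt")              # decimal line number
last=$(sed -n '$p' "$workdir/lines.txt")                # $ addresses the last line
matched=$(sed -n '/thr/p' "$workdir/lines.txt")         # context address
altdelim=$(sed -n '\,two,p' "$workdir/lines.txt")       # same idea with "," as the delimiter
range=$(sed -n '/two/,/four/p' "$workdir/lines.txt")    # inclusive two-address range
negated=$(sed -n '/t/!p' "$workdir/lines.txt")          # ! selects the lines that do NOT match

printf '%s\n' "$second" "$last" "$matched" "$altdelim" "$range" "$negated"
```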

       In the following list of functions the maximum number of permissible addresses for each function is indicated in parentheses.

       An argument denoted text consists of one or more lines, all but the last of which end with `\' to hide the new line.  Backslashes  in  text
       are  treated  like  backslashes in the replacement string of an `s' command, and may be used to protect initial blanks and tabs against the
       stripping that is done on every script line.

       An argument denoted rfile or wfile must terminate the command line and must be preceded by exactly one blank.  Each wfile is created before
       processing begins.  There can be at most 10 distinct wfile arguments.

       (1)a\
       text
	       Append.	Place text on the output before reading the next input line.

       (2)b label
	       Branch to the `:' command bearing the label.  If label is empty, branch to the end of the script.

       (2)c\
       text
	       Change.	 Delete  the  pattern space.  With 0 or 1 address or at the end of a 2-address range, place text on the output.  Start the
	       next cycle.

       (2)d    Delete the pattern space.  Start the next cycle.

       (2)D    Delete the initial segment of the pattern space through the first new line.  Start the next cycle.

       (2)g    Replace the contents of the pattern space by the contents of the hold space.

       (2)G    Append the contents of the hold space to the pattern space.

       (2)h    Replace the contents of the hold space by the contents of the pattern space.

       (2)H    Append the contents of the pattern space to the hold space.

       (1)i\
       text
	       Insert.	Place text on the standard output.

       (2)n    Copy the pattern space to the standard output.  Replace the pattern space with the next line of input.

       (2)N    Append the next line of input to the pattern space with an embedded new line.  (The current line number changes.)

       (2)p    Print.  Copy the pattern space to the standard output.

       (2)P    Copy the initial segment of the pattern space through the first new line to the standard output.

       (1)q    Quit.  Branch to the end of the script.	Do not start a new cycle.

       (2)r rfile
	       Read the contents of rfile.  Place them on the output before reading the next input line.

       (2)s/regular expression/replacement/flags
	       Substitute the replacement string for instances of the regular expression in the pattern space.	Any character may be used  instead
	       of `/'.	For a more complete description see ed(1).  The flags argument is zero or more of:

	       g       Global.	Substitute for all nonoverlapping instances of the regular expression rather than just the first one.

	       p       Print the pattern space if a replacement was made.

	       w wfile Write.  Append the pattern space to wfile if a replacement was made.

       (2)t label
	       Test.   Branch  to  the `:' command bearing the label if any substitutions have been made since the most recent reading of an input
	       line or execution of a `t'.  If label is empty, branch to the end of the script.

       (2)w wfile
	       Write.  Append the pattern space to wfile.

       (2)x    Exchange the contents of the pattern and hold spaces.

       (2)y/string1/string2/
	       Transform.  Replace all occurrences of characters in string1 with the corresponding character in string2.  The lengths  of  string1
	       and string2 must be equal.

       (2)! function
	       Don't.  Apply the function (or group, if function is `{') only to lines not selected by the address(es).

       (0): label
	       This command does nothing; it bears a label for `b' and `t' commands to branch to.

       (1)=    Place the current line number on the standard output as a line.

       (2){    Execute the following commands through a matching `}' only when the pattern space is selected.

       (0)     An empty command is ignored.

       (0)#    With one exception, any line whose first nonblank character is a number sign is a comment and is ignored.  The exception is that if
	       the first such line encountered contains only the number sign followed by the letter `n' the default output is suppressed as if the
	       -n option were in force.
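
       A few of the functions listed above, shown in use.  This is only a sketch under stated assumptions: the file name words.txt and the
       variable names are hypothetical, and the scripts rely on nothing beyond the functions described in this page.

```shell
#!/bin/sh
# Hypothetical three-line input file.
workdir=$(mktemp -d)
printf 'alpha\nbeta\ngamma\n' > "$workdir/words.txt"

# G and h: on every line but the first, append the hold space to the pattern
# space, then save the result back to the hold space; print once at the end.
reversed=$(sed -n '1!G;h;$p' "$workdir/words.txt")

# N: join each line with the one after it ($! skips N on the last line).
joined=$(sed '$!N;s/\n/ /' "$workdir/words.txt")

# s with the g flag replaces every instance on the line, not just the first.
substituted=$(sed 's/a/A/g' "$workdir/words.txt")

# y maps each character of the first string to the one in the same position
# of the second string.
transformed=$(sed 'y/abg/ABG/' "$workdir/words.txt")

# = writes the current line number; restricted to $ it gives the line count.
count=$(sed -n '$=' "$workdir/words.txt")

printf '%s\n' "$reversed" "$joined" "$substituted" "$transformed" "$count"
```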

Options
       -e 'command;command...'
	       Uses command;command...	as the editing script.	If no -f option is given, the -e keyword can be omitted.  For example, the
	       following two commands are functionally identical:
	       % sed -e 's/DIGITAL/Digital/g' summary > summary.out
	       % sed 's/DIGITAL/Digital/g' summary > summary.out

       -f sfile
	       Uses the specified file as the source of commands to be executed.  Can be used with the -e option to apply both explicit commands
	       and a separate script file.

       -n      Suppresses  all	normal	output, writing only lines explicitly written by the `p' or `P' commands or by an `s' command with the `p'
	       flag.
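
       The options can be combined, as in the following sketch.  The script file name fix.sed, the contents of summary and the variable names
       are assumptions made for this illustration only.

```shell
#!/bin/sh
# Hypothetical input file and script file.
workdir=$(mktemp -d)
printf 'DIGITAL unix\nplain line\n' > "$workdir/summary"
printf 's/unix/UNIX/\n' > "$workdir/fix.sed"

# -e and -f together: the explicit command and the script file are both applied.
both=$(sed -e 's/DIGITAL/Digital/g' -f "$workdir/fix.sed" "$workdir/summary")

# -n with the p flag of s: only lines on which a replacement was made are written.
only_changed=$(sed -n -e 's/DIGITAL/Digital/p' "$workdir/summary")

printf '%s\n' "$both" "$only_changed"
```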

See Also
       awk(1), ed(1), grep(1), lex(1)

																	    sed(1)