09-07-2015
Help reformatting input file
Hi,
I have an input file that looks like this (columns are tab delimited:
HTML Code:
Data000005-RA GO:0003735 GO:0005840 GO:0006412
Data000005-RA GO:0003735
Data000009-RA GO:0003735 GO:0005622 GO:0005840 GO:0006412
Data000011-RA GO:0015232 GO:0015886 GO:0016020 GO:0017004
Data000012-RA GO:0005524
Data000012-RA GO:0015991 GO:0016820 GO:0005524
Data000040-RA GO:0004129 GO:0005506 GO:0009055 GO:0009060 GO:0016021 GO:0020037 GO:0055114
This need to be converted into a two column file and expected output is:
HTML Code:
Data000005-RA GO:0003735
Data000005-RA GO:0005840
Data000005-RA GO:0006412
Data000005-RA GO:0003735
Data000009-RA GO:0003735
Data000009-RA GO:0005622
Data000009-RA GO:0005840
Data000009-RA GO:0006412
Data000011-RA GO:0015232
Data000011-RA GO:0015886
Data000011-RA GO:0016020
Data000011-RA GO:0017004
Data000012-RA GO:0005524
Data000012-RA GO:0015991
Data000012-RA GO:0016820
Data000012-RA GO:0005524
Data000040-RA GO:0004129
Data000040-RA GO:0005506
Data000040-RA GO:0009055
Data000040-RA GO:0009060
Data000040-RA GO:0016021
Data000040-RA GO:0020037
Data000040-RA GO:0055114
Thanks for your help.
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hi,
How can I reformat a file (text file) using unix command.
This file was FTP'd from Mainframe and contains some garbage character at the end of each line.
Each line contains special characters '<soh>' at the end which should have been spaces when I view it in emacs or nedit. I couldnt do find... (2 Replies)
Discussion started by: mrjunsy
2 Replies
2. Shell Programming and Scripting
All,
I am trying to figure out a script to run in windows that will allow me to match on First column in file1 to 8th Column in File2 then
Insert file1 column2 to file2 column4 then create a new file.
File1:
12345 Sam
12346 Bob
12347 Bill
File2:... (1 Reply)
Discussion started by: darkoth
1 Replies
3. Shell Programming and Scripting
I hopefully have a simple request - I need to process multiple files reformatting the output based on tags at the beginning of each line. So the data for the new 3 lines of the output file are in the HDR line and then the details are in the DTL tagged lines.
for ifile in $indir
do
echo... (1 Reply)
Discussion started by: jason_v_brown
1 Replies
4. Shell Programming and Scripting
I have some xml files that cannot be read using a standard parser, or I am using the wrong parser. The issues seems to be spaces in some of the tags.
Here is a sample,<UgUn 2 >
<Un>
-0.426753
</Un>
</UgUn>The parser isn't able to find the number 2, so that information is lost, etc. It seems... (16 Replies)
Discussion started by: LMHmedchem
16 Replies
5. Shell Programming and Scripting
Hi all,
I have an input file like
1,date,company,,
1,date,comapny,,
2,000,,,567,ACT,00,,,,KKG,M1,D45,,67J,+4500000000
2,000,,,567,ACT,00,,,,KKG,M6,D49,,56J,+6000
2,000,,,567,ACT,00,,7,,KKG,M3,D58,,68h,-70000
2,000,,,567,ACT,00,,,,KKG,M9,D95,,34m,0.00
3,total
what i require is
1.I... (2 Replies)
Discussion started by: selvankj
2 Replies
6. Shell Programming and Scripting
Dear experts,
my problem is pretty tricky.
I want to change a file (see attached input.txt), according to another file (help.txt). The output that is desired is in output.txt. The example is attached.
Note that
-dashes should not be treated specially, they are considered normal characters,... (2 Replies)
Discussion started by: TheTransporter
2 Replies
7. Shell Programming and Scripting
Hi, I have a single colum file and I need to reformat the file so that it creates a new line every time it come to an IP address and the following lines are corresponding rows until it comes to the next IP address.
I want to turn this
172.xx.xx.xx
gwpusprdrp02_pv
seinwnprd03... (7 Replies)
Discussion started by: kieranfoley
7 Replies
8. Shell Programming and Scripting
Dear ALL,
I would really appreciate if you could help me in reformatting a file in this way:
The file refers to a list of genetic coordinates, each lines has a score value and the associated chromosome is listed in the line starting with chrom .
If more coordinates are found, the start... (2 Replies)
Discussion started by: paolo.kunder
2 Replies
9. UNIX for Dummies Questions & Answers
Hi
i want to print line which is mentioned as below
615213:1;20150725;20250722;0|11;20150831;20150831;100|14;20150725;20160723;2
in below format.
'
615213: 1;20150725;20250722;0
615213: 11;20150831;20150831;100
615213: 14;20150725;20160723;2
please help me and suggest me how to... (9 Replies)
Discussion started by: scriptor
9 Replies
10. Shell Programming and Scripting
Hi,
i've got the following output file:
170724_1600 | SYSTEM | 449 | 282 | 167 | 62
170724_1600 | CCS_SCP_DATA | 200 | 88 | 112 | 44
170724_1600 | CCS_SCP_SUBS_I | 2001 | 1751 | 250 | 87
170724_1600 | UIS_CDR_INDEX | 2001 | 1 | 2000 | 0
170724_1600 | LCP_INDEX | 200 | 5 | 195 | 2... (4 Replies)
Discussion started by: nms
4 Replies
LEARN ABOUT OSX
html::filter5.16
HTML::Filter(3) User Contributed Perl Documentation HTML::Filter(3)
NAME
HTML::Filter - Filter HTML text through the parser
NOTE
This module is deprecated. The "HTML::Parser" now provides the functionally of "HTML::Filter" much more efficiently with the the "default"
handler.
SYNOPSIS
require HTML::Filter;
$p = HTML::Filter->new->parse_file("index.html");
DESCRIPTION
"HTML::Filter" is an HTML parser that by default prints the original text of each HTML element (a slow version of cat(1) basically). The
callback methods may be overridden to modify the filtering for some HTML elements and you can override output() method which is called to
print the HTML text.
"HTML::Filter" is a subclass of "HTML::Parser". This means that the document should be given to the parser by calling the $p->parse() or
$p->parse_file() methods.
EXAMPLES
The first example is a filter that will remove all comments from an HTML file. This is achieved by simply overriding the comment method to
do nothing.
package CommentStripper;
require HTML::Filter;
@ISA=qw(HTML::Filter);
sub comment { } # ignore comments
The second example shows a filter that will remove any <TABLE>s found in the HTML file. We specialize the start() and end() methods to
count table tags and then make output not happen when inside a table.
package TableStripper;
require HTML::Filter;
@ISA=qw(HTML::Filter);
sub start
{
my $self = shift;
$self->{table_seen}++ if $_[0] eq "table";
$self->SUPER::start(@_);
}
sub end
{
my $self = shift;
$self->SUPER::end(@_);
$self->{table_seen}-- if $_[0] eq "table";
}
sub output
{
my $self = shift;
unless ($self->{table_seen}) {
$self->SUPER::output(@_);
}
}
If you want to collect the parsed text internally you might want to do something like this:
package FilterIntoString;
require HTML::Filter;
@ISA=qw(HTML::Filter);
sub output { push(@{$_[0]->{fhtml}}, $_[1]) }
sub filtered_html { join("", @{$_[0]->{fhtml}}) }
SEE ALSO
HTML::Parser
COPYRIGHT
Copyright 1997-1999 Gisle Aas.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
perl v5.16.2 2008-04-04 HTML::Filter(3)