I have two files: file 1
file 2:
file 2 are the search urls that is used for the partial match in file1 at column2. If it has the partial match it has to return the column 1 url with the partial match url in column 2 of file 1 like this:
Desired output:
I am using this script which can give me exact match url pattern at column 2 but cannot work with the partial match which is as follows:
What would be the easiest way to grep all files within a particular directory that match a partial filename? For example, searching all files that begin with "filename.txt" and are appended with the date they were created. I am using Ksh 88, btw. (3 Replies)
I am trying to find a way to test some code, but I need to rewrite a specific URL only from a specific HTTP_HOST
The call goes out to
http://SUB.DOMAIN.COM/showAssignment/7bde10b45efdd7a97629ef2fe01f7303/jsmodule/Nevow.Athena
The ID in the middle is always random due to the cookie.
I... (5 Replies)
egrep -iow '(http*+|www)*' url.txt
is this command logically incorrect to match a url pattern inside a file and display only the urls in the terminal???
Please rectify the error in my syntax , (2 Replies)
Hello, this is probably a simple request but I've been toying with it for a while.
I have a large list of devices and commands that were run with a script, now I have lines such as:
a-router-hostname-C#show ver
I want to print everything up to (and excluding) the # and everything after it... (3 Replies)
Here is what I have so far:
find . -name "*php*" -or -name "*htm*" | xargs grep -i iframe | awk -F'"' '/<iframe*/{gsub(/.\*iframe>/,"\"");print $2}'
Here is an example content of a PHP or HTM(HTML) file:
<iframe src="http://ADDRESS_1/?click=5BBB08\" width=1 height=1... (18 Replies)
Hello,
Am very new to perl , please help me here !!
I need help in reading a URL from command line using PERL:: Mechanize and needs all the contents from the URL to get into a file.
below is the script which i have written so far ,
#!/usr/bin/perl
use LWP::UserAgent;
use... (2 Replies)
In the awk below I am trying to cp and paste each matching line in f2 to $3 in f1 if $2 of f1 is in the line in f2 somewhere. There will always be a match (usually more then 1) and my actual data is much larger (several hundreds of lines) in both f1 and f2. When the line in f2 is pasted to $3 in... (4 Replies)
I have a two file as shown below,
file:1
>Contig_152_415 (REVERSE SENSE)
>Contig_152_420 (REVERSE SENSE)
>Contig_152_472 (REVERSE SENSE)
>Contig_152_484 (REVERSE SENSE)
File:2
>Contig_152:49081-49929
ATCGAGCAGCGCCGCGTGCGGTGCACCCTTGTGCAGATCGGGAGTAACCACGCGCACGGC... (2 Replies)
Discussion started by: dineshkumarsrk
2 Replies
LEARN ABOUT OSX
html::formattext
HTML::FormatText(3) User Contributed Perl Documentation HTML::FormatText(3)NAME
HTML::FormatText - Format HTML as plaintext
VERSION
version 2.10
SYNOPSIS
use HTML::TreeBuilder;
$tree = HTML::TreeBuilder->new->parse_file("test.html");
use HTML::FormatText;
$formatter = HTML::FormatText->new(leftmargin => 0, rightmargin => 50);
print $formatter->format($tree);
or, more simply:
use HTML::FormatText;
my $string = HTML::FormatText->format_file(
'test.html',
leftmargin => 0, rightmargin => 50
);
DESCRIPTION
HTML::FormatText is a formatter that outputs plain latin1 text. All character attributes (bold/italic/underline) are ignored. Formatting
of HTML tables and forms is not implemented.
HTML::FormatText is built on HTML::Formatter and documentation for that module applies to this - especially "new" in HTML::Formatter,
"format_file" in HTML::Formatter and "format_string" in HTML::Formatter.
You might specify the following parameters when constructing the formatter:
leftmargin (alias lm)
The column of the left margin. The default is 3.
rightmargin (alias rm)
The column of the right margin. The default is 72.
SEE ALSO
HTML::Formatter
INSTALLATION
See perlmodinstall for information and options on installing Perl modules.
BUGS AND LIMITATIONS
No bugs have been reported.
Please report any bugs or feature requests through the web interface at http://rt.cpan.org/Public/Dist/Display.html?Name=HTML-Format
<http://rt.cpan.org/Public/Dist/Display.html?Name=HTML-Format>.
AVAILABILITY
The project homepage is http://search.cpan.org/dist/HTML-Format <http://search.cpan.org/dist/HTML-Format>.
The latest version of this module is available from the Comprehensive Perl Archive Network (CPAN). Visit <http://www.perl.com/CPAN/> to
find a CPAN site near you, or see http://search.cpan.org/dist/HTML-Format/ <http://search.cpan.org/dist/HTML-Format/>.
The development version lives at http://github.com/nigelm/html-format <http://github.com/nigelm/html-format> and may be cloned from
git://github.com/nigelm/html-format.git <git://github.com/nigelm/html-format.git>. Instead of sending patches, please fork this project
using the standard git and github infrastructure.
AUTHORS
o Nigel Metheringham <nigelm@cpan.org>
o Sean M Burke <sburke@cpan.org>
o Gisle Aas <gisle@ActiveState.com>
COPYRIGHT AND LICENSE
This software is copyright (c) 2011 by Nigel Metheringham, 2002-2005 Sean M Burke, 1999-2002 Gisle Aas.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.
perl v5.16.2 2013-08-25 HTML::FormatText(3)