Extract a pattern from xml file Post: 302688715

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

How to extract text from xml file

I have some xml files that got created by exporting a website from RedDot. I would like to extract the cost, course number, description, and meeting information. <?xml version="1.0" encoding="UTF-16" standalone="yes" ?> - <PAG PAG0="3AE6FCFD86D34896A82FCA3B7B76FF90" PAG3="525312"...

2. Shell Programming and Scripting

extract a number within an xml file

Hi Everyone, I have an sh script that I am working on and I have run into a little snag that I am hoping someone here can assist me with. I am using wget to retrieve an xml file from thetvdb.com. This part works ok but what I need to be able to do is extract the series ID # from the xml and put...

3. UNIX for Dummies Questions & Answers

Extract Field Value from XML file

Hi, Within a UNIX shell script I need to extract a value from an XML field. The field will contain different values but will always be 6 digits in length. E.g.: <provider-id>999999</provider-id> I've tried various ways but no luck. Any ideas how I might get the provider id (in this case...

4. Shell Programming and Scripting

Extract XML content from a file

310439 2012-01-11 03:44:42,291 INFO PutServlet:? - Content of the Message is:="1.0" encoding="UTF-8"?><ESP_SSIA_ACC_FEED> 310440 <BATCH_ID>12345678519</BATCH_ID> 310441 <UID>3498748823</UID> 310442 <FEED_TYPE>FULL</FEED_TYPE> 310443 <MART_NAME>SSIA_DM_TRANSACTIONS</MART_NAME> 310444...

5. Shell Programming and Scripting

extract a pattern from a xml file

Hello All, I want to write a shell script to extract content from an XML file. The XML file looks like this: <Variable name="moreAxleInfo"> <type> <Table> <type> <NamedType> <type> <TypeRef...

6. Shell Programming and Scripting

Extract XML tag value from file

Hello, Hope you are doing fine. I have a log file that looks as follows: Some junk text1 Date: Thu Mar 15 13:38:46 CDT 2012 DATA SENT SUCCESSFULL: Some jun text 2 Date: Thu Mar 15 13:38:46 CDT 2012 DATA SENT SUCCESSFULL: ...

7. Shell Programming and Scripting

Extract data from XML file

Hi, I have an XML input file. The input data is as follows: #complex.xml <?xml version="1.0" encoding="UTF-8"?> <TEST_doc xmlns="http://www.w3.org/2001/XMLSchema-instance"> <ENTRY uid="123456"> <protein> <name>PROT001</name> <organism>Human</organism> ...

8. Shell Programming and Scripting

Get extract text from xml file

Hi Colleague, I have a file, say a.xml. It has the contents <bpelFault><faultType>1</faultType><genericSystemFault xmlns=""><part name="payload"><v2:Fault...

9. Shell Programming and Scripting

Extract a particular xml only from an xml jar file

Hi, I need help with how to extract only one particular XML file from an XML jar file. Thanks!

10. Shell Programming and Scripting

Extract a value from an xml file

I have a file in this XML format, all on one line: Fri Dec 23 00:14:52 2016 Logged Message:689|<?xml version="1.0" encoding="UTF-8"?><PORT_RESPONSE><HEADER><ORIGINATOR>XMG</ORIGINATOR><DESTINAT...

Mojo::DOM(3pm)						User Contributed Perl Documentation					    Mojo::DOM(3pm)

NAME

       Mojo::DOM - Minimalistic HTML5/XML DOM parser with CSS3 selectors

SYNOPSIS

	 use Mojo::DOM;

	 # Parse
	 my $dom = Mojo::DOM->new('<div><p id="a">A</p><p id="b">B</p></div>');

	 # Find
	 my $b = $dom->at('#b');
	 say $b->text;

	 # Walk
	 say $dom->div->p->[0]->text;
	 say $dom->div->children('p')->first->{id};

	 # Iterate
	 $dom->find('p[id]')->each(sub { say shift->{id} });

	 # Loop
	 for my $e ($dom->find('p[id]')->each) {
	   say $e->text;
	 }

	 # Modify
	 $dom->div->p->[1]->append('<p id="c">C</p>');

	 # Render
	 say $dom;

DESCRIPTION

       Mojo::DOM is a minimalistic and relaxed HTML5/XML DOM parser with CSS3 selector support. It will even try to interpret broken XML, so you
       should not use it for validation.

CASE SENSITIVITY

       Mojo::DOM defaults to HTML5 semantics, which means all tags and attributes are lowercased and selectors need to be lowercase as well.

	 my $dom = Mojo::DOM->new('<P ID="greeting">Hi!</P>');
	 say $dom->at('p')->text;
	 say $dom->p->{id};

       If XML processing instructions are found, the parser will automatically switch into XML mode and everything becomes case sensitive.

	 my $dom = Mojo::DOM->new('<?xml version="1.0"?><P ID="greeting">Hi!</P>');
	 say $dom->at('P')->text;
	 say $dom->P->{ID};

       XML detection can also be disabled with the "xml" method.

	 # Force XML semantics
	 $dom->xml(1);

	 # Force HTML5 semantics
	 $dom->xml(0);
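
       The difference between the two modes can be seen on a fragment that carries no XML declaration. The following is a minimal sketch, not
       part of the original manual; the "Item" markup is made up for illustration, and only "new", "xml", "parse", "at" and hash attribute
       access as documented on this page are assumed to be available.

```perl
use strict;
use warnings;
use Mojo::DOM;

# Hypothetical fragment with mixed-case names and no <?xml ...?> declaration
my $fragment = '<Item ID="42">Widget</Item>';

# Default HTML5 semantics: tag and attribute names are lowercased on parse,
# so the selector and the attribute key must be lowercase too
my $html = Mojo::DOM->new($fragment);
print $html->at('item')->{id}, "\n";    # prints 42

# Forced XML semantics: names keep their case and matching is case sensitive
my $xml = Mojo::DOM->new->xml(1)->parse($fragment);
print $xml->at('Item')->{ID}, "\n";     # prints 42

# In XML mode the lowercase selector no longer matches anything
print defined $xml->at('item') ? "found\n" : "not found\n";
```

       Forcing the mode explicitly avoids depending on whether the input happens to start with an XML declaration.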

METHODS

       Mojo::DOM implements the following methods.

   "new"
	 my $dom = Mojo::DOM->new;
	 my $dom = Mojo::DOM->new('<foo bar="baz">test</foo>');

       Construct a new Mojo::DOM object.

   "all_text"
	 my $trimmed   = $dom->all_text;
	 my $untrimmed = $dom->all_text(0);

       Extract all text content from the DOM structure, including text in child elements. Smart whitespace trimming is enabled by default.

	 # "foo bar baz"
	 $dom->parse("<div>foo\n<p>bar</p>baz\n</div>")->div->all_text;

	 # "foo\nbarbaz\n"
	 $dom->parse("<div>foo\n<p>bar</p>baz\n</div>")->div->all_text(0);

   "append"
	 $dom = $dom->append('<p>Hi!</p>');

       Append to element.

	 # "<div><h1>A</h1><h2>B</h2></div>"
	 $dom->parse('<div><h1>A</h1></div>')->at('h1')->append('<h2>B</h2>');

   "append_content"
	 $dom = $dom->append_content('<p>Hi!</p>');

       Append to element content.

	 # "<div><h1>AB</h1></div>"
	 $dom->parse('<div><h1>A</h1></div>')->at('h1')->append_content('B');

   "at"
	 my $result = $dom->at('html title');

       Find a single element with CSS3 selectors. All selectors from Mojo::DOM::CSS are supported.

	 # Find first element with "svg" namespace definition
	 my $namespace = $dom->at('[xmlns:svg]')->{'xmlns:svg'};

   "attrs"
	 my $attrs = $dom->attrs;
	 my $foo   = $dom->attrs('foo');
	 $dom	   = $dom->attrs({foo => 'bar'});
	 $dom	   = $dom->attrs(foo => 'bar');

       Element attributes.

   "charset"
	 my $charset = $dom->charset;
	 $dom	     = $dom->charset('UTF-8');

       Alias for "charset" in Mojo::DOM::HTML.

   "children"
	 my $collection = $dom->children;
	 my $collection = $dom->children('div');

       Return a Mojo::Collection object containing the children of this element, similar to "find".

	 # Show type of random child element
	 say $dom->children->shuffle->first->type;

   "content_xml"
	 my $xml = $dom->content_xml;

       Render content of this element to XML.

	 # "<b>test</b>"
	 $dom->parse('<div><b>test</b></div>')->div->content_xml;

   "find"
	 my $collection = $dom->find('html title');

       Find elements with CSS3 selectors and return a Mojo::Collection object. All selectors from Mojo::DOM::CSS are supported.

	 # Find a specific element and extract information
	 my $id = $dom->find('div')->[23]{id};

	 # Extract information from multiple elements
	 my @headers = $dom->find('h1, h2, h3')->map(sub { shift->text })->each;

   "namespace"
	 my $namespace = $dom->namespace;

       Find element namespace.

	  # Find namespace for an element with namespace prefix
	  my $namespace = $dom->at('svg > svg:circle')->namespace;

	  # Find namespace for an element that may or may not have a namespace prefix
	  my $namespace = $dom->at('svg > circle')->namespace;

   "parent"
	 my $parent = $dom->parent;

       Parent of element.

   "parse"
	 $dom = $dom->parse('<foo bar="baz">test</foo>');

       Alias for "parse" in Mojo::DOM::HTML.

	 # Parse UTF-8 encoded XML
	 my $dom = Mojo::DOM->new->charset('UTF-8')->xml(1)->parse($xml);

   "prepend"
	 $dom = $dom->prepend('<p>Hi!</p>');

       Prepend to element.

	 # "<div><h1>A</h1><h2>B</h2></div>"
	 $dom->parse('<div><h2>B</h2></div>')->at('h2')->prepend('<h1>A</h1>');

   "prepend_content"
	 $dom = $dom->prepend_content('<p>Hi!</p>');

       Prepend to element content.

	 # "<div><h2>AB</h2></div>"
	 $dom->parse('<div><h2>B</h2></div>')->at('h2')->prepend_content('A');

   "replace"
	 $dom = $dom->replace('<div>test</div>');

       Replace elements.

	 # "<div><h2>B</h2></div>"
	 $dom->parse('<div><h1>A</h1></div>')->at('h1')->replace('<h2>B</h2>');

   "replace_content"
	 $dom = $dom->replace_content('test');

       Replace element content.

	 # "<div><h1>B</h1></div>"
	 $dom->parse('<div><h1>A</h1></div>')->at('h1')->replace_content('B');

   "root"
	 my $root = $dom->root;

       Find root node.

   "text"
	 my $trimmed   = $dom->text;
	 my $untrimmed = $dom->text(0);

       Extract text content from this element only (not including child elements). Smart whitespace trimming is enabled by default.

	 # "foo baz"
	 $dom->parse("<div>foo\n<p>bar</p>baz\n</div>")->div->text;

	 # "foo\nbaz\n"
	 $dom->parse("<div>foo\n<p>bar</p>baz\n</div>")->div->text(0);

   "text_after"
	 my $trimmed   = $dom->text_after;
	 my $untrimmed = $dom->text_after(0);

       Extract text content immediately following this element. Smart whitespace trimming is enabled by default.

	 # "baz"
	 $dom->parse("<div>foo\n<p>bar</p>baz\n</div>")->div->p->text_after;

	 # "baz\n"
	 $dom->parse("<div>foo\n<p>bar</p>baz\n</div>")->div->p->text_after(0);

   "text_before"
	 my $trimmed   = $dom->text_before;
	 my $untrimmed = $dom->text_before(0);

       Extract text content immediately preceding this element. Smart whitespace trimming is enabled by default.

	 # "foo"
	 $dom->parse("<div>foo\n<p>bar</p>baz\n</div>")->div->p->text_before;

	 # "foo\n"
	 $dom->parse("<div>foo\n<p>bar</p>baz\n</div>")->div->p->text_before(0);

   "to_xml"
	 my $xml = $dom->to_xml;

       Render this element and its content to XML.

	 # "<div><b>test</b></div>"
	 $dom->parse('<div><b>test</b></div>')->div->to_xml;

   "tree"
	 my $tree = $dom->tree;
	 $dom	  = $dom->tree(['root', [qw(text lalala)]]);

       Alias for "tree" in Mojo::DOM::HTML.

   "type"
	 my $type = $dom->type;
	 $dom	  = $dom->type('div');

       Element type.

	 # List types of child elements
	 $dom->children->each(sub { say $_->type });

   "xml"
	 my $xml = $dom->xml;
	 $dom	 = $dom->xml(1);

       Alias for "xml" in Mojo::DOM::HTML.

CHILD ELEMENTS

       In addition to the methods above, many child elements are also automatically available as object methods, which return a Mojo::DOM or
       Mojo::Collection object, depending on the number of children.

	 say $dom->p->text;
	 say $dom->div->[23]->text;
	 $dom->div->each(sub { say $_->text });

ELEMENT ATTRIBUTES

       Direct hash reference access to element attributes is also possible.

	 say $dom->{foo};
	 say $dom->div->{id};

SEE ALSO

       Mojolicious, Mojolicious::Guides, <http://mojolicio.us>.

perl v5.14.2							    2012-09-05							    Mojo::DOM(3pm)
