Problems with grep and XML

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

problems with grep on solaris 5.8

Hi all, I have a problem when I grep for a particular field among all the files in the directory. If I do an ls -l field * I can find it. However, at the moment the number of files in the directory is close to 28000 and it returns: ksh: /usr/bin/grep: arg list too long Assuming i...

2. Shell Programming and Scripting

Grep xml tags

Hi, I want to get the value between two XML tags, as follows: <EAN>12345</EAN>, so I would want to return 12345. I have tried sed and awk but can't do it. Can anyone help?

3. Shell Programming and Scripting

Grep XML tags

I want to search for the XML pattern below in XML files, but the XML files are inside .GZ files: <PRODID>LCTO84876</PRODID> <PARTNUMBER>8872AC1</PARTNUMBER> <WWPRODID>MODEL84876</WWPRODID> <COUNTRY>US</COUNTRY> <LANGUAGE>1</LANGUAGE> What's the command/script to search for it?

4. Shell Programming and Scripting

Problems in Usage of grep

Hi all, I have a file resp_cde.ats which has values as:- APPDIR=C:\Program Files\Cogny\cert PUBSDIR=C:\Program Files\Cognoy\cert\documentation TOURDIR=C:\Program Files\Cognoy\cert\tour DATADIR=C:\Program Files\Cognoy\cert\data Now I use the grep command in a shell script:- x=`grep...

5. Shell Programming and Scripting

Grep/Parse a .xml file

I have a .xml file similar to the following: <Column> <Name>FIELD1</Name> <Title>CO.</Title> </Column> <Column> <Name>FIELD2</Name> <EditField>TextBox</EditField> <ColumnSpan0>4</ColumnSpan0> <Title>NORMAL</Title> ...

6. UNIX for Dummies Questions & Answers

GREP for a tag in XML File

I have 2 XML data files with a tag named PARTICIPATION_TYPE and I am trying to grep for that and get the unique values. However, the data in one of the XML files is not aligned properly, as shown below. File 1: (works fine when I do grep) grep "PARTICIPATION_TYPE" file1.xml | sort -u Data: .......

7. UNIX for Dummies Questions & Answers

Grep content in xml file

I have an xml file with header as below. <Provider xmlns="http://www.xyzx.gov/xyz" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.xyzx.gov/xyz xyz.xsd" SCHEMA_VERSION="2.5" PROVIDER="5"> I want to get the schema version here that is 2.5 and put in a...

8. Shell Programming and Scripting

Grep some values from XML file

Dear community, I have a big XML log file containing several rows split by the tags <ActivityLogRecord> and </ActivityLogRecord>. An example is below. What I need is to read the file, extract some values from each tag, and put them on one line (one line for every <ActivityLogRecord> tag). So...

9. Shell Programming and Scripting

How to grep for a word in xml?

Hi, I have the tag below in my XML: <foreign-server name="MOHTASHIM_SERVER"> What would be the easiest way to extract MOHTASHIM_SERVER, without the double quotes "", from the above tag? Desired Output:

10. UNIX for Beginners Questions & Answers

How to fetch the value from a xml using sed, GREP?

I have a simple XML file and need the output with the text of the <value> tag and the <result> tag. text.xml: <test-method status="FAIL" duration="45"> <value> Id=C18 </value> <result> wrong paramter </result> </test-method> <test-method status="FAIL" duration="45"> <value> Id=C19 </value> <result> Data...

Encode::IMAPUTF7(3pm)					User Contributed Perl Documentation				     Encode::IMAPUTF7(3pm)

NAME

       Encode::IMAPUTF7 - modification of UTF-7 encoding for IMAP

SYNOPSIS

	 use Encode qw/encode decode/;
	 use Encode::IMAPUTF7;

	 print encode('IMAP-UTF-7', 'Répertoire');
	 print decode('IMAP-UTF-7', 'R&AOk-pertoire');

ABSTRACT

       IMAP mailbox names are encoded in a modified UTF-7 when names contain international characters outside of the printable ASCII range. The
       modified UTF-7 encoding is defined in RFC2060 (section 5.1.3).

       There is another CPAN module with the same purpose, Unicode::IMAPUtf7. However, it works correctly only with strings whose encoded form
       does not contain a plus sign. For example, the Cyrillic string \x{043f}\x{0440}\x{0435}\x{0434}\x{043b}\x{043e}\x{0433} is represented in
       UTF-7 as +BD8EQAQ1BDQEOwQ+BDM-. Note the second plus sign, 4 characters before the end. Unicode::IMAPUtf7 encodes the above string as
       +BD8EQAQ1BDQEOwQ&BDM-, which is not valid modified UTF-7 (the ampersand and the plus are swapped). The current module solves the problem;
       it is a slightly modified Encode::Unicode::UTF7 and has nothing in common with Unicode::IMAPUtf7.
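
       The plus sign in the example above comes from the BASE64 payload itself, not from the shift character. The following is a minimal
       Python sketch (not part of Encode::IMAPUTF7, and not a description of how either Perl module works internally) that rebuilds both
       forms of the Cyrillic example using only the standard library. The variable names are illustrative assumptions.

```python
import base64

# The Cyrillic example string from the text above.
word = "\u043f\u0440\u0435\u0434\u043b\u043e\u0433"

# One shift sequence: BASE64 of the UTF-16BE code units, without "=" padding.
payload = base64.b64encode(word.encode("utf-16-be")).decode("ascii").rstrip("=")

# UTF-7 shifts into BASE64 with "+"; modified UTF-7 shifts with "&"
# and writes "," where BASE64 would use "/".
plain_utf7 = "+" + payload + "-"
modified_utf7 = "&" + payload.replace("/", ",") + "-"

print(plain_utf7)      # +BD8EQAQ1BDQEOwQ+BDM-
print(modified_utf7)   # &BD8EQAQ1BDQEOwQ+BDM-
print("+" in payload)  # True: "+" is an ordinary BASE64 digit here
```

       Only the leading shift character differs between the two forms; the "+" inside the payload is data and has to stay as it is.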

RFC2060 - section 5.1.3 - Mailbox International Naming Convention
       By convention, international mailbox names are specified using a modified version of the UTF-7 encoding described in [UTF-7].  The purpose
       of these modifications is to correct the following problems with UTF-7:

       1) UTF-7 uses the "+" character for shifting; this conflicts with
	  the common use of "+" in mailbox names, in particular USENET
	  newsgroup names.

       2) UTF-7's encoding is BASE64 which uses the "/" character; this
	  conflicts with the use of "/" as a popular hierarchy delimiter.

       3) UTF-7 prohibits the unencoded usage of "\"; this conflicts with
	  the use of "\" as a popular hierarchy delimiter.

       4) UTF-7 prohibits the unencoded usage of "~"; this conflicts with
	  the use of "~" in some servers as a home directory indicator.

       5) UTF-7 permits multiple alternate forms to represent the same
	  string; in particular, printable US-ASCII characters can be
	  represented in encoded form.

       In modified UTF-7, printable US-ASCII characters except for "&" represent themselves; that is, characters with octet values 0x20-0x25 and
       0x27-0x7e.  The character "&" (0x26) is represented by the two-octet sequence "&-".

       All other characters (octet values 0x00-0x1f, 0x7f-0xff, and all Unicode 16-bit octets) are represented in modified BASE64, with a further
       modification from [UTF-7] that "," is used instead of "/".  Modified BASE64 MUST NOT be used to represent any printing US-ASCII character
       which can represent itself.

       "&" is used to shift to modified BASE64 and "-" to shift back to US-ASCII.  All names start in US-ASCII, and MUST end in US-ASCII (that
       is, a name that ends with a Unicode 16-bit octet MUST end with a "-").

       For example, here is a mailbox name which mixes English, Japanese, and Chinese text: ~peter/mail/&ZeVnLIqe-/&U,BTFw-
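
       The rules quoted above can be sketched as a small encoder and decoder. This is an illustrative Python sketch under stated
       assumptions, not the implementation used by Encode::IMAPUTF7: the function names imap_utf7_encode and imap_utf7_decode are
       hypothetical, input is assumed to be well formed, and malformed sequences are not handled gracefully.

```python
import base64


def _encode_run(run: str) -> str:
    """Encode a run of non-printable/non-ASCII characters as '&' + modified BASE64 + '-'."""
    b64 = base64.b64encode(run.encode("utf-16-be")).decode("ascii").rstrip("=")
    return "&" + b64.replace("/", ",") + "-"


def imap_utf7_encode(name: str) -> str:
    """Hypothetical helper: encode a mailbox name following the rules quoted above."""
    out = []
    run = []  # pending characters that must go into one BASE64 shift sequence

    def flush() -> None:
        if run:
            out.append(_encode_run("".join(run)))
            run.clear()

    for ch in name:
        if 0x20 <= ord(ch) <= 0x7E:
            flush()
            # Printable US-ASCII represents itself, except "&" which becomes "&-".
            out.append("&-" if ch == "&" else ch)
        else:
            run.append(ch)
    flush()  # a name must end in US-ASCII, so close any open shift sequence
    return "".join(out)


def imap_utf7_decode(encoded: str) -> str:
    """Hypothetical helper: decode a well-formed modified UTF-7 mailbox name."""
    out = []
    i = 0
    while i < len(encoded):
        if encoded[i] != "&":
            out.append(encoded[i])
            i += 1
            continue
        end = encoded.index("-", i)  # raises ValueError if the shift is unterminated
        payload = encoded[i + 1:end]
        if payload == "":
            out.append("&")  # "&-" stands for a literal ampersand
        else:
            b64 = payload.replace(",", "/")
            b64 += "=" * (-len(b64) % 4)  # restore the padding BASE64 expects
            out.append(base64.b64decode(b64).decode("utf-16-be"))
        i = end + 1
    return "".join(out)


print(imap_utf7_encode("R\u00e9pertoire"))                   # R&AOk-pertoire
print(imap_utf7_decode("~peter/mail/&ZeVnLIqe-/&U,BTFw-"))   # ~peter/mail/日本語/台北
```

       Consecutive non-ASCII characters are collected into a single shift sequence, so each run yields exactly one "&...-" group, as in
       the mailbox example above.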

REQUESTS & BUGS
       Please report any requests, suggestions or bugs via the RT bug-tracking system at http://rt.cpan.org/ or email to
       bug-Encode-IMAPUTF7@rt.cpan.org.

       http://rt.cpan.org/NoAuth/Bugs.html?Dist=Encode-IMAPUTF7 is the RT queue for Encode::IMAPUTF7.  Please check to see if your bug has already
       been reported.

COPYRIGHT

       Copyright 2005 Sava Chankov

       Sava Chankov, sava@cpan.org

       This software may be freely copied and distributed under the same terms and conditions as Perl.

AUTHORS

       Peter Makholm <peter@makholm.net>, current maintainer

       Sava Chankov <sava@cpan.org>, original author

SEE ALSO

       perl(1), Encode.

perl v5.12.4							    2011-09-25						     Encode::IMAPUTF7(3pm)