Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Remove untagged and junk data from an XML Post 302913367 by Don Cragun on Friday 15th of August 2014 04:09:59 PM
Old 08-15-2014
Quote:
Originally Posted by IshuGupta
thanks Don...
Code:
07-04-2014 00:15:04

is still a valid data as it is between the valid tags of time stamp which start before it and ends after it.

But the data like "abcd" and "pqrs $$ faf»" is not enclosed in tags , which doesnt allow it to form a valid xml.
You know that because you know what tags are valid and what data is valid between certain tags. You can build that knowledge into a script, but, for the generic script, there is a lot of meta-data that needs to be provided to that script defining the valid tags, the valid nesting of tags, the formats allowed for data between certain tags, ... . But, you aren't going to get a parser like that from a forum like this.

If you can state some clear requirements to simplify the general problem to a more specific issue, we might be able to help.

Do you just want to throw away stuff (other than a single <newline> character after a </spd> flag) that does not start with a <?xml version="1.0" encoding="IBM037"?> tag and end with the next </spd> tag?

Most XML code I've seen would have a </xml> tag for each of the <?xml...> tags. Why aren't there any in your XML code?
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

How to remove junk (^Ò) character while FTPing

Hi All, I have been trying to FTP some data files from Windows directory to a UNIX server. The txt file in the windows contails the following data: "111~XYZ~1~Contact person’s phone number~COMMENTS~~~~" but the same line is appearing as "111~XYZ~1~Contact person^Òs phone number~COMMENTS~~~~"... (8 Replies)
Discussion started by: vkumbhakarna
8 Replies

2. Shell Programming and Scripting

is is possible remove junk chars from the strings?

Hello friendz, dfl;g435hkd.fg ..this is what I am getting. I want to print strings without junk chars. I want to exactly like this... dflg435hkd.fg ...need some specific operators also. for example, dot or comma should allow. plz help me out ya! ~Balan:confused: (1 Reply)
Discussion started by: balan_mca
1 Replies

3. Shell Programming and Scripting

Remove all JUNK character from file.

Hi Team, I have a file having size greater than 1 GB. What i want to do is to check if it contains any JUNK character (ie any special charater thats not on the key board stroke). This file has 532 column & seperated with ^~^. I have found some solution from the file, but it is for a... (4 Replies)
Discussion started by: Amit.Sagpariya
4 Replies

4. Shell Programming and Scripting

Remove junk characters using Perl

Guys, can you help me in removing the junk character "^S" from the below line using perl Reference Data Not Recognised ^S Where a value is provided by the consuming system, which is not reco Thanks, M.Mohan (1 Reply)
Discussion started by: mohan_xunil
1 Replies

5. UNIX for Dummies Questions & Answers

XML file shows Junk Characters in UNIX

Hello sir, I have generated XML file from VS 2005. It works well in windows but it shows some junk characters in unix. Can any help me with this problem. Thank you in advance. Hema (6 Replies)
Discussion started by: hemavenkatesh
6 Replies

6. UNIX for Dummies Questions & Answers

How to remove JUNK characters (FROM�)

Hi I have to remove the junk characters from my file. Please help.. File content : CURITY_CODE_GSD) FROM� DL_CB_SOD_EOD_VALUATION WHERE� ASOF (1 Reply)
Discussion started by: arukuku
1 Replies

7. UNIX Desktop Questions & Answers

Help me to remove junk char

I wanted to remove junk char in my csv. :mad: Input file format: "17","9986782190","0","D","2" "17","9900918331","0","D","2" "13","9986782194","0","A","2" Output file format 9986782190 9900918331 9986782194 And one more thing all the time "13"," this will be different Ex: . (2 Replies)
Discussion started by: Siddartha
2 Replies

8. Shell Programming and Scripting

Remove all junk characters from a text file

I am using flatfile, in that flat file we are getting the junk chars 1)I21001f<82>^Me<85>!h49 Service Charge 2) I21001f‚ e...!h49 Service Charge please tell me how to remove all junk chars in unix scripts. (1 Reply)
Discussion started by: Talari
1 Replies

9. UNIX for Dummies Questions & Answers

How to replace and remove few junk characters from a specific field?

I would like to remove all characters starting with "%" and ending with ")" in the 4th field - please help!! 1412007819.864 /device/services/heartbeatxx 204 0.547%!i(int=0) 0.434 0.112 1412007819.866 /device/services/heartbeatxx 204 0.547%!i(int=1) 0.423 0.123... (10 Replies)
Discussion started by: snemuk14
10 Replies

10. UNIX for Beginners Questions & Answers

Need to remove Junk characters

Hi All, I have a issue that we are getting Junk characters from source and i am not able to load that records to Database. Line breakers Junk Characters (Â and different every time) Japanese Characters Every time I am using grep command and awk -F "\007" to find them and delete that... (1 Reply)
Discussion started by: spradeep86
1 Replies
CTAGS(1)						      General Commands Manual							  CTAGS(1)

NAME
ctags - Generates "tags" and (optionally) "refs" files SYNOPSIS
ctags [-stvra] filesnames... DESCRIPTION
ctags generates the "tags" and "refs" files from a group of C source files. The "tags" file is used by Elvis' ":tag" command, control-] command, and -t option. The "refs" file is sometimes used by the ref(1) program. Each C source file is scanned for #define statements and global function definitions. The name of the macro or function becomes the name of a tag. For each tag, a line is added to the "tags" file which contains: - the name of the tag - a tab character - the name of the file containing the tag - a tab character - a way to find the particular line within the file. The filenames list will typically be the names of all C source files in the current directory, like this: $ ctags -stv *.[ch] OPTIONS
-t Include typedefs. A tag will be generated for each user-defined type. Also tags will be generated for struct and enum names. Types are considered to be global if they are defined in a header file, and static if they are defined in a C source file. -v Include variable declarations. A tag will be generated for each variable, except for those that are declared inside the body of a function. -s Include static tags. Ctags will normally put global tags in the "tags" file, and silently ignore the static tags. This flag causes both global and static tags to be added. The name of a static tag is generated by prefixing the name of the declared item with the name of the file where it is defined, with a colon in between. For example, "static foo(){}" in "bar.c" results in a tag named "bar.c:foo". -r This causes ctags to generate both "tags" and "refs". Without -r, it would only generate "tags". -a Append to "tags", and maybe "refs". Normally, ctags overwrites these files each time it is invoked. This flag is useful when you have to many files in the current directory for you to list them on a single command-line; it allows you to split the arguments among several invocations. FILES
tags A cross-reference that lists each tag name, the name of the source file that contains it, and a way to locate a particular line in the source file. refs The "refs" file contains the definitions for each tag in the "tags" file, and very little else. This file can be useful, for exam- ple, when licensing restrictions prevent you from making the source code to the standard C library readable by everybody, but you still everybody to know what arguments the library functions need. BUGS
ctags is sensitive to indenting and line breaks. Consequently, it might not discover all of the tags in a file that is formatted in an unusual way. SEE ALSO
elvis(1), refs(1) AUTHOR
Steve Kirkendall kirkenda@cs.pdx.edu CTAGS(1)
All times are GMT -4. The time now is 09:02 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy