04-03-2012
Unicode file validation
I don't want the data from the HTML_CONTENT, RICH_CONTENT and TEXT_CONTENT columns in the file; the rest of the data needs to be extracted.
Please find the attached file.
I need to extract the data between the DI_UX_ROW_END tags.
Can anyone help me do this with a Unix command using awk?
Thanks,
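The attachment is not reproduced here, so the following is only a rough sketch of one way awk could handle this. It assumes things the post does not state: the file has already been converted to UTF-8, fields are separated by "|" (a hypothetical delimiter), every record ends with the literal marker DI_UX_ROW_END, and the first record holds the column names. The function name drop_content_columns is made up for illustration.

```shell
#!/bin/sh
# Sketch only. Assumed (not confirmed by the post): "|" as the field
# separator, DI_UX_ROW_END closing every record, header in the first record.
drop_content_columns() {
    awk -v marker='DI_UX_ROW_END' -v sep='|' '
    # Print one record without the unwanted columns.
    function emit(rec,    n, f, i, out, first) {
        n = split(rec, f, sep)
        if (!have_header) {
            # Remember the positions of the three content columns.
            for (i = 1; i <= n; i++)
                if (f[i] ~ /^(HTML|RICH|TEXT)_CONTENT$/) skip[i] = 1
            have_header = 1
        }
        out = ""; first = 1
        for (i = 1; i <= n; i++) {
            if (i in skip) continue
            out = out (first ? "" : sep) f[i]
            first = 0
        }
        print out
    }
    {
        # Records may span several lines, so collect lines in a buffer
        # and cut a record off each time the marker shows up.
        buf = (buf == "" ? $0 : buf "\n" $0)
        while ((p = index(buf, marker)) > 0) {
            emit(substr(buf, 1, p - 1))
            buf = substr(buf, p + length(marker))
        }
    }' "$@"
}
```

Called as drop_content_columns yourfile, or at the end of a pipe. If the file is really UTF-16, something like iconv -f UTF-16 -t UTF-8 yourfile | drop_content_columns would be needed first. The buffer approach avoids a multi-character RS, which not every awk supports. A marker split across two lines is not handled.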
LEARN ABOUT SUSE
tracker-extract
tracker-extract(1) User Commands tracker-extract(1)
NAME
tracker-extract - Extract metadata from a file.
SYNOPSIS
tracker-extract [OPTION...] FILE...
DESCRIPTION
tracker-extract reads the file and mimetype provided in stdin and extracts the metadata from this file; it then displays the metadata on
the standard output.
NOTE: If a FILE is not provided then tracker-extract will run for 30 seconds waiting for DBus calls before quitting.
OPTIONS
-?, --help
Show summary of options.
-v, --verbosity=N
Set verbosity to N. This overrides the config value. Values include 0=errors, 1=minimal, 2=detailed and 3=debug.
-f, --file=FILE
The FILE to extract metadata from. The FILE argument can be either a local path or a URI. It also does not have to be an absolute
path.
-m, --mime=MIME
The MIME type to use for the file. If one is not provided, it will be guessed automatically.
-d, --disable-shutdown
Disable shutting down after 30 seconds of inactivity.
-i, --force-internal-extractors
Use this option to force internal extractors over 3rd parties like libstreamanalyzer.
-m, --force-module=MODULE
Force a particular module to be used. This is here as a convenience for developers wanting to test their MODULE file. Only the
MODULE name has to be specified, not the full path. Typically, a MODULE is installed to /usr/lib/tracker-0.7/extract-modules/.
This option can be used with or without the .so part of the name too, for example, you can use --force-module=foo.
Modules are shared objects which are dynamically loaded at run time. These files must have the .so suffix to be loaded and must
contain the correct symbols to be authenticated by tracker-extract. For more information see the libtracker-extract reference
documentation.
-V, --version
Show binary version.
EXAMPLES
Using command line to extract metadata from a file:
$ tracker-extract -v 3 -f /path/to/some/file.mp3
Using a specific module to extract metadata from a file:
$ tracker-extract -v 3 -f /path/to/some/file.mp3 -m mymodule
ENVIRONMENT
TRACKER_EXTRACTORS_DIR
This is the directory which tracker uses to load the shared libraries from (used for extracting metadata for specific file types).
These are needed on each invocation of tracker-store. If unset it will default to the correct place. This is used mainly for testing
purposes.
FILES
$HOME/.config/tracker/tracker-extract.cfg
SEE ALSO
tracker-store(1), tracker-sparql(1), tracker-stats(1), tracker-info(1).
tracker-extract.cfg(5).
/usr/lib/tracker-0.7/extract-modules/
GNU
July 2007 tracker-extract(1)