Hello,
I have a file with more than 120 million records (35 GB in size). I have to extract relevant data from the file based on some parameters and generate another output file.
What would be the best and fastest way to extract the new file?
sample file format :--... (2 Replies)
I have a file containing date/time sorted data of the form
...
2009/06/10,20:59:59.950,XAG/USD,Q,1,1115, 14.3025,100,1,1
2009/06/10,20:59:59.950,XAG/USD,Q,1,1116, 14.3026,125,1,1
2009/06/10,20:59:59.950,XAG/USD,R,0,0, , 0,0,0
2009/06/10,20:59:59.950,XAG/USD,R,1,0, 14.1910,100,1,1... (6 Replies)
Hi All,
I am trying to extract data from a large text file. I want to extract lines that contain a five-digit number followed by a hyphen, like
12345-. I tried egrep, e.g.: egrep "+" text.txt
but it returns all the lines that contain any number of digits followed by a hyphen,... (19 Replies)
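A minimal Python sketch of the match described in this thread, assuming "five digit number" means exactly five digits (not the last five of a longer run). The names `FIVE_DIGITS_HYPHEN` and `matching_lines` are hypothetical, chosen for illustration; the original poster's actual pattern is not recoverable from the snippet.

```python
import re

# Exactly five digits followed by a hyphen. The negative lookbehind
# rejects matches that are merely the tail of a longer digit run.
FIVE_DIGITS_HYPHEN = re.compile(r"(?<!\d)\d{5}-")


def matching_lines(lines):
    """Yield only the lines that contain a five-digit number followed by '-'."""
    for line in lines:
        if FIVE_DIGITS_HYPHEN.search(line):
            yield line
```

Because the function takes any iterable of lines, it can be fed an open file object, so a large file is read line by line rather than loaded into memory.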
I have a script with this statement:
/usr/xpg4/bin/awk -F"" 'NR==FNR{s=$2;next}{printf "%s\"%s\"\n", $0, s}' LOOKUP.TXT finallistnew.txt >test.txt
I want to include logic or an additional step that says if there is no data in field 3, move the whole line out of test.txt into an additional... (9 Replies)
I have test data with 10 comma-separated columns, each with more than 1000000 rows. Can anyone help me find the empty fields in all columns, delete only those empty fields, and shift the rest of each column up by one row?
Data with empty field:
A74203XYZ,A21718XYZ,A72011XYZ,A41095XYZ,... (7 Replies)
My script greps for keywords in syslog.log. I want to put a line at the end of it that outputs the following after it is done:
"This file was last modified on (thisdate)"
I know I can use the following to get the date:
rtidsvb(izivanov):/home/izivanov> ll /var/adm/syslog/syslog.log ... (4 Replies)
I am trying to update an older program on a small cluster. It uses individual files to send jobs to each node. However the newer database comes as one large file, containing over 10,000 records. I therefore need to split this file. It looks like this:
HMMER3/b
NAME 1-cysPrx_C
ACC ... (2 Replies)
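One way to split such a file can be sketched in Python, assuming each record ends with a line containing only `//` (the sample above is truncated before any terminator, so this is an assumption about the format). The function names and the `record_NNNNN.hmm` output naming are hypothetical.

```python
import os


def split_records(lines, terminator="//"):
    """Group lines into records, closing a record at each terminator line."""
    record = []
    for line in lines:
        record.append(line)
        if line.strip() == terminator:
            yield record
            record = []
    if record:  # trailing lines that were never terminated
        yield record


def write_records(path, out_dir):
    """Write each record of `path` to its own file in `out_dir`; return the count."""
    os.makedirs(out_dir, exist_ok=True)
    count = 0
    with open(path) as src:
        for count, record in enumerate(split_records(src), start=1):
            name = os.path.join(out_dir, f"record_{count:05d}.hmm")
            with open(name, "w") as out:
                out.writelines(record)
    return count
```

The input is streamed, so only one record is held in memory at a time.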
Hi all,
I want to remove empty fields in a text file. I tried to use sed, but it failed.
Input:
LG10_PM_map_19_LEnd 1000560 G AG AG
LG10_PM_map_19_LEnd 1005621 G AG
LG10_PM_map_19_LEnd 1011214 A AG AG
LG10_PM_map_19_LEnd 1011673 T CT CT ... (3 Replies)
Dear all,
I want to extract around 300 columns from a very large file with almost 2 million columns. There are no headers, but I can find out which column numbers I want. I know I can extract, for example, just the second column with the command 'cut -f2', but how do I do this for such a large... (1 Reply)
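For reference, `cut -f` also accepts a comma-separated list of fields (for example `cut -f2,5,9`). The same selection can be sketched in Python, assuming a tab-delimited file (the `cut` default) and 1-based column numbers; `extract_columns` is a hypothetical helper, not something from the thread.

```python
def extract_columns(src, dst, columns, delimiter="\t"):
    """Copy the given 1-based columns from each line of src to dst.

    Columns are written in the order listed. A line with fewer fields
    than the largest requested column raises IndexError.
    """
    indexes = [c - 1 for c in columns]
    for line in src:
        fields = line.rstrip("\n").split(delimiter)
        dst.write(delimiter.join(fields[i] for i in indexes) + "\n")
```

Unlike `cut`, which always emits fields in file order, this sketch emits them in the order given in `columns`. It processes one line at a time, though each line of a file with millions of columns is itself large.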
Hi All!!
I have a large file containing millions of records. My purpose is to extract 8 characters immediately from the given file.
222222222|ZRF|2008.pdf|2008|01/29/2009|001|B|C|C
222222222|ZRF|2009.pdf|2009|01/29/2010|001|B|C|C
222222222|ZRF|2010.pdf|2010|01/29/2011|001|B|C|C... (5 Replies)
Discussion started by: pavand
tracker-extract
tracker-extract(1) User Commands tracker-extract(1)
NAME
tracker-extract - Extract metadata from a file.
SYNOPSIS
tracker-extract [OPTION...] FILE...
DESCRIPTION
tracker-extract reads the file and mimetype provided in stdin and extracts the metadata from this file; then it displays the metadata on the
standard output.
NOTE: If a FILE is not provided then tracker-extract will run for 30 seconds waiting for DBus calls before quitting.
OPTIONS
-?, --help
Show summary of options.
-v, --verbosity=N
Set verbosity to N. This overrides the config value. Values include 0=errors, 1=minimal, 2=detailed and 3=debug.
-f, --file=FILE
The FILE to extract metadata from. The FILE argument can be either a local path or a URI. It also does not have to be an absolute
path.
-t, --mime=MIME
The MIME type to use for the file. If one is not provided, it will be guessed automatically.
-d, --disable-shutdown
Disable shutting down after 30 seconds of inactivity.
-i, --force-internal-extractors
Use this option to force internal extractors over 3rd parties like libstreamanalyzer.
-m, --force-module=MODULE
Force a particular module to be used. This is here as a convenience for developers wanting to test their MODULE file. Only the MODULE
name has to be specified, not the full path. Typically, a MODULE is installed to /usr/lib/tracker-0.7/extract-modules/. This
option can be used with or without the .so part of the name too, for example, you can use --force-module=foo
Modules are shared objects which are dynamically loaded at run time. These files must have the .so suffix to be loaded and must contain
the correct symbols to be authenticated by tracker-extract. For more information see the libtracker-extract reference documentation.
-V, --version
Show binary version.
EXAMPLES
Using command line to extract metadata from a file:
$ tracker-extract -v 3 -f /path/to/some/file.mp3
Using a specific module to extract metadata from a file:
$ tracker-extract -v 3 -f /path/to/some/file.mp3 -m mymodule
ENVIRONMENT
TRACKER_EXTRACTORS_DIR
This is the directory which tracker uses to load the shared libraries from (used for extracting metadata for specific file types).
These are needed on each invocation of tracker-store. If unset it will default to the correct place. This is used mainly for testing
purposes.
FILES
$HOME/.config/tracker/tracker-extract.cfg
SEE ALSO
tracker-store(1), tracker-sparql(1), tracker-stats(1), tracker-info(1).
tracker-extract.cfg(5).
/usr/lib/tracker-0.7/extract-modules/
GNU July 2007 tracker-extract(1)