In a script, I am trying to extract unique text from a set of filenames.
I have certain files like below in a directory:
I need to extract just "OPEN_INV_" and "CLOSE_INV_".
I use the following command:
This works fine. The last 8 characters before the file extension are always MMDDYYYY.
But when I have a file name with an additional time stamp, like:
OPEN_INV_01012011_1345.xls (here 01012011 is MMDDYYYY and 1345 is HHMM)
I am not able to get the file name part.
Please help me modify the "val" variable.
Thanks
Shankar
Last edited by Scott; 08-09-2011 at 01:05 PM..
Reason: Added code tags
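Since the original command is not shown above, here is only a minimal sketch of one way "val" could be set so that it handles both name formats. It assumes the date is always 8 digits, the optional time stamp is always an underscore followed by 4 digits, and that sed supports -E (GNU and BSD sed do). The function name extract_prefix and the sample names without a time stamp are hypothetical, made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: print the part of a file name that comes before the
# MMDDYYYY date. The optional _HHMM time stamp and the extension are removed too.
extract_prefix() {
    # [0-9]{8}       the MMDDYYYY date
    # (_[0-9]{4})?   an optional _HHMM time stamp
    # \.[^.]+$       the file extension at the end of the name
    printf '%s\n' "$1" | sed -E 's/[0-9]{8}(_[0-9]{4})?\.[^.]+$//'
}

# Sample names: the first two are assumed, the third is the one from the post.
for f in OPEN_INV_01012011.xls CLOSE_INV_01012011.xls OPEN_INV_01012011_1345.xls
do
    val=$(extract_prefix "$f")
    echo "$f -> $val"
done
```

With these sample names, "val" comes out as OPEN_INV_, CLOSE_INV_ and OPEN_INV_ respectively. Anchoring the pattern at the end of the name, instead of counting characters, is what lets the same expression cover names with and without the time stamp.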
I'm loading multiple delimited files into an Oracle DB using sqlldr on Unix. I would like to get only the most recent record for each unique key. There may be multiple updates for each key, but I only want the most recent one.
There is a date column in my delimited files, so I'm using cat to... (2 Replies)
I have filenames
filenameA_fg_MMDDYY.tar.gz
filenameASPQ_fg_MMDDYY.tar.gz
filenameAFTOPHYYINGH_fg_MMDDYY.tar.gz
filenameAGHYSW_fg_MMDDYY.tar.gz
Is there a way I can extract the date out of these filenames?
Thanks in advance (2 Replies)
Hi All,
I am trying to extract data from a large text file. I want to extract lines that contain a five-digit number followed by a hyphen, like
12345-. I tried egrep, e.g.: egrep "+" text.txt
but it returns all the lines that contain any number of digits followed by a hyphen,... (19 Replies)
Hello all,
I have a file with following sample data
2009-08-26 05:32:01.65 spid5 Process ID 86:214 owns resources that are blocking processes on Scheduler 0.
2009-08-26 05:32:01.65 spid5 Process ID 86:214 owns resources that are blocking processes on Scheduler 0.
2009-08-26... (5 Replies)
I need some help. I have a list of files (thousands) and would like to extract some variables from the file name and save that to a file
The list of files looks like:
I am trying to write the following script, but I am stuck on how to get the variables 'doy' and 'yr' from each file and then... (5 Replies)
I have 84 files with the following names splitseqs.1, spliseqs.2 etc.
and I want to change the .number to a unique filename.
E.g.
change splitseqs.1 into splitseqs.7114_1#24
and
change spliseqs.2 into splitseqs.7067_2#4
So all the current file names are unique, so are the new file names.... (1 Reply)
Hi Gurus,
I have 100 tab-delimited text files, each with 21 columns. I want to extract only the 2nd and 5th columns from each text file. However, the values in both the 2nd and 5th columns contain duplicates, but the combination of these values in a row is not duplicated. I want to extract only those... (3 Replies)
Hi,
I have a 20 GB pipe-delimited file that contains many duplicate records.
I need an awk script to extract the unique records from the file and write them to another file.
Kindly help.
Thanks,
Arun (1 Reply)
In an incoming folder I have a list of files like the one below. I want to pick the unique files to process in the job. If the same file appears more than once, the most recently modified copy should be picked for processing.
drwxrwsrwx 2 n308799 infagrp 256 May 20 17:42 Final_Working
drwxrwsrwx 2... (1 Reply)
Hi there,
I need help with extracting data from a tab-delimited file which looks like this:
#CHROM POS ID REF ALT Human Cow Dog Mouse Lizard
chr2 3033 . G C 0/0 0/0 0/0 1/1 0/0
chr3 35040 . G T 0/0 0/0 ./. 1/1 0/1
chr4 60584 . T G 1/1 1/1 0/1 1/1 0/0
chr10 7147815 . G A 0/0 1/1 0/0 0/0... (9 Replies)
Discussion started by: houkto
runxlrd
RUNXLRD(1) General Commands Manual RUNXLRD(1)
NAME
runxlrd - extract data from Microsoft Excel spreadsheet files
SYNOPSIS
runxlrd [options] command [input-file-patterns]
DESCRIPTION
This manual page documents briefly the runxlrd command.
runxlrd offers a commandline interface to the Python xlrd module; it extracts data from Microsoft Excel spreadsheet files.
OPTIONS
-h, --help
Show summary of options and exit.
-l logfilename
Contains error messages.
-v verbosity, --verbosity=verbosity
Level of information and diagnostics provided.
-p pickleable, --pickleable=pickleable
1: ensure Book object is pickleable (default); 0: don't bother.
-m mmap, --mmap=mmap
1: use mmap; 0: don't use mmap; -1: accept heuristic.
-e encoding, --encoding=encoding
Encoding override.
-f formatting, --formatting=formatting
0 (default): no fmt info; 1: fmt info (all cells); 2: fmt info (margins trimmed).
-g gc, --gc=gc
0: auto gc enabled; 1: auto gc disabled, manual collect after each file; 2: no gc.
-s onesheet, --onesheet=onesheet
Restrict output to this sheet (name or index).
COMMANDS
Valid commands are
o dump
o count_records
o version
o hdr
o ov
o show
o 2rows
o 3rows
o bench
o names
o name_dump
o labels
o xfc
o hotshot
o profile
See the xlrd API documentation for the meaning of these commands.
EXAMPLES
To show the first, second and last rows of each sheet in each file, run
$ runxlrd 3rows *blah*.xls
Run
$ runxlrd -e koi8_r 3rows myfile.xls
to explicitly pass the needed codepage to xlrd, e.g. if the codepage record is missing, or if it exists but is wrong.
SEE ALSO
The xlrd API documentation, shipped in the file xlrd.html.
The xlrd website <http://www.lexicon.net/sjmachin/xlrd.htm>.
AUTHOR
runxlrd was written by John Machin.
This manual page was written by Joost van Baal <joostvb+debian@uvt.nl>, for the Debian project (but may be used by others).
xlrd 0.6.1a4 2007-05-24 RUNXLRD(1)