Need to extract 7 characters immediately after text '19' from a large file. Post: 302278763

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Performance issue in UNIX while generating .dat file from large text file

Hello Gurus, We are facing some performance issue in UNIX. If someone had faced such kind of issue in past please provide your suggestions on this . Problem Definition: /Few of load processes of our Finance Application are facing issue in UNIX when they uses a shell script having below...

2. Shell Programming and Scripting

Extract data from large file 80+ million records

Hello, I have got one file with more than 120+ million records(35 GB in size). I have to extract some relevant data from file based on some parameter and generate other output file. What will be the besat and fastest way to extract the ne file. sample file format :--...

3. Shell Programming and Scripting

extract unique pattern from large text file

Hi All, I am trying to extract data from a large text file , I want to extract lines which contains a five digit number followed by a hyphen , like 12345- , i tried with egrep ,eg : egrep "+" text.txt but which returns all the lines which contains any number of digits followed by hyhen ,...

4. Shell Programming and Scripting

extract characters from file name - script

trying to extract the numbers in this file name: fname="ebcdic.f0633.cmp_ebcdic.f0633.bin" fnametmp=${fname#*(V|v|F|f)} parse=${fnametmp%%(ENC|enc|CMP|cmp|BIN|bin)}} echo FLRECL=$parse result is FLRECL=0633.cmp_ebcdic.f0633 expected result FLRECL=0633 my guru is on holiday and i need...

5. Shell Programming and Scripting

extract last 8 characters from a file name

Dear gurus I have several files with the following format filenameCCYYMMDD , that is the last 8 characters will be the date in CCYYMMDD format. eg FILENAME20110523 . Could anyone please put me through on how to extract only the last 8 characters from the files. I am thinking of using awk,sed...

6. Shell Programming and Scripting

splitting a large text file into paragraphs

Hello all, newbie here. I've searched the forum and found many "how to split a text file" topics but none that are what I'm looking for. I have a large text file (~15 MB) in size. It contains a variable number of "paragraphs" (for lack of a better word) that are each of variable length. A...

7. Shell Programming and Scripting

Curl download zip extract large xml file

Hi i have a php script that works 100% however i don't want this to run on php because of server limits etc. Ideally if i could convert this simple php script to a shell script i can set it up to run on a cron. My mac server has curl on it. So i am assuming i should be using this to download the...

8. UNIX for Dummies Questions & Answers

Extract spread columns from large file

Dear all, I want to extract around 300 columns from a very large file with almost 2million columns. There are no headers, but I can find out which column numbers I want. I know I can extract with the function 'cut -f2' for example just the second column but how do I do this for such a large...

9. Shell Programming and Scripting

Need to extract 8 characters from a large file.

Hi All!! I have a large file containing millions of records. My purpose is to extract 8 characters immediately from the given file. 222222222|ZRF|2008.pdf|2008|01/29/2009|001|B|C|C 222222222|ZRF|2009.pdf|2009|01/29/2010|001|B|C|C 222222222|ZRF|2010.pdf|2010|01/29/2011|001|B|C|C...

10. UNIX for Beginners Questions & Answers

Command to extract empty field in a large UNIX file?

Hi All, I have records in unix file like below. In this file, we have empty fields from 4th Column to 22nd Column. I have some 200000 records in a file. I want to extract records only which have empty fields from 4th field to 22nd filed. This file is comma separated file. what is the unix...

LEARN ABOUT DEBIAN

sdfget

SDFGET(1)						User Contributed Perl Documentation						 SDFGET(1)

NAME

       sdfget - Documentation Extraction Utility

PURPOSE

       sdfget extracts documentation embedded in source code.

USAGE

	usage  : sdfget [-h[help]] [-o[out_ext]]
		[-l[log_ext]] [-O[out_dir]]
		[-f formatting_filename] [-g[get_rule]]
		[-r[rpt_file]] [-s scope] [-i]
		[-v[verbose]] file ...
       purpose: extract documentation embedded in source code
       version: 2.000	 (SDF 2.001)

       The options are:

	Option	     Description
	-h	     display help on options
	-o	     output file extension
	-l	     log file extension
	-O	     output to input file's (or explicit) directory
	-f	     filename to use when formatting the output
	-g	     rule to use to get documentation
	-r	     report file
	-s	     scope of documentation to be extracted
	-i	     only output lines not extracted
	-v	     verbose mode

DESCRIPTION

       The -h option provides help. If it is specified without a parameter, a brief description of each option is displayed. To display the
       attributes for an option, specify the option letter as a parameter.

       By default, generated output goes to standard output. To direct output to a file per input file, use the -o option to specify an extension
       for output files. If the -o option is specified without a parameter, an extension of out is assumed.

       Likewise, error messages go to standard error by default. Use the -l option to create a log file per input file. If the -l option is
       specified without a parameter, an extension of log is assumed.

       By default, generated output and log files are created in the current directory. Use the -O option to specify an explicit output directory.
       If the -O option is specified without a parameter, the input file's directory is used.

       The -f option can be used to specify a filename to use when formatting the output. This is useful when the text is coming from the standard
       input stream.

       The get-rule nominates the formatting of the embedded documentation to be extracted. All currently defined get-rules assume the
       documentation is in comment blocks in one of the following formats:

	>>section_title1::
	text of section 1, line 1
	text of section 1, line ..

	>>section_title2::
	text of section 2, line 1
	text of section 2, line ..
	>>END::

	>>section_title3:: text of section 3

       The first form is most commonly used. In this format, the text in a section extends until the end of the current "comment block" or the
       start of the next section, whichever comes first. The second form (i.e. explicitly specifying where the section ends) is useful if you wish
       to add some normal comments (i.e. non-documentation) which you do not want extracted. If the text is short, the third form can be used.
       Regardless of the format, if a section is found which is already defined, the text of the section is concatenated onto the existing text.
       This permits the documentation for each entity to be specified immediately above where it is defined in the source code.

       The -g option specifies the get-rule to use. The available get-rules differ on the prefix expected at the front of each line as shown
       below.

	Rule				  Prefix
	perl				  #
	cpp				  //
	c				  * or /*
	fortran 			  c (with 5 preceding spaces)
	eiffel				  --
	bat				  rem

       Within C code, a trailing space is required after the characters above. For other languages, a trailing space is optional. Within FORTRAN
       code, the "c" character must be preceded by exactly 5 spaces.  For other languages, zero or more whitespace characters are permitted before
       the characters above.

       For example, embedded documentation within C code looks like:

	/* >>Purpose::
	 * This library provides a high level interface
	 * to commonly used network services.
	 */

       If the -g option is not specified, perl is the default get-rule. If the -g option is specified without a parameter, the extension in
       lowercase of the filename (or the formatting filename if the text is coming from standard input) is used to guess the get_rule as shown
       below.

	Rule				  Extensions
	cpp				  cpp, c++, cc, hpp, hpp, h, java, idl
	c				  c
	fortran 			  fortran, for, f77, f
	eiffel				  eiffel, ada
	bat				  bat, cmd

       A report filename can be specified using the -r option. If the name doesn't include an extension, sdg is assumed. Reports provide a
       mechanism for:

       o   selectively extracting sections, and

       o   rudimentary reformatting (e.g. to SDF)

       If no report is specified, all sections are output in the following format:

	section_title1
	section_text1

	section_title2
	section_text2

       If -r is specified on its own, default.sdg is assumed. This report selects the set of sections (within the SDF documentation standards)
       which form the user documentation and formats them into SDF. Details on the report format are specified below. Reports are searched for in
       the current directory, then in the stdlib directory within SDF's library directory.

       The -s option can be used to specify the scope of the documentation to be extracted. (This is an experimental feature and may change so
       most users should avoid using it.)

       The -i option outputs only those lines which the get-rule did not match. This option is useful for extracting non-documentation from a file
       to give just the code.

       Note: The -r option is ignored if -i is specified.

       The -v option enables verbose mode. This is useful for seeing which rule is being used for each file.

EXAMPLES

       To extract the user documentation from a SDF application written in C++ (xyz, say) and save it into xyz.sdf:

	     sdfget -gcpp -r -osdf xyz.cpp

LIMITATIONS AND FUTURE DIRECTIONS

       It would be nicer if the get-rule was always guessed from the filename extension but changing the default from perl could break existing
       scripts. Therefore, get-rule guessing must be explicitly enabled by specifging the -g option without a parameter.

perl v5.12.4							    2011-11-09								 SDFGET(1)

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Performance issue in UNIX while generating .dat file from large text file

Discussion started by: KRAMA

2. Shell Programming and Scripting

Extract data from large file 80+ million records

Discussion started by: learner16s

3. Shell Programming and Scripting

extract unique pattern from large text file

Discussion started by: shijujoe

4. Shell Programming and Scripting

extract characters from file name - script

Discussion started by: mambo2523