How to extract data from a huge file?
Post 302159595 by srsahu75 on Friday 18th of January 2008 03:53:20 AM

Yes, I need to extract everything between the main tags, including the tags themselves,
starting from
<dublin_core schema="dc">
to
</dublin_core>

Save each extract as dublin_core.xml in its own folder item_*. The folders are created from the item_* string that appears just before <dublin_core schema="dc">.

Also save a second file named 'contents' in each folder, containing the single line license.txt, then a tab character (\t), then bundle:LICENSE.
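
For anyone following along, here is a rough sketch of the requirement above, written in Python rather than shell. It is only an illustration under several assumptions that the post does not confirm: the input is line-oriented text, the folder label looks like item_ followed by word characters, and the label sits on the line before the opening tag or earlier on the same line. The function name split_items and both of its arguments are made up for this example. The file is read one line at a time, so it should not need to fit in memory.

```python
import re
from pathlib import Path

OPEN_TAG = '<dublin_core schema="dc">'
CLOSE_TAG = '</dublin_core>'
ITEM_RE = re.compile(r'item_\w+')  # assumed shape of the folder label


def split_items(source, out_root):
    """Stream `source` and write one item_* folder per dublin_core block.

    Hypothetical helper: returns the folder names in the order written.
    """
    out_root = Path(out_root)
    folder = None   # most recent item_* label seen outside a block
    block = None    # lines captured so far, or None when outside a block
    written = []
    with open(source, encoding='utf-8') as fh:
        for line in fh:
            if block is None:
                start = line.find(OPEN_TAG)
                # Only look for a label in the text before the opening tag.
                head = line if start < 0 else line[:start]
                labels = ITEM_RE.findall(head)
                if labels:
                    folder = labels[-1]
                if start < 0:
                    continue
                block = []
                line = line[start:]  # drop anything before the opening tag
            end = line.find(CLOSE_TAG)
            if end < 0:
                block.append(line)
                continue
            # Keep the closing tag itself; text after it on this line is ignored.
            block.append(line[:end + len(CLOSE_TAG)] + '\n')
            if folder is not None:
                target = out_root / folder
                target.mkdir(parents=True, exist_ok=True)
                (target / 'dublin_core.xml').write_text(
                    ''.join(block), encoding='utf-8')
                (target / 'contents').write_text(
                    'license.txt\tbundle:LICENSE\n', encoding='utf-8')
                written.append(folder)
            # A block with no preceding label is skipped rather than guessed.
            folder = None
            block = None
    return written
```

A call such as split_items('huge_file.txt', '.') (file name hypothetical) would create the item_* folders in the current directory. If the real file is one very long line, or the label format differs, the regular expression and the line-by-line loop would need adjusting.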
 

Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.