Extract common data out of multiple files Post: 302748023

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

How to rename multiple files with a common suffix

Hi, there are multiple files like file1_11, file2_11, file3_11, and so on. How can I rename them so that the suffix _11 is removed and they become file1, file2, file3? Any help is appreciated. Regards, er_ashu

2. Shell Programming and Scripting

Get common lines from multiple files

FileA chr1 31237964 NP_001018494.1 PUM1 M340L chr1 31237964 NP_055491.1 PUM1 M340L chr1 33251518 NP_037543.1 AK2 H191D chr1 33251518 NP_001616.1 AK2 H191D chr1 57027345 NP_001004303.2 C1orf168 P270S FileB chr1 ...

3. UNIX for Dummies Questions & Answers

AWK, extract data from multiple files

Hi, I'm using AWK to try to extract data from multiple files (*.txt). The script should look for a flag that occurs at a specific position in each file and it should return the data to the right of that flag. I should end up with one line for each file, each containing 3 columns:...

4. UNIX for Dummies Questions & Answers

Using AWK: Extract data from multiple files and output to multiple new files

Hi, I'd like to process multiple files. For example: file1.txt file2.txt file3.txt Each file contains several lines of data. I want to extract a piece of data and output it to a new file. file1.txt ----> newfile1.txt file2.txt ----> newfile2.txt file3.txt ----> newfile3.txt Here is...

5. Shell Programming and Scripting

Extract common words from two/more csv files

I have two (or more, to make it generic) csv files. Each line contains words separated by commas. None of the words contain spaces. The number of words per line is not fixed: some lines may have one word, and some may have 12... The number of lines per file is also not fixed. What I need is to find common words...

6. Shell Programming and Scripting

Find common lines between multiple files

Hello everyone. A few years ago the user radoulov posted a fancy solution for a problem, which was about finding common lines (gene variation names) between multiple samples (files). The code was: awk 'END { for (R in rec) { n = split(rec, t, "/") if (n > 1) dup = dup ?...

7. Shell Programming and Scripting

Compare multiple files, and extract items that are common to ALL files only

I have this code awk 'NR==FNR{a=$1;next} a' file1 file2 which does what I need it to do, but for only two files. I want to make it so that I can have multiple files (for example 30) and the code will return only the items that are in every single one of those files and ignore the ones...

8. Shell Programming and Scripting

Extract data in tabular format from multiple files

Hi, I have a directory with multiple files from which I need to extract portions of specific lines and insert them in a new file; the new file will contain a separate column for each file's data. Example: I need to extract Value_1 & Value_3 from all files and insert them in the output file as below: ...

9. Shell Programming and Scripting

Get both common and missing values from multiple files

Hi, I have 5 files with two columns. I need to merge all 5 files based on column 1. If a key is missing from any of them, the corresponding 2nd column should be substituted by a missing value. I know how to do this for 2 files, but how can I implement it for 5 files? I tried this based on 5 files but it...

10. Shell Programming and Scripting

Merge multiple files with common header

Hi all, say I have multiple files x1 x2 x3 x4, all with a common header (date, time, year, age). How can I merge them into one single file "X" in shell scripting? Thanks for your suggestions.

data::dumpxml

DumpXML(3pm)						User Contributed Perl Documentation					      DumpXML(3pm)

NAME

       Data::DumpXML - Dump arbitrary data structures as XML

SYNOPSIS

	use Data::DumpXML qw(dump_xml);
	$xml = dump_xml(@list);

DESCRIPTION

       This module provides a single function called dump_xml() that takes a list of Perl values as its argument and produces a string as its
       result.  The string returned is an XML document that represents any Perl data structures passed to the function.  Reference loops are
       handled correctly.
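
       As a rough illustration of the reference-loop case, the sketch below dumps a hash that contains a reference to itself.  The
       variable name $node and its keys are made up for this example, and the exact XML produced is not shown here because it depends
       on the configuration variables in effect.

```perl
use strict;
use warnings;
use Data::DumpXML qw(dump_xml);

# Hypothetical data: a hash that refers back to itself.
my $node = { name => "loop" };
$node->{self} = $node;

# Called in non-void context, dump_xml() returns the XML as a string
# instead of printing it.
my $xml = dump_xml($node);
print $xml;
```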

       The following data model is used:

	  data : scalar*
	  scalar = undef | str | ref | alias
	  ref : scalar | array | hash | glob | code
	  array: scalar*
	  hash: (key scalar)*

       The distribution comes with an XML schema and a DTD that more formally describe this structure.

       As an example of the XML documents produced, the following call:

	 $a = bless [1,2], "Foo";
	 dump_xml($a);

       produces:

	 <?xml version="1.0" encoding="US-ASCII"?>
	 <data xmlns="http://www.cpan.org/.../Data-DumpXML.xsd">
	  <ref>
	   <array class="Foo">
	    <str>1</str>
	    <str>2</str>
	   </array>
	  </ref>
	 </data>

       If dump_xml() is called in a void context, then the dump is printed on STDERR automatically.  For compatibility with "Data::Dump", there is
       also an alias for dump_xml() called simply dump().

       "Data::DumpXML::Parser" is a class that can restore data structures dumped by dump_xml().

       Configuration variables

       The generated XML is influenced by a set of configuration variables.  If you modify them, then it is a good idea to localize the effect.
       For example:

	 sub my_dump_xml {
	     local $Data::DumpXML::INDENT = "";
	     local $Data::DumpXML::XML_DECL = 0;
	     local $Data::DumpXML::DTD_LOCATION = "";
	     local $Data::DumpXML::NS_PREFIX = "dumpxml";

	     return dump_xml(@_);
	 }

       The variables are:

       $Data::DumpXML::INDENT
	   You can set the variable $Data::DumpXML::INDENT to control the amount of indenting.  The variable contains the whitespace you want to
	   be used for each level of indenting.  The default is a single space.  To suppress indenting, set it to "".

       $Data::DumpXML::INDENT_STYLE
	   This variable controls where end elements are placed.  If you set this variable to the value "Lisp" then end tags are not prefixed by
	   NL.  This gives a more compact output.

       $Data::DumpXML::XML_DECL
	   This boolean variable controls whether an XML declaration should be prefixed to the output.  The XML declaration is the <?xml ...?>
	   thingy.  The default is 1.  Set this value to 0 to suppress the declaration.

       $Data::DumpXML::NAMESPACE
	   This variable contains the namespace used for the XML elements.  The default is to let this be a URI that actually resolves to the XML
	   schema on CPAN.  Set it to "" to disable use of namespaces.

       $Data::DumpXML::NS_PREFIX
	   This variable contains the namespace prefix to use on the elements.  The default is "", which means that a default namespace will be
	   declared.

       $Data::DumpXML::SCHEMA_LOCATION
	   This variable contains the location of the XML schema.  If this variable is non-empty, then an "xsi:schemaLocation" attribute is added
	   to the top level "data" element.  The default is not to include this, as the location can be inferred from the default XML namespace
	   used.

       $Data::DumpXML::DTD_LOCATION
	   This variable contains the location of the DTD.  If this variable is non-empty, then a <!DOCTYPE ...> is included in the output.  The
	   default is to point to the DTD on CPAN.  Set it to "" to suppress the <!DOCTYPE ...> line.
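
       The variables described above can be combined to get output that is as compact as possible.  The following is only a sketch:
       compact_xml() is a hypothetical wrapper name, not something the module provides, and the settings are taken from the variable
       descriptions above.

```perl
use strict;
use warnings;
use Data::DumpXML qw(dump_xml);

# compact_xml() is a hypothetical helper, not part of Data::DumpXML.
sub compact_xml {
    # Localize every setting so callers elsewhere still see the defaults.
    local $Data::DumpXML::INDENT       = "";      # no indentation
    local $Data::DumpXML::INDENT_STYLE = "Lisp";  # end tags not prefixed by NL
    local $Data::DumpXML::XML_DECL     = 0;       # no <?xml ...?> declaration
    local $Data::DumpXML::NAMESPACE    = "";      # no namespace on the elements
    local $Data::DumpXML::DTD_LOCATION = "";      # no <!DOCTYPE ...> line

    return dump_xml(@_);
}

print compact_xml({ key => "value" }), "\n";
```

       Because each assignment uses local, the defaults are back in effect as soon as compact_xml() returns.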

BUGS

       Class names with 8-bit characters are dumped as Latin-1, but converted to UTF-8 when restored by the Data::DumpXML::Parser.

       The contents of globs and subroutines are not dumped.  They are restored as the strings "** glob **" and "** code **".

       LVALUE and IO objects are not dumped at all.  They simply disappear from the restored data structure.
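
       The notes above concern data restored by Data::DumpXML::Parser.  A minimal round-trip sketch follows.  It assumes the parser's
       documented behaviour as an XML::Parser subclass whose parse() method returns a reference to an array of the dumped values; the
       data and variable names are illustrative only.

```perl
use strict;
use warnings;
use Data::DumpXML qw(dump_xml);
use Data::DumpXML::Parser;

# Illustrative data: a hash holding a blessed array.
my $original = { name => "example", items => bless([1, 2], "Foo") };

# Dump to an XML string (non-void context, so nothing is printed).
my $xml = dump_xml($original);

# parse() is inherited from XML::Parser.  The result is assumed to be a
# reference to an array with one entry per value passed to dump_xml().
my $parser   = Data::DumpXML::Parser->new;
my $restored = $parser->parse($xml);
my $copy     = $restored->[0];

print "class of items: ", ref($copy->{items}), "\n";
print "item count:     ", scalar(@{ $copy->{items} }), "\n";
```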

SEE ALSO

       Data::DumpXML::Parser, XML::Parser, XML::Dumper, Data::Dump

AUTHORS

       The "Data::DumpXML" module is written by Gisle Aas <gisle@aas.no>, based on "Data::Dump".

       The "Data::Dump" module was written by Gisle Aas, based on "Data::Dumper" by Gurusamy Sarathy <gsar@umich.edu>.

	Copyright 1998-2003 Gisle Aas.
	Copyright 1996-1998 Gurusamy Sarathy.

       This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

perl v5.8.8							    2006-04-08							      DumpXML(3pm)