Perl script to merge cells in column1 which has same strings, for all sheets in a excel workbook


 
Thread Tools Search this Thread
Top Forums Programming Perl script to merge cells in column1 which has same strings, for all sheets in a excel workbook
# 1  
Old 08-18-2015
Perl script to merge cells in column1 which has same strings, for all sheets in a excel workbook

Perl script to merge cells

---------- Post updated at 12:59 AM ---------- Previous update was at 12:54 AM ----------

I am using below code to read files from a dir and print to excel.

Code:
open(my $in, '<', $file) or die "Could not open file: $!";
        my $rowCount = 0;
		my $colCount = 0;
        while(<$in>)
        {
			my @elements = split(',',$_);
			foreach my $el(@elements)
            {
				$wrksheet->write($rowCount,$colCount,$el);
				$colCount++;
            }
            $colCount = 0;
            $rowCount++;
        }


Last edited by Jack_Bruce; 08-24-2015 at 02:06 PM.. Reason: code tags
# 2  
Old 08-19-2015
Show the input you have and the output you want.
# 3  
Old 08-19-2015
We also would need to see the use statements as well as what you used to create the $wrksheet object.
# 4  
Old 08-20-2015
Below is the complete code which works great converting all csv files(sorted based on column1) in dest directory to a single xls with multiple tabs.

Code:
use warnings;
use Spreadsheet::WriteExcel;
my $dest = '/home/dest';
my $workbook = Spreadsheet::WriteExcel->new("test.csv");
chdir $dest or die "no such directory: $!";
if ( -d $dest ) {
    opendir my $dh, $dest or die "can't open directory: $!";
	my @files = sort { $a cmp $b } readdir($dh);
	while ( my $file = shift @files ) {
        chomp $file;
        next if $file eq '.' or $file eq '..';
        my $sheetname = `basename $file | cut -d. -f1`;
        my $wrksheet = $workbook->add_worksheet($sheetname);
        open(my $in, '<', $file) or die "Could not open file: $!";
        my $rowCount = 0;
		my $colCount = 0;
		$colCount = 0;
        while(<$in>)
        {
			my @elements = split(',',$_);
			foreach my $el(@elements)
            {
				$wrksheet->write($rowCount,$colCount,$el);
				$colCount++;
            }
            $colCount = 0;
            $rowCount++;
        }
    }
}

---------- Post updated at 07:24 PM ---------- Previous update was at 07:23 PM ----------

The xls file(with multiple rows on each sheet) which i am generating with above code will look like below:

Code:
abs-pq        tfr23 	 12345
abs-pq	tfr24	         12843
abs-pq       tfr24           12435
abs-pqrst	rts09           19923|23141
abs-pqrst	rts10	         23456
tbs-pqrst	tfr25	          21938|22143
tbs-pqrst	zzz0z	          2414|5213|4306

column1 duplicates needs to be removed and merged as one cell.
column2 duplicates needs to be removed based on merged column1 only.( first three lines of below )
column3 should be unchanged.

need output like below(ignore ___ just filled gaps to show difference) :
Code:
abs-pq  tfr23 12345
______tfr24 12843
___________12435
abs-pqrst  rts09 19923|23141
_________rts10	23456
tbs-pqrst	tfr25	21938|22143
_________zzz0z	2414|5213|4306

any help in extending my original code to meet the requirement is greatly appreciated.

Last edited by Corona688; 08-20-2015 at 12:13 PM..
# 5  
Old 08-20-2015
I cannot tell what the output you want is supposed to look like. Try posting your output again, this time with code tags, [code] stuff [/code]
# 6  
Old 08-21-2015
Code:
INPUT:
abs-pq	        tfr23	12345
abs-pq	        tfr24	12843
abs-pq	        tfr24   12435
abs-pqrst	tfr24   19923|23141
abs-pqrst	rts10	23456
tbs-pqrst	tfr25	21938|22143
tbs-pqrst	zzz0z	2414|5213|4306

OUTPUT REQUIRED:
abs-pq	        tfr23	  12345
	        tfr24     12843
			  12435
abs-pqrst	tfr24     19923|23141
        	rts10	  23456
tbs-pqrst	tfr25	  21938|22143
	        zzz0z	  2414|5213|4306

NOTE: cell B4 also have tfr24 but it should not be merged since column1 value is different.

Last edited by Jack_Bruce; 08-21-2015 at 03:18 AM..
# 7  
Old 08-21-2015
Something like:
Code:
#! /usr/bin/perl

use strict;
use warnings;
use Spreadsheet::WriteExcel;

my $file = shift @ARGV;
my $workbook = Spreadsheet::WriteExcel->new($file) or die "can't create worksheet: $!";

my $dest = shift @ARGV;
chdir $dest or die "no such directory: $!";
opendir my $dh, '.' or die "can't open directory: $!";
my @files = sort grep { m{^[^.]} } readdir($dh);
close $dh; 

foreach my $file (@files) {
    open my $in, '<', $file or die "Could not open file: $!";

    $file =~ s{\..*$}{};
    my $worksheet   = $workbook->add_worksheet($file);

    my @prevRow = ();
    my $row = 0;

    while(<$in>) {
        my @currRow  = split(',', $_);
        my $col = 0;

        while ($col < @currRow) {
            last if @prevRow < $col || $currRow[$col] ne $prevRow[$col];
            $worksheet->write($row, $col, '');
            $col++;
        }

        while ($col < @currRow) {
            $worksheet->write($row, $col, $currRow[$col]);
            $col++;
        }

        @prevRow = @currRow;
        $row++;
    }
}

Invoked with:
Code:
scriptname workbookfile sourcedir...

Some notes:
  • if ( -d $dest ) { was not needed, the previous chdir would have failed if $dest was not a directory.
  • chomp $file is not needed, a trailing newline is an acceptable (if not appreciated) character in a directory entry
  • Be reluctant to use system calls in a perlscript. In this case, my $sheetname = `basename $file | cut -d. -f1`; was replaced by $file =~ s{\..*$}{}; as $file is already a "basename" and you were just trimming off everything after the first ".".
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Help with Perl script for identifying dupes in column1

Dear all, I have a large dictionary database which has the following structure source word=target word e.g. book=livre Since the database is very large in spite of all the care taken, it so happens that at times the source word is repeated e.g. book=livre book=tome Since I want to... (7 Replies)
Discussion started by: gimley
7 Replies

2. Shell Programming and Scripting

Write two csv files into one excel with multiple sheets

I have requirement to write two CSV files to one single excel with multiple sheets. Data present in the two files should sit in excel as different sheets. How can we achieve this using shell script? 1.csv 2. csv 1,2,3,4 5,6,7,8 XXXXX YYYYY Res.excel 1.csv data... (1 Reply)
Discussion started by: duplicate
1 Replies

3. Shell Programming and Scripting

Perl script to Merge contents of 2 different excel files in a single excel file

All, I have an excel sheet Excel1.xls that has some entries. I have one more excel sheet Excel2.xls that has entries only in those cells which are blank in Excel1.xls These may be in different workbooks. They are totally independent made by 2 different users. I have placed them in a... (1 Reply)
Discussion started by: Anamika08
1 Replies

4. Shell Programming and Scripting

Merge two cells in excel via UNIX?

Hi UNIX Gods! Is it possible to merge two cells in .csv file using unix commands? Imagine that this is my present csv file opened via excel: Gate Reports| | fatal alerts | 200 | is is possible to make it look like this using unix? Gate Reports | fatal... (1 Reply)
Discussion started by: 4dirk1
1 Replies

5. Shell Programming and Scripting

modify Existing MS excel workbook in perl

Hi I need to modify an excel file in perl and for which I installed perl in Linux 1. Open a existing excel file 2. delete an unwanted Sheet called "summary" 3. and i want to insert some data into range of cells ( B1:B11) 4. Remove unwanted value called "Sum" repeated in the... (1 Reply)
Discussion started by: luke_devon
1 Replies

6. Shell Programming and Scripting

Sending SQL Queries output to different Excel sheets

Hi, I need your help in sedning sql queries output to different excel sheets. My requirement is like this: Query1: Select name from table1 where status = 'Complete' Query2: Select name from table1 where status = 'Failed' Query3: Select name from table1 where status = 'Ignored' ... (4 Replies)
Discussion started by: parvathi_rd
4 Replies

7. Shell Programming and Scripting

PERL: Split Excel Workbook to Indiv Excel files

Hi, I am trying to find a way to read an excel work book with multiple worksheets. And write each worksheet into a new excel file using perl. My environment is Unix. For example: I have an excel workbook TEST.xls and it has Sheet1, Sheet2, Sheet3 worksheets. I would like to create... (2 Replies)
Discussion started by: sandeep78
2 Replies

8. Shell Programming and Scripting

How to format excel sheets in UNIX??

Hi, I have generated an excel sheet using a shell script. i have converted the output text file to an excel and got the desired output. However, in a particular column in the excel the values of the numbers start with 0. e.g. 078393343, 00342442, etc. But, in the resulting excel I get as... (2 Replies)
Discussion started by: Vijay06
2 Replies

9. Shell Programming and Scripting

Multiple excel work sheets through UNIX

Hi, There is this requirement to create multiple work sheets in an MS Excel file through UNIX. We normally can create one work sheet in unix by either tab or comma delimiting and appending .xls or .csv to the file name, but can we create multiple work sheets. Regards, Puspendu (1 Reply)
Discussion started by: puspendu
1 Replies

10. Programming

creating more than 2 excel sheets in C

Hi, I have a question about using C to write excel file. There is a way to write into a sheet into a file excel, like this: FILE * File; pMyFile = fopen("test.xls", "w+"); fprintf(File, "\n\n"); /* go to the row 2 of the column A*/ fprintf(File, "%s", "Hi pippo!"); /* write... (7 Replies)
Discussion started by: manceryder
7 Replies
Login or Register to Ask a Question