Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Extracting data from PDF files into CSV file Post 302553025 by yazu on Tuesday 6th of September 2011 11:44:54 AM
Old 09-06-2011
1. Write a script to convert a file in csv file and check it on your files.
2. Convert all files to csv files.
3. Extract needed information.

The first step is possible using a lot of tools - I used Poppler:
Code:
pdftotext -layout 01.pdf -| sed '$d' |  tail -n5 | sed -r 's/  +/,/g; s/ //g' 
Georgia,-,1,2,9,29,963,938,1283,-,10,5,19
SouthCarolina,-,74,63,158,19,4680,4362,9454,-,25,18,10
Alabama,-,14,28,24,8,1026,1026,1049,-,6,9,6
Florida,3,92,88,186,85,3222,3233,5409,-,13,9,4
California,2,199,169,582,60,5640,5259,10475,-,30,33,64

Try the first step (and check it on your files). Then it's easy to manage the second and third steps.
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Perl - extracting data from .csv files

PROJECT: Extracting data from an employee timesheet. The timesheets are done in excel (for user ease) and then converted to .csv files that look like this (see color code key below): ,,,,,,,,,,,,,,,,,,, 9/14/2003,<-- Week Ending,,,,,,,,,,,,,,,,,, Craig Brennan,,,,,,,,,,,,,,,,,,,... (3 Replies)
Discussion started by: kregh99
3 Replies

2. Shell Programming and Scripting

extracting data from files..

frnds, I m having prob woth doing some 2-3 task simultaneously... what I want is... I have lots ( lacs ) of files in a dir... I want.. these info from arround 2-3 months files filename convention is - abc20080403sdas.xyz ( for todays files ) I want 1. total no of files for 1 dec... (1 Reply)
Discussion started by: clx
1 Replies

3. Shell Programming and Scripting

Compare two csv files by two colums and create third file combining data from them.

I've got two large csv text table files with different number of columns each. I have to compare them based on first two columns and create resulting file that would in case of matched first two columns include all values from first one and all values (except first two colums) from second one. I... (5 Replies)
Discussion started by: agb2008
5 Replies

4. Shell Programming and Scripting

extracting data from a .csv file

I have a .csv file equipment,bandtype abc,aws def,mmds ghi,umts jkl,mmds I can get the equipment from `hostname`. In my script i want to check what is the hostname. then see if it exists in the.csv file. if it does then i want to store the second parameter(bandtype) for the corresponding... (3 Replies)
Discussion started by: lassimanji
3 Replies

5. Solaris

Convert csv file into pdf file from putty

Hi, My requirement is that i have to convenrt a csv file inyo a pdf file . So is there any command which will do that ??? thanks Sambuddha (2 Replies)
Discussion started by: Sambuddha
2 Replies

6. Shell Programming and Scripting

Script for extracting data from csv file based on column values.

Hi all, I am new to shell script.I need your help to write a shell script. I need to write a shell script to extract data from a .csv file where columns are ',' separated. The file has 5 columns having values say column 1,column 2.....column 5 as below along with their valuesm.... (3 Replies)
Discussion started by: Vivekit82
3 Replies

7. Shell Programming and Scripting

How to create or convert to pdf files from csv files using shell script?

Hi, Can anyone help me how to convert a .csv file to a .pdf file using shell script Thanks (2 Replies)
Discussion started by: ssk250
2 Replies

8. Shell Programming and Scripting

Compare 2 files of csv file and match column data and create a new csv file of them

Hi, I am newbie in shell script. I need your help to solve my problem. Firstly, I have 2 files of csv and i want to compare of the contents then the output will be written in a new csv file. File1: SourceFile,DateTimeOriginal /home/intannf/foto/IMG_0713.JPG,2015:02:17 11:14:07... (8 Replies)
Discussion started by: refrain
8 Replies

9. Shell Programming and Scripting

Extracting data from specific rows and columns from multiple csv files

I have a series of csv files in the following format eg file1 Experiment Name,XYZ_07/28/15, Specimen Name,Specimen_001, Tube Name, Control, Record Date,7/28/2015 14:50, $OP,XYZYZ, GUID,abc, Population,#Events,%Parent All Events,10500, P1,10071,95.9 Early Apoptosis,1113,11.1 Late... (6 Replies)
Discussion started by: pawannoel
6 Replies

10. Shell Programming and Scripting

Extracting part of data from files

Hi All, I have log files as below. log1.txt <table name="content_analyzer" primary-key="id"> <type="global" /> </table> <table name="content_analyzer2" primary-key="id"> <type="global" /> </table> Time taken: 1.008 seconds ID = gd54321bbvbvbcvb <table name="content_analyzer"... (7 Replies)
Discussion started by: ROCK_PLSQL
7 Replies
XML2(1) 						      General Commands Manual							   XML2(1)

NAME
xml2 - convert xml documents in a flat format 2xml - convert flat format into xml html2 - convert html documents in a flat format 2html - convert flat format into html csv2 - convert csv files in a flat format 2csv - convert flat format into csv SYNOPSIS
<xml2|2xml|html2|2html|csv2|2csv> > outfile < infile DESCRIPTION
There are six tools. Except csv2 and and 2csv they don't take any command-line arguments. They are all simple filters which can be used to read files from standard input in one format and output it to standard output in another format. The flat format used by the tools is specific to these tools. It is a syntax for representing structured markup in a way that makes it easy to process with line-oriented tools. The same format is used for HTML and XML; in fact, you can think of html2 as converting HTML to XHTML and running xml2 on the result; likewise 2html and 2xml. (Of course, this isn't how the implementation works.) SEE ALSO
This program does normally not include any documentation in form of manpages. However it has a real excellent documentation online with a lot of example. In fact this manpage was based on this documentation. Please find it on: http://dan.egnor.name/xml2/ref Examples can be found here: http://dan.egnor.name/xml2/examples AUTHOR
xml2 was written by Dan Egnor. This manpage was written by Patrick Schoenfeld <schoenfeld@in-medias-res.com> for the Debian project, but may be used by others under the same terms as xml2 is distributed. BUGS
Bugs can be reported through the Debian Bug tracking system. 7h february 2008 XML2(1)
All times are GMT -4. The time now is 06:14 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy