The UNIX and Linux Forums  
Hello and Welcome from United States to the UNIX and Linux Forums! Thank You for Visiting and Joining Our Global Community.

Go Back   The UNIX and Linux Forums > Top Forums > Shell Programming and Scripting
.
google unix.com



Shell Programming and Scripting Post questions about KSH, CSH, SH, BASH, PERL, PHP, SED, AWK and OTHER shell scripts and shell scripting languages here.

More UNIX and Linux Forum Topics You Might Find Helpful
Thread Thread Starter Forum Replies Last Post
"too big" and "not enough memory" errors in shell script jerardfjay AIX 11 03-16-2009 11:09 PM
script running with "ksh" dumping core but not with "sh" simhe02 HP-UX 9 11-04-2008 08:52 PM
#!/bin/sh script fails at StringA | tr "[x]" "[y]" by_tg UNIX for Dummies Questions & Answers 3 02-22-2008 12:17 PM
Development Releases: Linux Mint 4.0 Beta "Fluxbox", 4.0 Alpha "Debian" iBot UNIX and Linux RSS News 0 01-04-2008 03:00 PM
Explain the line "mn_code=`env|grep "..mn"|awk -F"=" '{print $2}'`" Lokesha UNIX for Dummies Questions & Answers 4 12-20-2007 01:52 AM

Closed Thread
English Japanese Spanish French German Portuguese Italian Dutch Swedish Russian Norwegian Hungarian Hebrew Danish Powered by Powered by Google
 
LinkBack Thread Tools Search this Thread Rate Thread Display Modes
  #1 (permalink)  
Old 03-18-2009
coolkid coolkid is offline
Registered User
  
 

Join Date: Jan 2008
Posts: 69
Script for "Crawling a doc"

Hi Everyone
How you doing all.Im planning to write a script that will crawl a MS-Document
and should take the values from it.Is it possible at all.Im not a scripting guru just want to know your thoughts..

Im planning to do some thing like this:

Microsoft Document has:

Servername: abc.abc.com

Port:443

I would like to write a script that would crawl particular document and should fetch me those values..

Appreciate your help guys
-K
  #2 (permalink)  
Old 03-18-2009
cfajohnson's Avatar
cfajohnson cfajohnson is offline Forum Advisor  
Shell programmer, author
  
 

Join Date: Mar 2007
Location: Toronto, Canada
Posts: 2,361

First, convert it to a text file. There is a command, antiword, to extract the text from a MS .doc file.
  #3 (permalink)  
Old 03-18-2009
coolkid coolkid is offline
Registered User
  
 

Join Date: Jan 2008
Posts: 69
Hi John
Thanks for the quick reply.I have to use this thing at work and I see it doesnt come with linux/unix by default and we have to install it is a freeware .Is there any other way around.

Thanks
Kev
  #4 (permalink)  
Old 03-18-2009
cfajohnson's Avatar
cfajohnson cfajohnson is offline Forum Advisor  
Shell programmer, author
  
 

Join Date: Mar 2007
Location: Toronto, Canada
Posts: 2,361
Quote:
Originally Posted by coolkid View Post
Hi John
Thanks for the quick reply.I have to use this thing at work and I see it doesnt come with linux/unix by default and we have to install it is a freeware .Is there any other way around.

Why do you want to use a Unix shell script if you are not in a Unix environment?
  #5 (permalink)  
Old 03-18-2009
coolkid coolkid is offline
Registered User
  
 

Join Date: Jan 2008
Posts: 69
Smile

We use unix systems to process the requests and our users give us what they need using MS Docs...So I thought instead of manually reading all the values from MS Doc crawling the .doc would be a great idea which ofcourse will reduce my time.
  #6 (permalink)  
Old 03-18-2009
methyl methyl is offline
Registered User
  
 

Join Date: Mar 2008
Posts: 1,163
A tatty way:
strings document|grep "what you want"
  #7 (permalink)  
Old 03-18-2009
coolkid coolkid is offline
Registered User
  
 

Join Date: Jan 2008
Posts: 69
Smile

Hi methyl
Miraculously it did worked man.Iam able to get the values exactly what Iam looking for.Thanks buddy.
Sponsored Links
Closed Thread

Bookmarks

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On




All times are GMT -4. The time now is 06:56 AM.


Powered by: vBulletin, Copyright ©2000 - 2006, Jelsoft Enterprises Limited. Language Translations Powered by .
vBCredits v1.4 Copyright ©2007 - 2008, PixelFX Studios
The UNIX and Linux Forums Content Copyright ©1993-2009. All Rights Reserved.Ad Management by RedTyger

Content Relevant URLs by vBSEO 3.2.0