Script for "Crawling a doc"


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Script for "Crawling a doc"
# 1  
Old 03-18-2009
Script for "Crawling a doc"

Hi Everyone
How you doing all.Im planning to write a script that will crawl a MS-Document
and should take the values from it.Is it possible at all.Im not a scripting guru just want to know your thoughts..

Im planning to do some thing like this:

Microsoft Document has:

Servername: abc.abc.com

Port:443

I would like to write a script that would crawl particular document and should fetch me those values..

Appreciate your help guys
-K
# 2  
Old 03-18-2009

First, convert it to a text file. There is a command, antiword, to extract the text from a MS .doc file.
# 3  
Old 03-18-2009
Hi John
Thanks for the quick reply.I have to use this thing at work and I see it doesnt come with linux/unix by default and we have to install it is a freeware .Is there any other way around.

Thanks
Kev
# 4  
Old 03-18-2009
Quote:
Originally Posted by coolkid
Hi John
Thanks for the quick reply.I have to use this thing at work and I see it doesnt come with linux/unix by default and we have to install it is a freeware .Is there any other way around.

Why do you want to use a Unix shell script if you are not in a Unix environment?
# 5  
Old 03-18-2009
Bug

We use unix systems to process the requests and our users give us what they need using MS Docs...So I thought instead of manually reading all the values from MS Doc crawling the .doc would be a great idea which ofcourse will reduce my time.
# 6  
Old 03-18-2009
A tatty way:
strings document|grep "what you want"
# 7  
Old 03-18-2009
Bug

Hi methyl
Miraculously it did worked man.Iam able to get the values exactly what Iam looking for.Thanks buddy.
Login or Register to Ask a Question

Previous Thread | Next Thread

9 More Discussions You Might Find Interesting

1. AIX

Apache 2.4 directory cannot display "Last modified" "Size" "Description"

Hi 2 all, i have had AIX 7.2 :/# /usr/IBMAHS/bin/apachectl -v Server version: Apache/2.4.12 (Unix) Server built: May 25 2015 04:58:27 :/#:/# /usr/IBMAHS/bin/apachectl -M Loaded Modules: core_module (static) so_module (static) http_module (static) mpm_worker_module (static) ... (3 Replies)
Discussion started by: penchev
3 Replies

2. Shell Programming and Scripting

Bash script - Print an ascii file using specific font "Latin Modern Mono 12" "regular" "9"

Hello. System : opensuse leap 42.3 I have a bash script that build a text file. I would like the last command doing : print_cmd -o page-left=43 -o page-right=22 -o page-top=28 -o page-bottom=43 -o font=LatinModernMono12:regular:9 some_file.txt where : print_cmd ::= some printing... (1 Reply)
Discussion started by: jcdole
1 Replies

3. UNIX for Dummies Questions & Answers

Using "mailx" command to read "to" and "cc" email addreses from input file

How to use "mailx" command to do e-mail reading the input file containing email address, where column 1 has name and column 2 containing “To” e-mail address and column 3 contains “cc” e-mail address to include with same email. Sample input file, email.txt Below is an sample code where... (2 Replies)
Discussion started by: asjaiswal
2 Replies

4. UNIX for Dummies Questions & Answers

"Help with bash script" - "License Server and Patch Updates"

Hi All, I'm completely new to bash scripting and still learning my way through albeit vey slowly. I need to know where to insert my server names', my ip address numbers through out the script alas to no avail. I'm also searching on how to save .sh (bash shell) script properly.... (25 Replies)
Discussion started by: profileuser
25 Replies

5. Shell Programming and Scripting

awk command to replace ";" with "|" and ""|" at diferent places in line of file

Hi, I have line in input file as below: 3G_CENTRAL;INDONESIA_(M)_TELKOMSEL;SPECIAL_WORLD_GRP_7_FA_2_TELKOMSEL My expected output for line in the file must be : "1-Radon1-cMOC_deg"|"LDIndex"|"3G_CENTRAL|INDONESIA_(M)_TELKOMSEL"|LAST|"SPECIAL_WORLD_GRP_7_FA_2_TELKOMSEL" Can someone... (7 Replies)
Discussion started by: shis100
7 Replies

6. Shell Programming and Scripting

Simplify Bash Script Using "sed" Or "awk"

Input file: 2 aux003.net3.com error12 6 awn0117.net1.com error13 84 aux008 error14 29 aux001.ha.ux.isd.com error12 209 aux002.vm.ux.isd.com error34 21 alx0027.vm.net2.com error12 227 dux001.net5.com error123 22 us008.dot.net2.com error121 13 us009.net2.com error129Expected Output: 2... (4 Replies)
Discussion started by: sQew
4 Replies

7. Solaris

Who has "Solaris 10 Advanced System Administration(SA-202-S10)" latest revision doc?

Dear All, I am going to examine SA202 but I got SA-202-S10 revision A.1 so it might be out of date. Who has "Solaris 10 Advanced System Administration(SA-202-S10)" latest revision document? Could you share it ? Thank you in advance (2 Replies)
Discussion started by: unitipon
2 Replies

8. UNIX for Advanced & Expert Users

Crawling a MS-Doc using scripting

Hi Everyone How you doing all.Im planning to write a script that will crawl a MS-Document and should take the values from it.Is it possible at all.Im not a scripting guru just want to know your thoughts.. Im planning to do some thing like this: Microsoft Document has: Servername:... (1 Reply)
Discussion started by: coolkid
1 Replies

9. UNIX for Dummies Questions & Answers

Explain the line "mn_code=`env|grep "..mn"|awk -F"=" '{print $2}'`"

Hi Friends, Can any of you explain me about the below line of code? mn_code=`env|grep "..mn"|awk -F"=" '{print $2}'` Im not able to understand, what exactly it is doing :confused: Any help would be useful for me. Lokesha (4 Replies)
Discussion started by: Lokesha
4 Replies
Login or Register to Ask a Question