Full Discussion: Math with wordcount output
Post 302441017 by methyl on Thursday 29th of July 2010 06:37:42 AM
Lorem ipsum, surely?

Anyway there appears to be a problem with your pipeline.

Quote:
gunzip path/filename.pages | cat index.xml | xsltproc path/index.xsl | wc -w
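If we assume that you are processing a file called "index.xml" extracted from the archive filename.pages, the first pipe character has to go: gunzip should run as a separate command and finish before anything reads "index.xml". As written, "cat index.xml" is given a file name, so it reads that file directly and ignores whatever arrives on its standard input; the first pipe passes nothing useful along. With that pipe removed the pipeline makes sense. We can then adjust the total (assuming that the faulty pipeline was not giving you wrong figures by reading "index.xml" before gunzip finished).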
My guess at the pipeline may be flawed - see below.

Code:
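# Unpack the archive first; this must finish before index.xml is read.
gunzip path/filename.pages
# Transform index.xml and count the words in the result.
words_total=$(cat index.xml | xsltproc path/index.xsl | wc -w)
# Subtract the fixed 522 words, but only if the total is at least that large.
if [ ${words_total} -ge 522 ]
then
	words_total=$((words_total - 522))
fi
echo $words_total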
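
The adjustment step can be checked on its own, without gunzip or xsltproc. This is only a minimal sketch, assuming the same 522-word offset as the code above; the function name adjust_total and the variable OFFSET are made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: subtract a fixed offset from a word count,
# leaving the count untouched when it is smaller than the offset.
OFFSET=522

adjust_total() {
    total=$1
    if [ "$total" -ge "$OFFSET" ]; then
        total=$((total - OFFSET))
    fi
    echo "$total"
}

adjust_total 1000   # prints 478
adjust_total 100    # prints 100 (below the offset, so unchanged)
```

Feeding it a few known numbers like this shows whether the arithmetic behaves as intended before it is wired to the real word count.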


Afterthought: The whole script could be flawed. I am not familiar with xsltproc and its parameters, and I don't know whether each archive includes a file called "index.xml" or where "path/index.xsl" comes from. Can you try each program in turn to be sure that the flow (particularly the flow into each pipe) is not flawed and that the output of the pipeline is something which can be counted with "wc -w"? I'm starting to wonder whether this should be a sequence of commands with no pipeline at all.
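
One way to try each program in turn is to save every stage's output to a file and look at the exit status and size before moving on. The sketch below is an assumption about how that could be organised, not a tested fix for the original script: check_stage is a made-up helper, and printf and tr stand in for the real commands so that it runs anywhere.

```shell
#!/bin/sh
# Hypothetical helper: run one stage, save its output to a file, and
# report the exit status and how many bytes it produced.
check_stage() {
    label=$1
    outfile=$2
    shift 2
    "$@" > "$outfile"
    status=$?
    bytes=$(wc -c < "$outfile" | tr -d ' ')
    echo "$label: exit status $status, $bytes bytes of output"
    return "$status"
}

# Stand-in stages; substitute the real commands (for example the
# xsltproc call) once each one is understood.
tmpdir=$(mktemp -d)
check_stage "stage 1" "$tmpdir/stage1.out" printf 'one two three\n'
check_stage "stage 2" "$tmpdir/stage2.out" tr 'a-z' 'A-Z' < "$tmpdir/stage1.out"

# Only count words once every stage has been seen to produce output.
words=$(wc -w < "$tmpdir/stage2.out" | tr -d ' ')
echo "words: $words"
rm -rf "$tmpdir"
```

Because every stage writes to its own file, nothing can start reading a file before the previous command has finished, which is the risk the pipeline version carries.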

Last edited by methyl; 07-29-2010 at 08:18 AM.. Reason: Afterthought:
 

Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.