Sponsored Content
Special Forums UNIX Desktop Questions & Answers Normalizing files for sentence count Post 302735109 by jim mcnamara on Friday 23rd of November 2012 11:50:14 AM
Old 11-23-2012
I am translating your requirement to mean count all of the . ! and ? characters in a file.
This is part of what it means to find sentences. It will have problems, ex.: in text with numbers that have decimals in them. And sentences that end in an ellipsis.... < that is one! Neat. I made a self-referential sentence.

Code:
awk '{ total+=gsub(/[\.\?\!]/,"", $0); next}
END{print "total sentences=",total} ' somefile.txt

You have to decide on the correctness of your approach, based on your data.
This User Gave Thanks to jim mcnamara For This Post:
 

9 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Script to ask for a sentence and then count number of spaces in the sentence

Hi People, I need some Help to write a unix script that asks for a sentence to be typed out then with the sentence. Counts the number of spaces within the sentence and then echo's out "The Number Of Spaces In The Sentence is 4" as a example Thanks Danielle (12 Replies)
Discussion started by: charlie101208
12 Replies

2. Shell Programming and Scripting

Count todays created files and old files

Hello experts, I used following approach to get listing of all files of remote server. Now I have remote server file information on same server. I am getting listing in the output.txt I want to count today's created files and old files. I want to compare the numbers... (11 Replies)
Discussion started by: dipeshvshah
11 Replies

3. UNIX for Dummies Questions & Answers

Count number of files in directory excluding existing files

Hi, Please let me know how to find out number of files in a directory excluding existing files..The existing file format will be unknown..each time.. Thanks (3 Replies)
Discussion started by: ammu
3 Replies

4. Shell Programming and Scripting

Count Files

I was wondering if anyone could help me with this problem: Write a script called countFiles that takes two arguments, the initial directory and the number of levels and returns the count of all files (including directories) in the directories and subdirectories up to the number of levels. ... (4 Replies)
Discussion started by: clammy
4 Replies

5. Shell Programming and Scripting

[grep] how to grep a sentence which has quotation marks "sentence"

I would like to check with grep in this configuration file: { "alt-speed-down": 200, "alt-speed-enabled": true, "alt-speed-time-begin": 1140, "alt-speed-time-day": 127, "...something..." : true, ... } "alt-speed-enabled" (the third line of the file) is setted to... (2 Replies)
Discussion started by: ciro314
2 Replies

6. Programming

Normalizing date value to a single timezone

Hi, Am trying to get a normalized date value irrespective of the time zone of the machine in which following code is run. When the following code is run in 2 different machines with TZ=UTC and TZ=PDT, I get 2 different values. I simply want to normalize the output that is specific to a... (3 Replies)
Discussion started by: matrixmadhan
3 Replies

7. UNIX for Dummies Questions & Answers

How to count different id from a files?

Hi Guys, Please help for counting different task_id:- file name is: sms_push_123.ac:011:045 file records: Now we need to output like: (1 Reply)
Discussion started by: aaditya321
1 Replies

8. Shell Programming and Scripting

Error files count while coping files from source to destination locaton as well count success full

hi All, Any one answer my requirement. I have source location src_dir="/home/oracle/arun/IRMS-CM" My Target location dest_dir="/home/oracle/arun/LiveLink/IRMS-CM/$dc/$pc/$ct" my source text files check with below example.text file content $fn "\t" $dc "\t" $pc "\t" ... (3 Replies)
Discussion started by: sravanreddy
3 Replies

9. Shell Programming and Scripting

Shell script for field wise record count for different Files .csv files

Hi, Very good wishes to all! Please help to provide the shell script for generating the record counts in filed wise from the .csv file My question: Source file: Field1 Field2 Field3 abc 12f sLm 1234 hjd 12d Hyd 34 Chn My target file should generate the .csv file with the... (14 Replies)
Discussion started by: Kirands
14 Replies
dadadodo(1)						      General Commands Manual						       dadadodo(1)

NAME
dadadodo - exterminate all rational thought SYNOPSIS
dadadodo [ options ] [ input-files ] DESCRIPTION
dadadodo is a program that analyses texts for Markov chains of word probabilities and then generates random sentences based on those proba- bilities. Sometimes these sentences are nonsense, but sometimes they cut right through to the heart of the matter and reveal hidden mean- ings. OPTIONS
dadadodo accepts the following options: -c, -count n Generate n sentences. -h, -help Show summary of options and exit. -html Output HTML instead of plain text. -l, -load file Load compiled data from file ('-' for standard input). -o, -output file Save compiled data in file ('-' for standard output). -p, -pause s Delay s seconds between paragraphs. -w, -columns columns Format output for a device columns character cells in width. If not specified, the value of the environment variable COLUMNS is used to determine the width. If that variable is not defined, a width of 72 is assumed. NOTES
Non-option arguments are input files. These should be text files, but may be mail folders or HTML. MIME messages are handled sensibly. When no output file is specified, sentences will be generated from the input data directly. However, loading a saved file is far faster than re-parsing the text files each time. ENVIRONMENT
COLUMNS Determines the width (in character cells) of the output if the -w, -columns option is not used. If not set, a width of 72 is assumed. SEE ALSO
dadadodo's upstream website is http://www.jwz.org/dadadodo/. AUTHOR
dadadodo was written by Jamie Zawinski. This manual page was written by Sudhakar Chandrasekharan <thaths@netscape.com>, based on the program's usage message. dadadodo(1)
All times are GMT -4. The time now is 06:20 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy