Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Need Shell Script for document searching Post 302445370 by l20N1N on Sunday 15th of August 2010 09:10:30 PM
Old 08-15-2010
Script so far


Quote:
Originally Posted by thegeek
not a shell script... try google desktop, using which will make you to deliver without spending time in designing, coding and testing....!!
I will be looking through files of which its content is mixed with gibberish and chances are the files will be on a mounted drive but that tool will come in handy for other things.

in the meantime, what I have so far is:

Code:
cat *.txt | uniq -c > results.txt

this code takes the contents (even the non readible) of all the txt files in a directory and eliminates duplicates correct?

can this work with other file types such as .doc, .pdf, etc?

how would the code be written so that it only takes the readible content?


Last edited by l20N1N; 08-16-2010 at 05:50 AM..
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Shell Script for searching files with date as filter

Hi , Assume today's date is 10-May-2002. I want to get a list of files which were last modified since 01-May-2002. If I run the script after 5 days, it should still list me the files modified from 01-May-2002 till today. I also plan to pass the date 01-May-2002 as an argument to the shell script... (3 Replies)
Discussion started by: kanakaraj_s
3 Replies

2. Programming

UNIX Shell Script to Create a Document of a PLSQL code.

Hi All, I am supposed to present the documentation for the PLSQL code (PACKAGES, PROCEDURE, FUNCTIONS) of my application. There are sufficient comments in my code. Has anyone written any Shell Script Utility which can parse the PLSQL code and generate some kind of document ( preferrably HTML not... (1 Reply)
Discussion started by: gauravsachan
1 Replies

3. Shell Programming and Scripting

can anyone help with shell script command about searching word with grep command?

i want to search in the current directory all the files that contain one word for example "hello" i want to achieve it with the grep command but not with the grep * (2 Replies)
Discussion started by: aintour
2 Replies

4. Shell Programming and Scripting

Here document and shell scripting

Hi I am unable to spot a error with my here document in the shell script. for sid in $sids do ORACLE_SID=sid ORACLE_HOME=$(grep $sid $ORATAB_LOC |cut -f2 -d':') $ORACLE_HOME/bin/sqlplus -s /nolog <<EOF @$HOME/sql/checkdb-status.sql EOF done I am getting the error... (8 Replies)
Discussion started by: xiamin
8 Replies

5. Shell Programming and Scripting

Shell script for searching a record,copy to a file and then delete it

Hi, I have a requirement in hand: I have a file with millions of records say file 1.I have another file, say file 2 which has 2000 records in it. The requirement is to read file2 , and remove the read record from file 1 and move i to a seperate file, file 3. For eg: Read file 2, get the... (5 Replies)
Discussion started by: kumara2010
5 Replies

6. Shell Programming and Scripting

Searching a delimited Key value pairs in shell script

Hello, I have property file with key value pairs separated by pipe , I am trying to write a script which reads the property file and search and print value of specific key. I tried with Sed, I am successfull. The file is as follows ... (4 Replies)
Discussion started by: ANK
4 Replies

7. Shell Programming and Scripting

Oracle Shell script | here document `EOF' unclosed

Hi folks I m creating script which is give me below error. $ ./function.ksh ./function.ksh: here document `EOF' unclosed Inside the script is #!/bin/ksh export ORACLE_SID=OECDV1 export ORACLE_HOME=/u01/app/oracle/product/10.2.0 export PATH=$ORACLE_HOME/bin:$PATH echo "sql is... (3 Replies)
Discussion started by: tapia
3 Replies

8. UNIX for Dummies Questions & Answers

Help searching for patterns occuring near one another in document

I'm very new to unix and linux, so I apologize if the answer to this question should be obvious.. What I would like to know is, is there a way to search a text document ( opened in less, or some other text viewer) for any two or more patterns that appear near one another in the document? In... (1 Reply)
Discussion started by: Colonel Panic
1 Replies

9. Shell Programming and Scripting

HERE Document in Shell Script

Hi, I have a shell script to install one of our products. It comprises of commands that are specific to the product installation. These commands require user inputs at different stages. To avoid manual feeding of inputs every time, I tried using HERE document. it is like- #! /usr/bin ... (1 Reply)
Discussion started by: nishant.kansal@
1 Replies

10. UNIX for Beginners Questions & Answers

UNIX Shell Script to Remove MongoDB Document-Based on Many inputs

Here I am in a position to write a Unix Shell script(.sh) to remove MongoDB collection documents. I know how to remove based on a condition like below and it works for me. eval 'db.Collection.remove({TimeStamp:{$lte: "'$var'"}}) But I need to change the remove statement based on a new parameter... (1 Reply)
Discussion started by: senthilmadhanT
1 Replies
PUBTAL(1)						      General Commands Manual							 PUBTAL(1)

NAME
PubTal -- Generate a web site from content and templates, and upload it to its destination SYNOPSIS
updateSite [options] site.config [content-dir | content-type ] [...] uploadSite [options] site.config [destination-dir | destination-file ] [...] DESCRIPTION
This manual page documents briefly the updateSite and uploadSite commands from the PubTal package This manual page was written for the Debian distribution because the original program does not have a manual page. Instead, it has docu- mentation in HTML format; see below. updateSite generates HTML pages starting from a series of content files and templates written in the ZPT language. The command takes a minimum of one parameter, the location of the website configuration file. The updateSite command first reads in the configuration file and so locates the content, template, and destination directories (defaulting to content, template, and dest respectively). If only one parameter is passed to updateSite, it will then find each file under the content directory and build the corresponding output files. If more than one parameter is passed, then each extra parameter is taken as content to build. If the parameter is a directory under the content directory, then any content files in that directory will be built recursively. If the parameter is a file in the content directory (or one of its children), then just that file will be built. Take for example: updateSite new-site/site.config new-site/content/index.txt new-site/content/examples/ This will build the index.txt file into index.html and all content files under content/examples/ will be built into their corresponding output files. uploadSite uploads the generated pages. This takes a minimum of one parameter, the location of the website configuration file. The uploadSite command first reads in the configuration file and so locates the destination directories. If only one parameter is passed to uploadSite, it will then find each file under the destination directory that needs to be uploaded. The logic used to determine whether a file needs to be uploaded is explained in the Uploading section. If more than one parameter is passed, then each extra parameter is taken as destination to upload. If the parameter is a directory under the destination directory then any files in that directory will be considered for upload recursively. If the parameter is a file in the destination directory (or one of its children), then just that file will be considered for upload. Take for example: uploadSite new-site/site.config new-site/dest/index.html new-site/dest/examples/ This will examine the index.html file and all files under dest/examples/ when determining what files to upload. OPTIONS
A summary of options is included below. For a complete description, see the HTML documentation. Options for updateSite -h --help Prints out the help description for the command. -a --all Builds all classes of content, not just content in the "normal" class. --class classParams Builds all classes given as supplied in a comma separated list. --logging Turns on logging. --logfile name Logs to the file "name" (defaults to updateSite.log). --debug Turns on debug logging; implies --logging. --debug-simpletal Turns on debug logging of SimpleTAL, implies --logging. Options for uploadSite -h --help Prints out the help description for the command. -a --all Checks all files are uploaded, not just PubTal generated ones. --force Uploads files even if PubTal thinks they are up-to-date. --dry-run Prints out what would have been done, but takes no action. --logging Turns on logging. --logfile name Logs to the file "name" (defaults to updateSite.log). --debug Turns on debug logging; implies --logging. SEE ALSO
The programs are documented fully by HTML documentation available in /usr/share/doc/pubtal/html. AUTHOR
This manual page was written by Luis Rodrigo Gallardo Cruz rodrigo@nul-unu.com for the Debian system (but may be used by others). Permis- sion is granted to copy, distribute and/or modify this document under the terms of the GNU General Public License, Version 2 or any later version published by the Free Software Foundation. On Debian systems, the complete text of the GNU General Public License can be found in /usr/share/common-licenses/GPL. PUBTAL(1)
All times are GMT -4. The time now is 04:38 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy