Yes, making a pass across your 12,000 record data file for each entry in the list isn't very efficient. First thing I'll point out is that your for loop will not be listing the contents of the list, but the file name. You'd need something like this:
This reads the contents of list1.txt placing each line into the variable i. Still not efficient, but I wanted to point out the problem with your code.
Using awk, you can make one pass across each file. Way more efficient in terms of numer of i/o operations, but not as efficient as writing a programme to do the same thing in C.
You could make this more efficient by tracking most recently used files and allowing awk to keep some number (100) open and closing the rest. The programme would be executing far less opens/closes on the output files. You'd probably not have any issue keeping 212 of them open, but if your target list grows, or your system has smallish quotas on open files, you could have issues which is why I suggested closing the file after each write. Another, and easier, way would be to write a single output file of the form <filename> <text> as an intermediate file. Once the initial processing is finished, the intermediate file could be sorted and a single pass made to write each separate file. This has the advantage of opening/closing each output file just once and thus avoids the efficiency problems in my example above.
The need for the delete stems from some awk implementations which create an entry in the hash when the test is made (when targets[foo] does not exist). Without the delete, the hash will eventually contain an entry for every word in the list2.txt file rather than just the ones from the first list. These extra entries all have the value 0, so the programme works, but the memory usage is unnecessarily large. The delete statement prevents awk from keeping entries in the target hash that have a zero value, but it adds to the execution time.
Last edited by agama; 07-31-2011 at 05:03 PM..
Reason: additional thought about output
hi all,
i would like to search in a directory. all files they were found shoul be opend and looked about a keyword. if keyword is found i want to see the name of the file. i've rtfm of find and have a command like this :
find /etc -exec cat \{}\ | grep KEYWORD
but don't work, and :
find... (4 Replies)
I've following sed command working fine -
sed '/search_pattern1/ !s/pattern1/pattern2/" file
Now, I want to search two patterns - search_pattern1 and search_pattern2 .
How can put these into above sed statement ?
Thanks in advance. (12 Replies)
Hi All,
I have almost 1000+ files and I want to search specific pattern. Looking forwarded your input. Pls note that need to ignore words in between /* */
Search for: "insert into xyz" (Which procedure contain all 3).
Expected output:
procedure test1
procedure test2
procedure test3
File... (12 Replies)
Hi Friends,
I have been trying to write the script since morning and reached some where now. but i think i am stuck in the final step. please help
I want to search the strings below in red in the be be searched in the directories below. How can i do that in my shell script.
Thanks
Adi
... (8 Replies)
Hello,
I want to test if i find the word CACCIA AND idlck in a file, i have to print a message Ok.
For that , i need to user a awk command with a && logical.
Can you help me ?
:confused:
### CACCIA: DEBUT ###
if $(grep -wqi "$2" /etc/passwd); then
&& rm /etc/security/.idlck
... (3 Replies)
Hello,
I try to print out with sed or awk the 21.18 between "S3 Temperature" and "GrdC" in a text file.
The blanks are all real blanks no tabs.
Only the two first chars from temperture are required. So the "21" i need as output.
S3 Temperatur 21.18 GrdC No Alarm
... (3 Replies)
Hi,
Need your help for this scripting issue I have. I am not really good at this, so seeking your help.
I have a file looking similar to this:
Hello, i am human and name=ABCD.
How are you?
Hello, i am human and name=PQRS.
I am good.
Hello, i am human and name=ABCD.
Good bye.
Hello, i... (12 Replies)
Hy there all. Im new here. Olso new to terminal & bash, but it seams that for me it's much easyer to undarsatnd scripts than an actual programming language as c or anyother languare for that matter.
S-o here is one og my home works s-o to speak.
Write a shell script which:
-only works as a... (1 Reply)
hi,
i need to replace all words in any quote position and then need to change the words inside the file thousand of raw.
textfile data :
"Ninguno","Confirma","JuicioABC"
"JuicioCOMP","Recurso","JuicioABC"
"JuicioDELL","Nulidad","Nosino"
"Solidade","JuicioEUR","Segundo"
need... (1 Reply)
Discussion started by: benjietambling
1 Replies
LEARN ABOUT SUSE
create_text_search_dictionary
CREATE TEXT SEARCH DICTIONARY(7) SQL Commands CREATE TEXT SEARCH DICTIONARY(7)NAME
CREATE TEXT SEARCH DICTIONARY - define a new text search dictionary
SYNOPSIS
CREATE TEXT SEARCH DICTIONARY name (
TEMPLATE = template
[, option = value [, ... ]]
)
DESCRIPTION
CREATE TEXT SEARCH DICTIONARY creates a new text search dictionary. A text search dictionary specifies a way of recognizing interesting or
uninteresting words for searching. A dictionary depends on a text search template, which specifies the functions that actually perform the
work. Typically the dictionary provides some options that control the detailed behavior of the template's functions.
If a schema name is given then the text search dictionary is created in the specified schema. Otherwise it is created in the current
schema.
The user who defines a text search dictionary becomes its owner.
Refer to in the documentation for further information.
PARAMETERS
name The name of the text search dictionary to be created. The name can be schema-qualified.
template
The name of the text search template that will define the basic behavior of this dictionary.
option The name of a template-specific option to be set for this dictionary.
value The value to use for a template-specific option. If the value is not a simple identifier or number, it must be quoted (but you can
always quote it, if you wish).
The options can appear in any order.
EXAMPLES
The following example command creates a Snowball-based dictionary with a nonstandard list of stop words.
CREATE TEXT SEARCH DICTIONARY my_russian (
template = snowball,
language = russian,
stopwords = myrussian
);
COMPATIBILITY
There is no CREATE TEXT SEARCH DICTIONARY statement in the SQL standard.
SEE ALSO
ALTER TEXT SEARCH DICTIONARY [alter_text_search_dictionary(7)], DROP TEXT SEARCH DICTIONARY [drop_text_search_dictionary(7)]
SQL - Language Statements 2010-05-14 CREATE TEXT SEARCH DICTIONARY(7)