10-21-2015
If you want to find one of 10k items in a large file, there's no straightforward way to avoid comparing each item against each line in the file. What you can do is break when found.
You could also try a binary search algorithm, which only works if the data being searched is sorted.
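
To illustrate the two suggestions, here is a minimal sketch in Python rather than shell. It assumes the file fits in memory as a list of lines, and the function names `first_match` and `contains_sorted` are made up for this example. The binary search variant additionally assumes the lines are sorted and that you are looking for an exact whole-line match.

```python
from bisect import bisect_left


def first_match(lines, items):
    """Linear scan: return (line_number, item) for the first line that
    contains any of the items, stopping as soon as one is found."""
    for lineno, line in enumerate(lines, start=1):
        for item in items:
            if item in line:
                return lineno, item  # break when found
    return None


def contains_sorted(sorted_lines, key):
    """Binary search: True if key equals one of the lines.
    Assumes sorted_lines is already sorted."""
    i = bisect_left(sorted_lines, key)
    return i < len(sorted_lines) and sorted_lines[i] == key


if __name__ == "__main__":
    lines = ["alpha", "beta gamma", "delta"]
    print(first_match(lines, ["gamma", "delta"]))
    print(contains_sorted(sorted(lines), "delta"))
```

The linear scan still compares items against lines, but returns at the first hit instead of reading the rest of the file. The binary search needs roughly log2(N) comparisons per lookup, at the cost of sorting the data first.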