Script to Compare a large number of files.


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting Script to Compare a large number of files.
# 1  
Old 10-29-2008
Script to Compare a large number of files.

I have a large Filesystem on an AIX server and another one on a Red Hat box. I have syncd the two filesystems using rsysnc.

What Im looking for is a script that would compare to the two filesystems to make sure the bits match up and the number of files match up.

its around 2.8 million files


Any easy way to do this?

Zipp
San Antonio, Tx
# 2  
Old 10-29-2008
rsync can do that, man rsync ( --checksum , --dry-run)
# 3  
Old 10-29-2008
Would use a one-way hash on every file off each of the boxes > toalocalfile
Then run a diff on on the two comparison files.

cksum or md5sum maybe like this:
Code:
find . /path/to/file -type f -exec cksum {} \; >  RHbox
ssh <..connect to solaris.........>  . /path/to/file -type f -exec cksum {} \; > SOLbox
sort -o RHbox rhbox
sort -o SOLbox SOLbox
diff RHbox SOLbox

We use something like this for a check on migrating filesystems. For 2.8M files this is going to take a loooong time.
# 4  
Old 10-29-2008
If rsync can do it let rsync play the game.
# 5  
Old 10-30-2008
I used rsync once to move the files over but the files were corrupted.

It moved a file over with the right permissions and even the right size but when I went to open the file ... there wasnt anything in it.
# 6  
Old 10-31-2008
The best option is to copy files using scp and use rsync to sync the two systems.
You should run small tests using small files/directory and if everything work as expected move to the big stuff.
And don't forget to read and understand the rsync manual. Base on rsync FAQ you need 80MB Ram for 800k files... for your 2.8M files you will need 320+MB Ram.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Script to compare files in 2 folders and delete the large file

Hello, my first thread here. I've been searching and fiddling around for about a week and I cannot find a solution.:confused: I have been converting all of my home videos to HEVC and sometimes the files end up smaller and sometimes they don't. I am currently comparing all the video files... (5 Replies)
Discussion started by: Josh52180
5 Replies

2. Shell Programming and Scripting

Removing large number of temp files

Hi All, I am having a situation now to delete a huge number of temp files created during run times approx. 16700+ files. We have never imagined that we will get this this much big list of files during run time. It worked fine for lesser no of files in the list. But when list is huge we are... (7 Replies)
Discussion started by: mad man
7 Replies

3. Shell Programming and Scripting

Sftp large number of files

Want to sftp large number of files ... approx 150 files will come to server every minute. (AIX box) Also need make sure file has been sftped successfully... Please let me know : 1. What is the best / faster way to transfer files? 2. should I use batch option -b so that connectivity will be... (3 Replies)
Discussion started by: vegasluxor
3 Replies

4. UNIX for Dummies Questions & Answers

Rename a large number of files in subdirectories

Hi, I have a large number of subdirectories (>200), and in each of these directories there is a file with a name like "opp1234.dat". I'd like to know how I could change the names of these files to say "out.dat" in all these subdirectories in one go. Thanks! (5 Replies)
Discussion started by: lost.identity
5 Replies

5. UNIX for Dummies Questions & Answers

Delete large number of files

Hi. I need to delete a large number of files listed in a txt file. There are over 90000 files in the list. Some of the directory names and some of the file names do have spaces in them. In the file, each line is a full path to a file: /path/to/the files/file1 /path/to/some other/files/file 2... (4 Replies)
Discussion started by: inakajin
4 Replies

6. Shell Programming and Scripting

Concatenation of a large number of files

Hellow i have a large number of files that i want to concatenate to one. these files start with the word 'VOICE_' for example VOICE_0000000000 VOICE_1223o23u0 VOICE_934934927349 I use the following code: cat /ODS/prepaid/CDR_FLOW/MEDIATION/VOICE_* >> /ODS/prepaid/CDR_FLOW/WORK/VOICE ... (10 Replies)
Discussion started by: chriss_58
10 Replies

7. Shell Programming and Scripting

Need help combining large number of text files

Hi, i have more than 1000 data files(.txt) like this first file format: 178.83 554.545 179.21 80.392 second file: 178.83 990.909 179.21 90.196 etc. I want to combine them to the following format: 178.83,554.545,990.909,... 179.21,80.392,90.196,... (7 Replies)
Discussion started by: mr_monocyte
7 Replies

8. Shell Programming and Scripting

Compare 2 Large files

Hi, i want to compare 2 large files both around 300 mb. They are text files having only one column of numbers. I wish to get the unique values in file2. I tried using diff but it gave an error of memory exhausted. Both files are sorted and i am running on a 1gb ram core 2 duo 2ghz. Help!! Thanks... (4 Replies)
Discussion started by: bezudar
4 Replies

9. Shell Programming and Scripting

script to splite large file to number of small files

Dear All, Could you please help me to split a file contain around 240,000,000 line to 4 files all equally likely , note that we need to maintain that the end of each file should started by start flage (MSISDN) and ended by end flag (End), also the number of the line between the... (10 Replies)
Discussion started by: ahmed.gad
10 Replies

10. Shell Programming and Scripting

moving large number of files

I have a task to move more than 35000 files every two hours, from the same directory to another directory based on a file that has the list of filenames I tried the following logics (1) find . -name \*.dat > list for i in `cat list` do mv $i test/ done (2) cat list|xargs -i mv "{}"... (7 Replies)
Discussion started by: bryan
7 Replies
Login or Register to Ask a Question