I'm backing up a folder from one NAS to another using a unix script built around cp. It's a lot of files and takes several days to complete. Most of the files don't change from week to week. Is there a command that would be quicker?
Also note, the backup needs to be ready to use in an instant, not in an archive or anything that would need to be extracted first.
Last edited by Stellaman1977; 06-07-2018 at 03:10 PM..
Reason: Added one more sentence
We're not going to improve "several days" into "instant" no matter the means. Further, any program which creates files largely does the same thing as cp.
Parallelizing is usually a non-starter: the bottleneck you're already hitting would only get worse.
Speeding it up, then, is a matter of improving or bypassing the connection to your NAS.
The holdup is very likely protocol latency multiplied by thousands of tiny files. If the packing is handled locally on your NAS, it will be much faster. You say "no archive", but that's still my answer. You don't have to wait for it to transfer, or even store it; after all, the whole point of a UNIX tarball is that you can extract it on the fly. You can send it over a network pipe of some sort and extract it while it's still being transferred.
Something like:
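A minimal sketch of the idea, using hypothetical local /tmp paths so it can be tried safely. On a real NAS the left side of the pipe would run remotely, e.g. ssh user@source-nas 'tar -cf - -C /volume1/share .':

```shell
# Hypothetical demo paths standing in for the NAS mounts.
mkdir -p /tmp/tar_src/sub /tmp/tar_dst
echo hello > /tmp/tar_src/sub/file.txt

# Pack on the fly and list the stream at the far end (test mode):
tar -cf - -C /tmp/tar_src . | tar -tf -

# Same pipe, but actually extracting at the destination:
tar -cf - -C /tmp/tar_src . | tar -xf - -C /tmp/tar_dst
```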
Change tar -tf to tar -xf once you've tested and see that it does what you want.
Last edited by Corona688; 06-07-2018 at 07:49 PM..
I think I get the question as you've described it quite clearly. Different people will have different solutions, but this is what I would do.
(Obviously, if the original copy takes several hours, users could be modifying files during that time, so you need some way to cope with that.)
Do the first copy using find piped to cpio and create a timestamp of the event:
NOTE: The <destination directory> MUST already exist before the command is run, otherwise it will fail, so create it manually if need be.
After the first copy, select only files that have changed since the last copy by using the -newer switch on find:
Note that we create the timestamp (timenow) before we start to copy because users might modify files whilst the copy is executing.
This way files that have not changed since before the very start of the last copy will not be copied again. The incremental copies will therefore be much quicker than a full copy. If the job fails to complete then the timelastcopy will not get updated so these files will get selected again on the next run.
Hope that helps, and I hope I've explained it clearly enough. If not, post back with your questions.
This is clearly NOT an alternative command, so it may not meet your needs.
What file system? ZFS, EXT4...?
Some file systems support snapshots, so you backup from the snapshot. If you create a snapshot at time T, then run your backup against the time T snap at T + 10 days, you still get what was there at time T. No corruption.
You can also clone a file system to a different name, filesysA -> filesysB, then backup filesysB at your leisure.
You can get snapshot- and clone-capable filesystems for Linux and Solaris. I do not know about HP-UX or AIX.
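As an illustration only, on ZFS the idea above might look like this (pool and dataset names are hypothetical, and these commands need a real ZFS pool and root privileges):

```shell
# Take a point-in-time snapshot at time T (hypothetical pool/dataset names):
zfs snapshot tank/data@backup_T

# Back up from the read-only snapshot at your leisure; the source cannot
# change underneath you, so no corruption:
rsync -a /tank/data/.zfs/snapshot/backup_T/ /mnt/nas2/backup/

# Or clone the snapshot to a separate, writable filesystem and back that up:
zfs clone tank/data@backup_T tank/data_backup
```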