tar and gzip extraction issues


 
# 1  
Old 08-03-2011

Not sure if this is really in the right forum but here goes....

Looking for a way to extract individual compressed files from a compressed tarball WITHOUT tar -zxvf and then recompressing. Basically we need to be able to chunk out an individual compressed file while it still remains compressed.

For whatever reason, when we extract the single file with tar and then recompress it, the file's integrity is destroyed. The archives are so large that we can't just decompress the whole thing and then pick out pieces, so that's out.

We had looked at going in, extracting the bits and writing them elsewhere, but from what I understand about how tar and gzip work, we'd get garbage, because with a tarball the gzip algorithm is predicated on the overall file rather than the pieces. To gunzip the individual pieces would rely upon the initial Huffman double encoded "rosetta stone" that was generated from the overall tarball structure, right? Without that "rosetta stone", we'd not be able to accurately decompress the individual .gz files, and a reverse algorithm wouldn't work because the initial encoding was built from patterns that were present in the whole but may not be present in the individual pieces.

I'm a bit of a n00b, so I just need to check and make sure I've absorbed this all correctly. But, in case I've processed it all incorrectly and if there is a way or a script that can accomplish this, please point me in the right direction. Thanks.
# 2  
Old 08-03-2011
If you don't want to decompress a tar, don't compress it.

But if you want random seek, tar might not be the best anyway -- tar is a streaming format and is read from beginning to end.

I think cpio supports random seek, but I'm not positive. It also has a file size limit, however: it can't hold files larger than 4 gigs apiece.

You could also check out 7zip.
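
On the random seek point: if the tar is left uncompressed, each member's bytes sit at a fixed offset in the file, so one front-to-back scan can record where everything is and later reads can jump straight there. A rough sketch in Python using the standard `tarfile` module; the function names `build_index` and `read_member` are made up for illustration, and it only handles regular, non-sparse files.

```python
import tarfile


def build_index(tar_path):
    """Scan an UNCOMPRESSED tar once; map file name -> (data offset, size)."""
    index = {}
    # "r:" forces plain tar with no compression, so offsets are real file offsets.
    with tarfile.open(tar_path, mode="r:") as tar:
        for member in tar:
            if member.isfile():
                index[member.name] = (member.offset_data, member.size)
    return index


def read_member(tar_path, index, name):
    """Seek directly to one member's data using a previously built index."""
    offset, size = index[name]
    with open(tar_path, "rb") as f:
        f.seek(offset)
        # Reads the whole member into memory; read in chunks for very large files.
        return f.read(size)
```

This does not work on a .tar.gz, because an offset into the tar says nothing about where that data lives inside the gzip stream.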
# 3  
Old 08-03-2011
Unfortunately, I don't get to determine the format. It's not a compressed tarball I made. It is ready-made and I have to make lemonade with it. Otherwise, I'd probably set something else up if it were up to me.
# 4  
Old 08-03-2011
If it's compressed, you have to decompress it.

If you have space, try decompressing it once into a plain .tar and keeping that. That might speed up extraction later.
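
If there is no space for the whole decompressed tar, the outer gzip stream can still be read from the start and thrown away as it goes, writing only the one wanted member to disk. tar stores each member's bytes verbatim, so a member that is itself a .gz file comes out as the same self-contained gzip file that went in, with no recompression step. A minimal sketch with Python's `tarfile` in stream mode; `extract_one` and its parameters are hypothetical names.

```python
import shutil
import tarfile


def extract_one(archive_path, member_name, dest_path):
    """Copy one member out of a .tar.gz without unpacking the rest to disk.

    Returns True if the member was found and written, False otherwise.
    """
    # "r|gz" reads the gzip stream strictly front to back and never seeks.
    with tarfile.open(archive_path, mode="r|gz") as tar:
        for member in tar:
            if member.name == member_name and member.isfile():
                src = tar.extractfile(member)
                with open(dest_path, "wb") as dst:
                    # Copies in chunks, so memory use stays small.
                    shutil.copyfileobj(src, dst)
                return True
    return False
```

Everything before the wanted member still has to be decompressed in memory, so a member near the end of a huge archive will take a while.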
# 5  
Old 08-03-2011
Ugh. Yeah, I was afraid that that was the answer. I guess there's not much I can do on my end at this point but use the process of elimination to determine whether the corruption is creeping in during my work or the others'.
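
One way to run that process of elimination is to checksum a member as it sits inside the tarball and compare that against the checksum of the file after each later step; the first step where the two differ is where the damage happens. A small sketch, assuming SHA-256 is acceptable and using the hypothetical names `member_sha256` and `file_sha256`:

```python
import hashlib
import tarfile


def _sha256_of(fileobj, chunk_size=1 << 20):
    """Hash a file-like object in chunks and return the hex digest."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: fileobj.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def member_sha256(archive_path, member_name):
    """SHA-256 of one member's stored bytes inside a .tar.gz, or None if absent."""
    with tarfile.open(archive_path, mode="r|gz") as tar:
        for member in tar:
            if member.name == member_name and member.isfile():
                return _sha256_of(tar.extractfile(member))
    return None


def file_sha256(path):
    """SHA-256 of an ordinary file on disk."""
    with open(path, "rb") as f:
        return _sha256_of(f)
```

If `member_sha256(...)` already disagrees with what the supplier says the file should be, the problem is upstream of the extraction.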
# 6  
Old 08-03-2011
Hi.

Instead of using the compressed tar file, uncompress and untar the entire file, then compress the individual files, then tar the individual compressed files. That would allow you to extract a file, then uncompress only that file. It will also probably lower the risk of losing everything past a damaged place in the large compressed file. In fact, keeping a directory of the compressed individual files would allow "random access" because they would be available by filename.

The compression savings would probably differ from the original. Experimentation with a subset should allow you to estimate the difference.
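
The repacking described above could be sketched roughly as follows, streaming the old archive so that the full decompressed tar never has to exist on disk. This is only an illustration: `repack` is a made-up name, it skips anything that is not a regular file, and it assumes the members are not already compressed (if they are already .gz files, copying them unchanged into a plain tar would be enough).

```python
import gzip
import shutil
import tarfile
import tempfile


def repack(src_tgz, dst_tar):
    """Turn one big .tar.gz into a plain tar of individually gzipped files."""
    with tarfile.open(src_tgz, mode="r|gz") as src, \
            tarfile.open(dst_tar, mode="w") as dst:
        for member in src:
            if not member.isfile():
                continue  # sketch only: directories and links are skipped
            data = src.extractfile(member)
            with tempfile.TemporaryFile() as tmp:
                # Compress this one file to scratch space so its size is known
                # before the tar header is written.
                with gzip.GzipFile(fileobj=tmp, mode="wb") as gz:
                    shutil.copyfileobj(data, gz)
                info = tarfile.TarInfo(name=member.name + ".gz")
                info.size = tmp.tell()
                info.mtime = member.mtime
                info.mode = member.mode
                tmp.seek(0)
                dst.addfile(info, tmp)
```

Scratch space is needed only for one compressed member at a time.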

Good luck ... cheers, drl
# 7  
Old 08-03-2011
What Operating System and version are you using?
How big is the largest archive (before and after compression)?
How big is the largest file (before and after compression)?

As others suggest, compressing a whole archive is foolish if you need single files, because you cannot extract an individual file without decompressing the stream at least up to that file.

Is fitting lots more disc an option? In general there is no reason nowadays to compress files (because disc space is cheap) unless you need to copy them across a network.