Preparing for a move to a new server, I needed to offload somewhat over a gigabyte of newsfeeds that my website collects and that I had been saving on the server. I tarred them and gzipped them into about a dozen smaller files of about 150 MB each. All seemed well. I downloaded them onto my Windows PC. The website was moved (http://schema-root.org).
My plan was to move them back to the new server, strip them from their RSS formats and load the news items into a database. However, in my newbieness I managed to transfer the gzipped files in ASCII mode (both directions!). So they won't unzip now, either on my PC or on the server.
Using:
> gunzip < d200512.tar.gz | tar xvf -
I was able to extract a few percent of the files from the first archive I tried, maybe a hundred out of five thousand or so.
My question is: Would it be possible to get rid of the carriage returns that were inserted into the gzipped files by being FTP'ed in ASCII mode, and get the files back to what they were before I screwed them up? In my innocence, I am imagining that every existing linefeed byte in the original gzipped file (bytes that just happened to have the value of a linefeed) had a carriage return byte inserted before it during the FTP transfer in ASCII mode. And so I am wondering whether there might be some utility somewhere that would strip them back out, and if there were such a utility, whether it would be likely to produce a gzipped file that could be unzipped.
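To make the idea concrete, here is a rough Python sketch of the kind of utility I have in mind. It rests entirely on my guess above, namely that the only damage is a carriage return inserted in front of each linefeed. The function names and the output filename are made up for illustration. If the original data already contained its own CR LF pairs, or if the second transfer stripped bytes rather than adding them, this approach cannot tell the difference and the result will still be corrupt, which is why it checks that the result decompresses before writing anything.

```python
import gzip
import zlib


def strip_inserted_crs(data: bytes) -> bytes:
    """Collapse every CR LF pair back to a bare LF.

    Assumption: each CR LF pair was created by the ASCII-mode transfer.
    A CR LF pair that was genuinely in the original file is collapsed too,
    so the reversal is not guaranteed to be exact.
    """
    return data.replace(b"\r\n", b"\n")


def looks_like_valid_gzip(data: bytes) -> bool:
    """Return True if the whole buffer decompresses without an error.

    gzip.decompress verifies the CRC and length stored in the gzip trailer,
    so success here is a strong sign that the bytes are intact.
    """
    try:
        gzip.decompress(data)
        return True
    except (OSError, EOFError, zlib.error):
        return False


def repair_file(src: str, dst: str) -> bool:
    """Try to repair src and write the result to dst only if it checks out."""
    with open(src, "rb") as f:  # binary mode, so no further translation
        damaged = f.read()
    repaired = strip_inserted_crs(damaged)
    if not looks_like_valid_gzip(repaired):
        return False
    with open(dst, "wb") as f:
        f.write(repaired)
    return True
```

Calling something like repair_file("d200512.tar.gz", "d200512.fixed.tar.gz") would return True only if the stripped copy passes the gzip check. It reads the whole archive into memory, which should be fine at about 150 MB per file.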
Otherwise I lose a ton of newsfeeds.
Thanks for any help.
John