Large file data handling issue


 
Thread Tools Search this Thread
Top Forums UNIX for Dummies Questions & Answers Large file data handling issue
# 1  
Old 11-14-2012
Large file data handling issue

I have a single record large file, semicolon ';' and pipe '|' separated. I am doing a vi on the file. It is throwing an error "File to long"

I need to actually remove the last | symbol from this file.
Code:
sed -e 's/\|*$//' filename

is working fine for small files. But not working on this big file. The file size is "614077"

Last edited by Scrutinizer; 11-14-2012 at 04:40 PM.. Reason: code tags
# 2  
Old 11-14-2012
614077 what? 600K isn't very large at all, sed should be able to handle it fine.

Even if it's a large file, sed will still work with it, it will just take longer.

What might be a problem are really really long lines...
# 3  
Old 11-14-2012
hmm, yes this is a 600K file and as already said it is a single record. How can i handle the file manupulation?
# 4  
Old 11-14-2012
Quote:
Originally Posted by Gurkamal83
I need to actually remove the last | symbol from this file.
Code:
sed -e 's/\|*$//' filename

is working fine for small files. But not working on this big file. The file size is "614077"
What exactly do you mean by "sed is not working on this big file", What messages do you get when you press enter?
Paste the first and last 100 bytes of the file.
Also try it this way:

Code:
sed 's/|$//' filename

# 5  
Old 11-14-2012
Thanks for the reply.. but your command also didnt work.
With the small files i can see record values and the record getting trimmed off with the trailling '|'.
When I use the same command with my actual file there is nothing that is coming on screen even if I echo it into a newfile nothing with get populated in the new file.

Starting bytes
001;04.5.1;2012-10-25 08:47:41;ABCDE||3;1351169231;1351169261;;;1351169256;1351169256;1351169261;;;;;;;;;;;00:00:05;

Ending bytes.
255;8;1;incoming;;;;;2;0;;;0;;10;0;0;0;0;;abc.com;x1-6-00-1c-fb-2f-8e-14.XXXX.xc.xr.com;;;;;0;000000;;Singh Gurkamal;1;;;this is a very big file;;;||1000;2012-10-25 08:49:03|
# 6  
Old 11-14-2012
Is this "file" actually a "stream" with no record terminators?
# 7  
Old 11-14-2012
yes.. I am getting from where u r coming.. do u mean the line feed??
 
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Large File masking incorrectly happening Ç delimeter issue

The OS version is Red Hat Enterprise Linux Server release 6.10 I have a script to mask some columns with **** in a data file which is delimeted with Ç , I am using awk for the masking , when I try to mask a small file the awk works fine and masks the required column , but when the file is... (6 Replies)
Discussion started by: LinuxUser8092
6 Replies

2. Shell Programming and Scripting

Output large volume of data to CSV file

I have a program that output the ownership and permission on each directory and file on the server to a csv file. I am getting error message when I run the program. The program is not outputting to the csv file. Error: the file access permissions do not allow the specified action cannot... (2 Replies)
Discussion started by: dellanicholson
2 Replies

3. UNIX for Dummies Questions & Answers

File handling issue

Hi All, I am running into an issue. I have a very big file. Wants to split it in smaller chunks. This file has multiple header/ trailers. Also, between each header/trailer there are records. Number of records in each header trailer combination can vary. Also, headers can start with... (3 Replies)
Discussion started by: Gurkamal83
3 Replies

4. Shell Programming and Scripting

UNIX file handling issue

I have a huge file semicolon( ; ) separated records are Pipe(|) delimited. e.g abc;def;ghi|jkl;mno;pqr|123;456;789 I need to replace the 50th field(semicolon separated) of each record with 9006. The 50th field can have no value e.g. ;; Can someone help me with the appropriate command. (3 Replies)
Discussion started by: Gurkamal83
3 Replies

5. Red Hat

Advice regarding filesystems handling large number of files

Hi All, I have a CentOS operating system installed. I work with really huge number of files which are not only huge in number but some of them really huge in size. Minimum number of files could be 1 million to 2 million in one directory itself. Some of the files are even several Gigabytes in... (2 Replies)
Discussion started by: shoaibjameel123
2 Replies

6. Shell Programming and Scripting

Severe performance issue while 'grep'ing on large volume of data

Background ------------- The Unix flavor can be any amongst Solaris, AIX, HP-UX and Linux. I have below 2 flat files. File-1 ------ Contains 50,000 rows with 2 fields in each row, separated by pipe. Row structure is like Object_Id|Object_Name, as following: 111|XXX 222|YYY 333|ZZZ ... (6 Replies)
Discussion started by: Souvik
6 Replies

7. Shell Programming and Scripting

UNIX File handling -Issue in reading a file

I have been doing automation of daily check activity for a server, i have been using sqls to retrive the data and while loop for reading the data from the file for several activities. BUT i got a show stopper the below one.. where the data is getting store in $temp_file, but not being read by while... (1 Reply)
Discussion started by: KuldeepSinghTCS
1 Replies

8. Shell Programming and Scripting

Extract data from large file 80+ million records

Hello, I have got one file with more than 120+ million records(35 GB in size). I have to extract some relevant data from file based on some parameter and generate other output file. What will be the besat and fastest way to extract the ne file. sample file format :--... (2 Replies)
Discussion started by: learner16s
2 Replies

9. Shell Programming and Scripting

Performance issue in UNIX while generating .dat file from large text file

Hello Gurus, We are facing some performance issue in UNIX. If someone had faced such kind of issue in past please provide your suggestions on this . Problem Definition: /Few of load processes of our Finance Application are facing issue in UNIX when they uses a shell script having below... (19 Replies)
Discussion started by: KRAMA
19 Replies

10. HP-UX

Need to split a large data file using a Unix script

Greetings all: I am still new to Unix environment and I need help with the following requirement. I have a large sequential file sorted on a field (say store#) that is being split into several smaller files, one for each store. That means if there are 500 stores, there will be 500 files. This... (1 Reply)
Discussion started by: SAIK
1 Replies
Login or Register to Ask a Question