Awk problem


 
Thread Tools Search this Thread
Top Forums UNIX for Dummies Questions & Answers Awk problem
# 1  
Old 05-02-2011
Awk problem

Hello everyone.

I have a file with this structure :

>[something near random]
[Lots of letters, several lines]
>[something near random]
[Lots of letters, several lines]
>[something near random]
[Lots of letters, several lines]
>[something near random]
[Lots of letters, several lines]
.......

I want to separate each file on this basis :
Title is what is after the ">"
File information is evrything from one > to another >

What I have tried :

Code:
awk 'BEGIN{RS=">";FS="\n";OFS="\n"}{close(f);f=$1}{print>f}' [name of my loooooong file]

And, I get an error : awk: (FILENAME=[file name] FNR=1> fatal: expression for '>' redirection has null string value

Help ?
(Thank you !!!)
# 2  
Old 05-02-2011
Code:
awk 'BEGIN{i=0; f="file"} /^>/{f=sprintf("file_%d,i++ )} {print $0>f}'  [name of my loooooong file]

This will produce a bunch of files: file1, file2.... filen

You really should consider using csplit, it is meant for this. It is possible with awk to exceed your open file limit

Code:
$ -f "file" is the prefix of the split out file set  -n 8 gives 00000000 9999999 files
csplit  -f "file" -n 8  inputbigfilename  '/^>/'

you get file00000000 -> file99999999
This User Gave Thanks to jim mcnamara For This Post:
# 3  
Old 05-03-2011
Thank you for your response.

I didn't know this command, but after reading man, I think it's the best one !
Gonna try this one tomorrow. Thank you !

---------- Post updated 03-05-11 at 03:11 AM ---------- Previous update was 02-05-11 at 02:58 PM ----------

I just found how it work !

here is the code I used :

Code:
csplit -n 4 -f "LAL14.1-" no_space_Sis_110407_amino.fasta '/^>/' '{*}'

I think the '{*}' is to told the csplit to search for my pattern as much as possible.

Thank you, I get my hundreds of files Smilie Smilie


Hello,

I just tried the csplt command :

Code:
csplit -n 4 -f "LAL14.1-" no_space_Sis_110407_amino.fasta '/^>/'

Or

Code:
csplit -n 4 -f "LAL14.1-" no_space_Sis_110407_amino.fasta '/>/'

None of them worked well : It print a "0" meanig everything is okay, but instead of 2700 files, I get only two... One empty, and one with the same information my big file have...

I'm still trying to find the answer with google, but if you can help me I would appreciate Smilie

Thank you !

Last edited by sluvah; 05-03-2011 at 05:21 AM..
# 4  
Old 05-03-2011
Quote:
Originally Posted by sluvah
Code:
awk 'BEGIN{RS=">";FS="\n";OFS="\n"}{close(f);f=$1}{print>f}' [name of my loooooong file]

And, I get an error : awk: (FILENAME=[file name] FNR=1> fatal: expression for '>' redirection has null string value
The error is due to the empty line (appearing at first) that is getting processed, to nullify it use "NR > 1". So, here is the modified code:
Code:
awk 'BEGIN {RS=">"; FS=OFS="\n"} NR > 1 {close(f); f=$1; print > f}' [name of my loooooong file]

---------- Post updated at 03:59 PM ---------- Previous update was at 03:15 PM ----------

Quote:
Originally Posted by sluvah
Thank you for your response.

I didn't know this command, but after reading man, I think it's the best one !
Gonna try this one tomorrow. Thank you !

---------- Post updated 03-05-11 at 03:11 AM ---------- Previous update was 02-05-11 at 02:58 PM ----------

I just found how it work !

here is the code I used :

Code:
csplit -n 4 -f "LAL14.1-" no_space_Sis_110407_amino.fasta '/^>/' '{*}'

I think the '{*}' is to told the csplit to search for my pattern as much as possible.

Thank you, I get my hundreds of files Smilie Smilie


Hello,

I just tried the csplt command :

Code:
csplit -n 4 -f "LAL14.1-" no_space_Sis_110407_amino.fasta '/^>/'

Or

Code:
csplit -n 4 -f "LAL14.1-" no_space_Sis_110407_amino.fasta '/>/'

None of them worked well : It print a "0" meanig everything is okay, but instead of 2700 files, I get only two... One empty, and one with the same information my big file have...

I'm still trying to find the answer with google, but if you can help me I would appreciate Smilie

Thank you !
Code:
sed 's/>/\n&/g' infile | sed 1d > out
csplit -f "f" -n 8 out '/^>/' {*}

This works for me, but I don't know how to suppress the formation of file f00000000 with empty contents!! Smilie
 
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

awk problem

Hi I have two columns and I would like to create a third column based on how many lines away from a value of 1 in column 2, for example I have 1,0 2,0 3,0 4,0 5,0 6,1 7,0 8,0 9,0 10,0 11,1 And I want an output (6 Replies)
Discussion started by: garethsays
6 Replies

2. Shell Programming and Scripting

awk problem

i have an email list in file.txt with comma separated line1 - FIELD1,pippo@gmail.com,darth@gmail.com line2 - FIELD2,pippo@gmail.com,darth@gmail.com,sampei@gmail.com output=(awk -F ',' -v var="$awkvar" '$1==var {print $2,$3,$4}' spreadsheet.txt)but awk delete some letters at the... (8 Replies)
Discussion started by: pasaico
8 Replies

3. Shell Programming and Scripting

awk problem - combining awk statements

i have a datafile that has several lines that look like this: 2,dataflow,Sun Mar 17 16:50:01 2013,1363539001,2990,excelsheet,660,mortar,660,4 using the following command: awk -F, '{$3=strftime("%a %b %d %T %Y,%s",$3)}1' OFS=, $DATAFILE | egrep -v "\-OLDISSUES," | ${AWK} "/${MONTH} ${DAY}... (7 Replies)
Discussion started by: SkySmart
7 Replies

4. UNIX for Dummies Questions & Answers

Little problem with AWK

I thought I had solved this problem but after testing the script I came to realize that it is not doing what I need. So, here it goes again. This is the code: awk '/\>/{F=$2; N=$3; split(FILENAME, A, "."); getline; x = ">"}{print ">" A"-" x++" "F" " N"\n" $0}' This is the input file: ... (5 Replies)
Discussion started by: Xterra
5 Replies

5. Shell Programming and Scripting

awk problem

Find the number of files with sizes > 100KB in /, /bin, /usr, /usr/bin and /usr/sbin directories and output them in a two column format with the name of the directory and the number of files. i tried with awk $>ls -lh | awk '/^-/ && $5 >= 100k {print $8 $5}' but it is not working pls tell... (3 Replies)
Discussion started by: abhikamune
3 Replies

6. Shell Programming and Scripting

Problem with awk awk: program limit exceeded: sprintf buffer size=1020

Hi I have many problems with a script. I have a script that formats a text file but always prints the same error when i try to execute it The code is that: { if (NF==17){ print $0 }else{ fields=NF; all=$0; while... (2 Replies)
Discussion started by: fate
2 Replies

7. Shell Programming and Scripting

Awk problem: How to express the single quote(') by using awk print function

Actually I got a list of file end with *.txt I want to use the same command apply to all the *.txt Thus I try to find out the fastest way to write those same command in a script and then want to let them run automatics. For example: I got the file below: file1.txt file2.txt file3.txt... (4 Replies)
Discussion started by: patrick87
4 Replies

8. Shell Programming and Scripting

problem using awk

Hi there every body I'm new to shell scripting and there is a problem facing me,, please look at the following piece of code: awk ' BEGIN{ FS="<assertion id=\1"; RS="<assertion id=\"2"}/<assertion id=\"1/{print FS$2 > "/home/ds2/test/output.txt"} ' filename all I wanna do is to... (6 Replies)
Discussion started by: senior_ahmed
6 Replies

9. Shell Programming and Scripting

Problem with AWK

Hi All, How can i store a value of the unix command executed in AWK with system command. devise=`cut -c1-3 dvgp.txt` I wrote this command in awk as awk'{ code= sprintf("devise=`cut -c1-3 dvgp.txt`"); system(code); }' Is this correct. can you please suggest me how the code can be... (1 Reply)
Discussion started by: krishna_gnv
1 Replies

10. UNIX for Dummies Questions & Answers

AWK Problem

Hi, I posted something here about this yesterday but I can't seem to find it. I needed help writting a script which would append a file with new lines after every so many charachters. Example: (my original flat file) L60 LETTER OF CREDIT 60 DAYS W00 ON RECEIPT WIRE TRANSFER W30 NET... (12 Replies)
Discussion started by: gseyforth
12 Replies
Login or Register to Ask a Question