text manipulation


 
# 1  
Old 08-15-2011
text manipulation

Hi all,
I need to do some text processing.
I have a file:
file1.txt
>>>>>>>>>>>>
30 2 23 some
30 2 22 text
30 2 21 xyz
30 2 20 ttttt
30 2 19 ttttt-1
30 2 18 xryz
30 2 17 xyzr
30 2 16 xy111z
30 2 15 xanyyz
30 2 14 xzz
30 2 13 xyy
30 2 0 zzz-w
50 3 25 zzz-w
50 3 12 productw
50 3 10 xyz20
50 3 9 eeeee
50 3 8 rrrr-1-77
50 3 7 producti
50 3 5 xyz
50 3 4 xyz40
50 3 3 xyz30
50 3 2 xyz
50 3 1 asdf-2
50 3 21 xasdf
50 3 22 xy30
50 3 23 product-2
50 5 24 asdf-2
50 5 4 ttttt-1-77
50 3 19 ttttt-77
50 3 18 xyz77
50 3 17 xyz
50 3 15 prod-cc
60 1 2 aaa
60 1 5 bbb
60 1 10 ccc
>>>>>>>>>>>>>>>>>>

The required output is the same lines sorted by the 3rd column within each (1st column, 2nd column) pair, where the 3rd column must run from 0 to 50 for every pair.

In other words: for each pair of 1st and 2nd column values, sort the 3rd column from 0 to 50. If a value between 0 and 50 is missing from the 3rd column, add a line with the same 1st and 2nd column values, the missing value in the 3rd column, and "nothing" in the 4th column.


Example: for the pair (60, 1) in file1 above, we have:
60 1 2 aaa
60 1 5 bbb
60 1 10 ccc

The output for the (60, 1) pair alone should be:
60 1 0 nothing
60 1 1 nothing
60 1 2 aaa
60 1 3 nothing
60 1 4 nothing
60 1 5 bbb
......
60 1 6-9 nothing
.....
60 1 10 ccc
60 1 11 nothing
.....
....
60 1 50 nothing

Please help.
# 2  
Old 08-15-2011
This is for sorting...

Code:
 
$ sort -n -k 1,1 -k 2,2 -k 3,3 test
30 2 0 zzz-w
30 2 13 xyy
30 2 14 xzz
30 2 15 xanyyz
30 2 16 xy111z
30 2 17 xyzr
30 2 18 xryz
30 2 19 ttttt-1
30 2 20 ttttt
30 2 21 xyz
30 2 22 text
30 2 23 some
50 3 1 asdf-2
50 3 2 xyz
50 3 3 xyz30
50 3 4 xyz40
50 3 5 xyz
50 3 7 producti
50 3 8 rrrr-1-77
50 3 9 eeeee
50 3 10 xyz20
50 3 12 productw
50 3 15 prod-cc
50 3 17 xyz
50 3 18 xyz77
50 3 19 ttttt-77
50 3 21 xasdf
50 3 22 xy30
50 3 23 product-2
50 3 25 zzz-w
50 5 4 ttttt-1-77
50 5 24 asdf-2
60 1 2 aaa
60 1 5 bbb
60 1 10 ccc

---------- Post updated at 01:16 PM ---------- Previous update was at 01:03 PM ----------

Try this (not tested, as I don't have the seq command on Solaris). The range is 0 to 50, as requested, and the grep pattern is anchored to the start of the line:

Code:
 
nawk '{print $1,$2}' test | sort | uniq | while read line; do for i in `seq 0 50`; do grep -w "^$line $i" test || echo "$line $i nothing"; done; done | sort -n -k 1,1 -k 2,2 -k 3,3

---------- Post updated at 01:16 PM ---------- Previous update was at 01:16 PM ----------

test is the filename
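
Since seq is not available on every system (the post above notes it is missing on Solaris), the same loop can be sketched with a plain shell counter. This is only a rough sketch, assuming a POSIX shell and a POSIX awk (nawk on Solaris), whitespace-separated input, and the 0 to 50 range from the original request. The function name fill_gaps is made up for illustration and does not come from the thread.

```shell
#!/bin/sh
# fill_gaps FILE: hypothetical helper, not from the thread.
# For every distinct (col1, col2) pair, print one row for each
# col3 value from 0 to 50, using "nothing" as column 4 where the
# input has no matching row.
fill_gaps() {
    file=$1
    # One line per distinct pair, in numeric order.
    awk '{print $1, $2}' "$file" | sort -n -k 1,1 -k 2,2 -u |
    while read -r c1 c2; do
        i=0
        while [ "$i" -le 50 ]; do
            # Compare the first three fields exactly, so "30 2 2"
            # can never match a "30 2 23" row.
            awk -v a="$c1" -v b="$c2" -v c="$i" '
                $1 == a && $2 == b && $3 == c { print $1, $2, $3, $4; found = 1 }
                END { exit !found }
            ' "$file" || echo "$c1 $c2 $i nothing"
            i=$((i + 1))
        done
    done
}
```

Calling fill_gaps test prints the filled-in rows already grouped by pair and ordered by the 3rd column, so no final sort is needed. Like the pipeline above, it rescans the file once per value, which is fine for small inputs but slow for large ones.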
# 3  
Old 08-15-2011
Another homework...
# 4  
Old 08-15-2011
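Try this. The input is sorted first so that all lines for a given pair of columns 1 and 2 are contiguous, and the loop starts at 0 so the "30 2 0" line is kept:
Code:
sort -n -k 1,1 -k 2,2 file1 | awk -v n=nothing 'function f(){for(i=0;i<=50;i++)print p,i,(i in a)?a[i]:n;delete a}($1 OFS $2)!=p&&NR>1{f()}{p=$1 OFS $2;a[$3]=$4}END{f()}' > file2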
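
The same gap-filling idea can also be written in a longer, commented form that does not depend on the order of the input lines. This is a sketch under a few assumptions: a POSIX awk (nawk or gawk), whitespace-separated input, and at most one line per (column 1, column 2, column 3) combination. The wrapper name fill_range and the variable names lo, hi and filler are made up for illustration.

```shell
#!/bin/sh
# fill_range FILE: hypothetical wrapper, not from the thread.
# Reads the whole file once, then prints rows lo..hi for each pair.
fill_range() {
    awk -v lo=0 -v hi=50 -v filler=nothing '
        {
            pair = $1 OFS $2
            if (!(pair in seen)) {      # remember pairs in first-seen order
                seen[pair] = 1
                order[++npairs] = pair
            }
            val[pair, $3] = $4          # column 4, keyed by pair and column 3
        }
        END {
            for (k = 1; k <= npairs; k++) {
                pair = order[k]
                for (i = lo; i <= hi; i++)
                    print pair, i, ((pair, i) in val) ? val[pair, i] : filler
            }
        }
    ' "$1"
}
```

Usage would be fill_range file1 > file2. Pairs come out in the order they first appear in the input. If numeric order of the first two columns matters, the output can be piped through sort -n -k 1,1 -k 2,2 -k 3,3 as in the earlier post.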

# 5  
Old 08-16-2011
Thank you very much, both ways did the job successfully.

Last edited by unlx; 08-16-2011 at 06:15 AM..