Split 1 column into numerous columns based on patterns
Hi,
I have a text file 'Item_List.txt' containing only 1 column. This column lists different products, each separated by the same generic string header "NEW PRODUCT, VERSION 1.1". After this the name of the product is given, then a delimiter string "PRODUCT FIELD", and then the name of the field itself. After the field name comes the data which in this example only has 1 entry per field but will have more.
My desired output file is as follows;
Basically I require to list each product with its relevant field columns ordered from left to right. I should note there is no limit on the number of fields each product has.
I have tried using csplit, awk, sed, cut & paste but with no luck. I apologize for not inserting my code as what i have does not work and i didn't want to confuse things.
I'm not expecting the answer, but even just a pointer on to how best go about this task. Even a rough structure would be good and i could add my own code to that.
Thanks for your response! I ran your code on the file and got the following output;
This is very close but for the 2nd product only 2 out of its 3 fields are printed. (the 1st product only had 2 fields). Each product may have numerous fields, sorry if i was vague explaining this. I've had a good look through the code and i'm not sure why only 2 fields are printed.
Thanks
Last edited by Franklin52; 10-12-2015 at 10:32 AM..
Reason: Please use code tags
Thanks for your response! I ran your code on the file and got the following output;
This is very close but for the 2nd product only 2 out of its 3 fields are printed. (the 1st product only had 2 fields). Each product may have numerous fields, sorry if i was vague explaining this. I've had a good look through the code and i'm not sure why only 2 fields are printed.
Thanks
The only way that I can see that you would get that output from SriniShoo's script would be if the 2nd line in your input file that you showed us containing the text:
was misspelled or did not appear at start of a line.
What operating system and version of awk are you using?
Also, in your 1st post in this thread you said:
Quote:
After the field name comes the data which in this example only has 1 entry per field but will have more.
None of the suggestions presented so far have addressed this, and I'm not sure what you mean by it. Please show us sample input (in CODE tags) and corresponding desired sample output (in CODE tags) so we can see what you are trying to do.
In your 1st post, you also said:
Quote:
I apologize for not inserting my code as what i have does not work and i didn't want to confuse things.
Please don't feel that way. Showing us what you tried (and the output you got from what you tried) (both in CODE tags) helps us understand what you're thinking and gives us a better chance of understanding some minor point in shell scripting that is causing your scripts to fail.
Hi,
I am trying to split the following output into two columns, where each column has Source: Destination:
OUTPUT TO FILTER
$ tshark -r Capture_without_mtr.pcap -V | awk '/ (Source|Destination): /' | more
Source: x.x.x.x
Destination: x.x.x.x
Source:... (2 Replies)
Hi all,
Newbie here, so please bear over with my stupid question :)
I have used far too long time today on figuring this out, so I hope that someone here can help me move on.
I have some annotation data for a transcriptome where I want to split a column containing NCBI accession IDs into a... (7 Replies)
Hello :)
I am in this situation:
Input: two tab-delimited files, `File1` and `File2`. `File2` (`$2`) has to be parsed by patterns found in `File1` (`$1`).
Expected output: tab-delimited file, `File3`. `File3` has to contain the same rows as `File2`, plus the corresponding value in... (5 Replies)
Hi,
I have a similar input format-
A_1 2
B_0 4
A_1 1
B_2 5
A_4 1
and looking to print in this output format with headers. can you suggest in awk?awk because i am doing some pattern matching from parent file to print column 1 of my input using awk already.Thanks!
letter number_of_letters... (5 Replies)
i have file1.txt
asdas|csada|130310|0423|A1|canberra
sdasd|sfdsf|130426|2328|A1|sydney
Expected output : on eaceh third and fourth colum, split into each two characters
asdas|csada|13|03|10|04|23|A1|canberra
sdasd|sfdsf|13|04|26|23|28|A1|sydney (10 Replies)
Hi,
I have a file sample_1.txt (300k rows) which has data like below:
* Also each record is around 64k bytes
11|1|abc|102553|125589|64k bytes of data
10|2|def|123452|123356|......
13|2|geh|144351|121123|...
25|4|fgh|165250|118890|..
14|1|abc|186149|116657|......... (6 Replies)
Here is my source, i have million lines like this on a file.
disp0201.php?poc=4060&roc=1&ps=R&ooc=13&mjv=6&mov=5&rel=5&bod=155&oxi=2&omj=5&ozn=1&dav=20&cd=&daz=& drc=&mo=&sid=&lang=EN&loc=JPN
I want to split this into columns in order to load in database, anything starts with"&mjv=6" as first... (13 Replies)
hey guys...
Im looking to do the following:
1
2
3
4
5
6
7
8
9
Change to:
1 4 7
2 5 8
3 6 9
Did use | perl -lpe'$\=$.%3?$":"\n"' , but it doesnt give me the matrix i want. (3 Replies)
Have a column "address" which is combination of city, region and postal code like.
Format is : city<comma><space>region<space>postal code
abc, xyz 123456
All these three city, region and postal code are not mandatory. There can be any one of the above. In that case a nell... (2 Replies)