Text processing using awk Post: 302909392

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Processing a text file

A file contains one name per line, such as: john doe jack bruce nancy smith sam riley When I 'cat' the file, the white space is treated as a new line. For example list=`(cat /path/to/file.txt)` for items in $list do echo $items done I get: john doe

2. UNIX for Dummies Questions & Answers

text file processing

Hello! There is a text file, that contains hierarchy of menues, like: Aaaaa->Bbbbb Aaaaa->Cccc Aaaaa-> {spaces} Ddddd (it means that the full path is Aaaaa->Cccc->Ddddd ) Aaaaa-> {more spaces} Eeeee (it means that the full path is Aaaaa->Cccc->Ddddd->Eeeee ) Fffffff->Ggggg...

3. Shell Programming and Scripting

text processing ( sed/awk)

hi.. I have a file having record on in 1 line.... I want every 400 characters in a new line... means in 1st line 1-400 in 2nd line - 401-800 etc pl help.

4. Shell Programming and Scripting

awk, perl Script for processing a single line text file

I need a script to process a huge single line text file: The sample of the text is: "forward_inline_item": "Inline", "options_region_Australia": "Australia", "server_event_err_msg": "There was an error attempting to save", "Token": "Yes", "family": "Family","pwd_login_tab": "Enter Your...

5. Shell Programming and Scripting

Awk text processing

Hi Very much appreciate if somebody could give me a clue .. I undestand that it could be done with awk but have a limited experience. I have the following text in the file 1 909 YES NO 2 500 No NO . ... 1 ...

6. Programming

awk processing / Shell Script Processing to remove columns text file

Hello, I extracted a list of files in a directory with the command ls . However this is not my computer, so the ls functionality has been revamped so that it gives the filesizes in front like this : This is the output of ls command : I stored the output in a file filelist 1.1M...

7. Shell Programming and Scripting

Text columns processing using awk

P { margin-bottom: 0.25cm; line-height: 120%; }CODE.cjk { font-family: "WenQuanYi Micro Hei",monospace; }CODE.ctl { font-family: "Lohit Hindi",monospace; }A:link { } I'm trying to build an awk statement to print from a file (file1): A 1,2,3 * A 4,5,6 ** B 1 ...

8. Shell Programming and Scripting

Help with text processing

I have an Input file which has a series of lines(which could vary) followed by two blank lines and then another series of lines(Could be any number of lines) followed by two blank lines and then repeats. I need to use filters to convert the following input file(which is an example) to an output...

9. Shell Programming and Scripting

Text processing

Hi, Need an advise on $ cat test.txt START field1 field2 field3 field4 field5 field6 END 12345|6|1|2|3|4|111|119 67890|6|1|3|8|9|112|000 $

10. Shell Programming and Scripting

awk for text processing

Hi,my file is in this format ", \"symbol\": \"Rbm38\" } ]" I want to convert it to a more user readable format _id pubmed text symbol 67196 18667844 Overexpression of UBE2T in NIH3T3 cells significantly promoted colony formation in mouse cell cultures Ube2t 56190 21764855 ...

LEARN ABOUT DEBIAN

tabmerge

TABMERGE(1p)						User Contributed Perl Documentation					      TABMERGE(1p)

NAME

       tabmerge - unify delimited files on common fields

SYNOPSIS

	 tabmerge [action] [options] file1 file2 [...]

       Actions:

	 --min		      Take only fields present in all files [DEFAULT]
	 --max		      Take all fields present
	 -f|--fields=f1[,f2]  Take only the fields mentioned in the
			      comma-separated list

       Options:

	 -l|--list	      List available fields
	 --fs=x 	      Use "x" as the field separator
			      (default is tab "	")
	 --rs=x 	      Use "x" as the record separator
			      (default is newline "
")
	 -s|--sort=f1[,f2]    Sort data ASCII-betically on field(s)
	 --stdout	      Print data in original delimited format
			      (i.e., not in a table format)

	 --help 	      Show brief help and quit
	 --man		      Show full documentation

DESCRIPTION

       This program merges the fields -- not the rows -- of delimited text files.  That is, if several files are almost but not quite entirely
       unlike each other in their structure (in their field names, numbers or orders), this script allows you to easily unify the files into one
       file with all the same fields.  The output can be based on fields as determined by the three "action" flags.

       For the following examples, consider three files that contain the following fields:

	 +------------+---------------------------------+
	 | File       | Fields				|
	 +------------+---------------------------------+
	 | merge1.tab | name, type, position		|
	 | merge2.tab | name, type, position, lod_score |
	 | merge3.tab | name, position			|
	 +------------+---------------------------------+

       To list all available fields in the files and the number of times they are present:

	 $ tabmerge --list merge*
	 +-----------+-------------------+
	 | Field     | No. Times Present |
	 +-----------+-------------------+
	 | lod_score | 1		 |
	 | name      | 3		 |
	 | position  | 3		 |
	 | type      | 2		 |
	 +-----------+-------------------+

       To merge the files on the minimum overlapping fields:

	 $ tabmerge merge*
	 +----------+----------+
	 | name     | position |
	 +----------+----------+
	 | RM104    | 2.30     |
	 | RM105    | 4.5      |
	 | TX5509   | 10.4     |
	 | UU189    | 19.0     |
	 | Xpsm122  | 3.3      |
	 | Xpsr9556 | 4.5      |
	 | DRTL     | 2.30     |
	 | ALTX     | 4.5      |
	 | DWRF     | 10.4     |
	 +----------+----------+

       To merge the files and include all the fields:

	 $ tabmerge --max merge*
	 +-----------+----------+----------+--------+
	 | lod_score | name	| position | type   |
	 +-----------+----------+----------+--------+
	 |	     | RM104	| 2.30	   | RFLP   |
	 |	     | RM105	| 4.5	   | RFLP   |
	 |	     | TX5509	| 10.4	   | AFLP   |
	 | 2.4	     | UU189	| 19.0	   | SSR    |
	 | 1.2	     | Xpsm122	| 3.3	   | Marker |
	 | 1.2	     | Xpsr9556 | 4.5	   | Marker |
	 |	     | DRTL	| 2.30	   |	    |
	 |	     | ALTX	| 4.5	   |	    |
	 |	     | DWRF	| 10.4	   |	    |
	 +-----------+----------+----------+--------+

       To merge and extract just the "name" and "type" fields:

	 $ tabmerge -f name,type merge*
	 +----------+--------+
	 | name     | type   |
	 +----------+--------+
	 | RM104    | RFLP   |
	 | RM105    | RFLP   |
	 | TX5509   | AFLP   |
	 | UU189    | SSR    |
	 | Xpsm122  | Marker |
	 | Xpsr9556 | Marker |
	 | DRTL     |	     |
	 | ALTX     |	     |
	 | DWRF     |	     |
	 +----------+--------+

       To merge the files on just the "name" and "lod_score" fields and sort on the name:

	 $ tabmerge -f name,lod_score -s name merge*
	 +----------+-----------+
	 | name     | lod_score |
	 +----------+-----------+
	 | ALTX     |		|
	 | DRTL     |		|
	 | DWRF     |		|
	 | RM104    |		|
	 | RM105    |		|
	 | TX5509   |		|
	 | UU189    | 2.4	|
	 | Xpsm122  | 1.2	|
	 | Xpsr9556 | 1.2	|
	 +----------+-----------+

       To do the same but mimic the original tab-delimited input:

	 $ tabmerge -f name,lod_score -s name --stdout merge*
	 name	 lod_score
	 ALTX
	 DRTL
	 DWRF
	 RM104
	 RM105
	 TX5509
	 UU189	 2.4
	 Xpsm122 1.2
	 Xpsr9556	 1.2

       Why would you want to do this?  Suppose you have several delimited text files with nearly the same structure and want to create just one
       file from them, but the fields may be in a different order in each file and/or some files may contain more or fewer fields than others.
       (As far-fetched as it may seem, it happens to the author more than he'd like.)

SEE ALSO

       o   Text::RecordParser

       o   Text::TabularDisplay

AUTHOR

       Ken Youens-Clark <kclark@cpan.org>.

LICENSE AND COPYRIGHT

       Copyright (C) 2006-10 Ken Youens-Clark.	All rights reserved.

       This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by
       the Free Software Foundation; version 2.

       This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of
       MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU General Public License for more details.

perl v5.10.1							    2010-07-26							      TABMERGE(1p)

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Processing a text file

Discussion started by: TheCrunge

2. UNIX for Dummies Questions & Answers

text file processing

Discussion started by: alias47

3. Shell Programming and Scripting

text processing ( sed/awk)

Discussion started by: clx

4. Shell Programming and Scripting

awk, perl Script for processing a single line text file

Discussion started by: hmsadiq