Sponsored Content
Top Forums Shell Programming and Scripting split file depending on content Post 302145458 by bakunin on Wednesday 14th of November 2007 08:09:56 AM
Old 11-14-2007
In this case the last field would be matched by a regexp (replace "<tab>" by a literal tab character):

sed 's/.*<tab>//' file

The reason why this works is because Unix regexps are "greedy": always the longest possible match is used. If you have several tabs in one input line ".*<tab>" will match all possible characters (including tabs!) followed by a tab char, which will be the rightmost one. Be sure to have no trailing tabs at the end of the line, though, as in this case the regexp would match the whole line.

To extract the rightmpost field from the line and split the file linewise into different files use something like:

Code:
cat file | while read line ; do
     lastfield="$(print $line | sed 's/.*<tab>//')"
     case $lastfield in
          abc)
               print - "$line" >> file1
               ;;

           def)
               print - "$line" >> file2
               ;;

           ghi)
               print - "$line" >> file3
               ;;

           *)
               print -u2 "i do not know where to put $line"
               ;;

     esac
done

bakunin

Last edited by bakunin; 11-14-2007 at 09:19 AM..
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Split file into multiple files depending upon first 4 digits

Hi All, I have a file like below: 1016D"ddd","343","1299" 1016D"ddd","3564","1299" 1016D"ddd","3297","1393" 1016D"ddd","32989","1527" 1016D"ddd","346498","1652" 2312D"ddd","3269","1652" 2312D"ddd","328","1652" 2312D"ddd","2224","2100" 3444D"ddd","252","2100" 3444D"ddd","2619","2100"... (4 Replies)
Discussion started by: deepakgang
4 Replies

2. Shell Programming and Scripting

split file content

Hi All; I have input file like below name char(3) number number(3) inputfile namenumber xyz123abc509kai330 aca203 ald390afa000als303 I wanted to split like below:- output like this:- xyz123 abc509 kai330 aca203 ald390 (6 Replies)
Discussion started by: Jairaj
6 Replies

3. Shell Programming and Scripting

How to split a data file into separate files with the file names depending upon a column's value?

Hi, I have a data file xyz.dat similar to the one given below, 2345|98|809||x|969|0 2345|98|809||y|0|537 2345|97|809||x|544|0 2345|97|809||y|0|651 9685|98|809||x|321|0 9685|98|809||y|0|357 9685|98|709||x|687|0 9685|98|709||y|0|234 2315|98|809||x|564|0 2315|98|809||y|0|537... (2 Replies)
Discussion started by: nithins007
2 Replies

4. Shell Programming and Scripting

Split the file based on the content

Arun kumar something somehting Enterting in to the line . . . . Some text text Finshing the sentence Some other text . . . . Again something somehting Enterting in to the line . . . . . . Again text text Finshing the sentence (6 Replies)
Discussion started by: arukuku
6 Replies

5. Shell Programming and Scripting

split file content into specific folders

Hi I have a large text file and I want to split its content into multiple flies. this large file contains several blocks of codes separated by a comment line for each block. this comment line represents a directory path So, when separate these blocks each into a separate file, This output... (7 Replies)
Discussion started by: turki_00
7 Replies

6. Shell Programming and Scripting

Parsing files depending on the content

I am trying to parse files kkapjil kkpcjil kkexjil ...which have autosys job names. The objective is to parse each file...do an autorep -j <job name > -q and write it as output with a line for condition at the end of the each job. The problem I am facing is with the box jobs...as autorep -j <box... (0 Replies)
Discussion started by: w020637
0 Replies

7. Shell Programming and Scripting

How to Split File to 2 depending on condition?

Hi , cat myfile.txt ! 3100.2.0.5 ! 3100.2.22.4 ! 3100.2.30.33 ! 3100.2.4.1 ! ! 3100.2.0.5 ! 3100.2.22.4 ! 3100.2.22.11 ! 3100.2.4.1 ! ! 3100.2.0.5 ! 3100.2.2.50 ! 3100.2.22.11 ! 3100.2.4.1 ! ! 3100.2.0.5 ! 3100.2.22.4 ! 3100.2.30.33 ! 3100.2.4.1 ! ! 3100.2.0.5 ! 3100.2.22.4 !... (6 Replies)
Discussion started by: OTNA
6 Replies

8. Shell Programming and Scripting

Split file depending on Column Value

Hi , my file look likes below , cat file.csv 12/09/2014,50,5,0,300 12/09/2014, ,5,0,300 12/09/2014,50,,,300 i need to split file , the first one contains values (2nd column is 50 , 3rd and fourth column is null ) the second file contains all others firstfile ... (2 Replies)
Discussion started by: ubaisalih
2 Replies

9. Shell Programming and Scripting

Split a file in more files based on score content

Dear All, I have the following file tabulated: ID distanceTSS score 8434 571269 10 10122 393912 9 7652 6 10 4863 1451 9 8419 39 2 9363 564 21 9333 7714 22 9638 8334 9 1638 1231 11 10701 918 1000 6587 32056 111 What I would like to do is the following, create 100 new files based... (5 Replies)
Discussion started by: paolo.kunder
5 Replies

10. Shell Programming and Scripting

Split a non delimited file into columns depending on user input

I would like some advice on some code. I want to write a small script that will take an input file of this format 111222233334444555666661112222AAAA 2222333445556612323244455445454545 2334556345643534505435345353453453 (and so on) It will be called as : script inputfile X (where X is... (5 Replies)
Discussion started by: onlyforbopi
5 Replies
newform(1)						      General Commands Manual							newform(1)

NAME
newform - change or reformat a text file SYNOPSIS
[file]... DESCRIPTION
reads lines from the named files, or standard input if no input file is named, and reproduces the lines on standard output. Lines are reformatted in accordance with command line options in effect. Command line options can appear in any order, can be repeated, and can be intermingled with the optional files. Command line options are processed in the order specified. This means that option sequences such as yield results different from Options are applied to all files on the command line. Options recognizes the following options: Same as except characters are appended to the end of a line. Truncate n characters from the beginning of the line when the line length is greater than the effective line length (see The default is to truncate the number of characters necessary to obtain the effective line length. The default value is used when with no n is used. This option can be used to delete the sequence numbers from a COBOL program as follows: The must be used to set the effective line length shorter than any existing line in the file so that the option is activated. Change the prefix/append character to k. The default character for k is a space. Same as except that characters are truncated from the end of the line. Write the tab specification format line on the standard output before any other lines are output. The tab specification format line which is printed will correspond to the format specified in the option. If no option is specified, the line which is printed contains the default specification of Input tab specification: expands tabs to spaces, according to the tab specifications given. The tabspec recognizes all tab specification forms described in tabs(1). In addition, tabspec can be in which assumes that the tab specification is to be found in the first line read from the standard input (see fspec(4)). If no tabspec is given, tabspec defaults to A tabspec of expects no tabs; if any are found, they are treated as Set the effective line length to n characters. If n is not entered, defaults to 72. The default line length without the option is 80 characters. Note that tabs and backspaces are treated as single characters (use to expand tabs to spaces). Output tab specification: replaces spaces with tabs, according to the tab specifications given. The tab specifications are the same as for If no tabspec is given, tabspec defaults to A tabspec of means that no spaces will be converted to tabs on output. Prefix n characters (see to the beginning of a line when the line length is less than the effective line length. The default is to prefix the number of characters necessary to obtain the effective line length. Shear off leading characters on each line up to the first tab and place up to 8 of the sheared characters at the end of the line. If more than 8 characters (not counting the first tab) are sheared, the eighth character is replaced by a and any characters to the right of it are discarded. The first tab is always discarded. An error message and program exit occur if this option is used on a file without a tab on each line. The characters sheared off are saved internally until all other options specified are applied to that line. The characters are then added at the end of the processed line. For example, to convert a file with leading digits, one or more tabs, and text on each line, to a file beginning with the text, all tabs after the first expanded to spaces, padded with spaces out to column 72 (or truncated to column 72), and the leading digits placed starting at column 73, the command would be: RETURN VALUE
returns one of the following values: No errors encountered. An error occurred. DIAGNOSTICS
All diagnostics are fatal. was called with a bad option. There was no tab on one line. Self-explanatory. A line exceeds 512 characters after being expanded in the internal work buffer. A tab specification is incorrectly formatted, or specified tab stops are not ascending. A tabspec read from a file (or standard input) must not contain a tabspec referencing another file (or standard input). WARNINGS
normally only keeps track of physical characters; however, for the and options, keeps track of backspaces in order to line up tabs in the appropriate logical columns. does not prompt the user if a tabspec is to be read from the standard input (by use of or If the option is used, and the last option specified was and was preceded by either a or a the tab specification format line will be incor- rect. SEE ALSO
csplit(1), tabs(1), fspec(4). newform(1)
All times are GMT -4. The time now is 10:09 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy