I think your wording got you some interesting answers.
split is not normally affected by spaces, to answer your question. In your use it only pays attention to \n (newline) characters or control characters like vertical tab. It may be a data issue, like embedded control characters.
Try using od on the offending line in the original source file. Look for junk characters.
HI! All
iam using Split command to split a large .txt file in to smaller files,
The syntax iam using
split -25000 Product.txt
iam getting four output files but not in .txt format but in some other format ,
when i checked the properties the Type of the output files is Type
can any... (7 Replies)
Is there an option or a way with the split command to rename the partitioned files with a counter. For example, can the files testaa, testab, testac be renamed to test1, test2, test3 from the split command without explicilty renaming files.
Thanks,
- CB (3 Replies)
Hi All,
I have a parent file with 10 million records. I want to split the parent file in to child files.
Each child file contains 5000 records.
I am using the following command for splitting:
split -5000 parentfile.txt childfile.1
It will split the Parent file as... (7 Replies)
./myapp | split -b 10m -d -a 1 - "myappLog"
here split command is reading the input from the output of myapp and it will write the text in to file where in each file size is 10MB and it will create upto 10 files.
I have observed split is flushing the data for every 4096 bytes. if my... (7 Replies)
Hi all,
i have a .ksh script which is, among other stuff, splitting a file and saveing the filenames into variables for further processing:
# file split before ftp and put result filenames into variables
if ]; then
awk '{close(f);f=$1}{sub("^","");print > f".TXT"}' $_ftpfile
set B*.TXT... (0 Replies)
hi all,
i have problem with my script in unix ...i have script with split -d (--numeric-suffixes) in linux its working but in solaris machine the option -d isn't have
so how to i can change split -d (this output) will same in output solaris can i change with awk and how do that
thx before (2 Replies)
Hi I have a sequence which looks like this
# PH01000000
PH01000000G0240 P.he_genemodel_v1.0 CDS 120721 121773 . - . ID=PH01000000G0240.CDS;Parent=PH01000000G0240
PH01000001G0190 P.he_genemodel_v1.0 mRA 136867 137309 . - . ID=PH01000001G0190.mRNA;Parent=PH01000001G0190... (7 Replies)
Hi All,
I want to split the file after size gets above 100kb.
So I am using below command.
split -b 100kb File.txt Test
But after first file, my record is breaking as in middle of the record, size of file is getting above 100kb. So after splitting half record is in one file and half... (1 Reply)
Hi All,
I have a txt file which I would like to partition into 2 separate output files. I would like to partition the odd or even groups of 4 lines from the txt file. So I would like lines 1-4 to go to file1, and lines 5-8 to go to file2, and so on until the whole txt file is divided into two... (1 Reply)
Discussion started by: landrjos
1 Replies
LEARN ABOUT OPENSOLARIS
col
col(1) User Commands col(1)NAME
col - reverse line-feeds filter
SYNOPSIS
col [-bfpx]
DESCRIPTION
The col utility reads from the standard input and writes to the standard output. It performs the line overlays implied by reverse line-
feeds, and by forward and reverse half-line-feeds. Unless -x is used, all blank characters in the input will be converted to tab characters
wherever possible. col is particularly useful for filtering multi-column output made with the .rt command of nroff(1) and output resulting
from use of the tbl(1) preprocessor.
The ASCII control characters SO and SI are assumed by col to start and end text in an alternative character set. The character set to which
each input character belongs is remembered, and on output SI and SO characters are generated as appropriate to ensure that each character
is written in the correct character set.
On input, the only control characters accepted are space, backspace, tab, carriage-return and newline characters, SI, SO, VT, reverse line-
feed, forward half-line-feed and reverse half-line-feed. The VT character is an alternative form of full reverse line-feed, included for
compatibility with some earlier programs of this type. The only other characters to be copied to the output are those that are printable.
The ASCII codes for the control functions and line-motion sequences mentioned above are as given in the table below. ESC stands for the
ASCII escape character, with the octal code 033; ESC- means a sequence of two characters, ESC followed by the character x.
reverse line-feed ESC-7
reverse half-line-feed ESC-8
forward half-line-feed ESC-9
vertical-tab (VT) 013
start-of-text (SO) 016
end-of-text (SI) 017
OPTIONS -b Assume that the output device in use is not capable of backspacing. In this case, if two or more characters are to appear in the
same place, only the last one read will be output.
-f Although col accepts half-line motions in its input, it normally does not emit them on output. Instead, text that would appear
between lines is moved to the next lower full-line boundary. This treatment can be suppressed by the -f (fine) option; in this case,
the output from col may contain forward half-line-feeds (ESC-9), but will still never contain either kind of reverse line motion.
-p Normally, col will ignore any escape sequences unknown to it that are found in its input; the -p option may be used to cause col to
output these sequences as regular characters, subject to overprinting from reverse line motions. The use of this option is highly
discouraged unless the user is fully aware of the textual position of the escape sequences.
-x Prevent col from converting blank characters to tab characters on output wherever possible. Tab stops are considered to be at each
column position n such that n modulo 8 equals 1.
ENVIRONMENT VARIABLES
See environ(5) for descriptions of the following environment variables that affect the execution of col: LC_CTYPE, LC_MESSAGES, and
NLSPATH.
EXIT STATUS
The following error values are returned:
0 Successful completion.
>0 An error occurred.
ATTRIBUTES
See attributes(5) for descriptions of the following attributes:
+-----------------------------+-----------------------------+
| ATTRIBUTE TYPE | ATTRIBUTE VALUE |
+-----------------------------+-----------------------------+
|Availability |SUNWesu |
|CSI |enabled |
+-----------------------------+-----------------------------+
SEE ALSO nroff(1), tbl(1), ascii(5), attributes(5), environ(5)NOTES
The input format accepted by col matches the output produced by nroff with either the -T37 or -Tlp options. Use -T37 (and the -f option of
col) if the ultimate disposition of the output of col will be a device that can interpret half-line motions, and -Tlp otherwise.
col cannot back up more than 128 lines or handle more than 800 characters per line.
Local vertical motions that would result in backing up over the first line of the document are ignored. As a result, the first line must
not have any superscripts.
SunOS 5.11 1 Feb 1995 col(1)