09-12-2017
Hello
kraljic,
I have a few to questions pose in response first:-
- What have you tried so far?
- What output/errors do you get?
- What OS and version are you using?
- What are your preferred tools? (C, shell, perl, awk, etc.)
- What logical process have you considered? (to help steer us to follow what you are trying to achieve)
Most importantly,
What have you tried so far?
There are probably many ways to achieve most tasks, so giving us an idea of your style and thoughts will help us guide you to an answer most suitable to you so you can adjust it to suit your needs in future.
Did
split help, it's probably the right tool for this, unless you want to do other processing at the same time. With
split you can break up your file based on the number of bytes, lines, etc. and define output filenames and suffix length to make it suitable. It will not easily take the literal name
file1 and generate a
file2, but you can force it to be something sensible and then rename the output as needed.
If you generate 5 files of output and want to be separating out just the content of the first, it is straightforward to rename that one to the desired name and either delete the remainder or
cat them back together (in the right order) to overwrite the original file, thereby removing the lines you have just extracted. You would then need to tidy the temporary files up too to save space.
We're all here to learn and getting the relevant information will help us all.
Have a go with
split as my learned member
RudiC suggests and let us know if it helps or you get stuck. I'm sure we can get a working solution that you can support and/or reuse.
kind regards,
Robin
This User Gave Thanks to rbatte1 For This Post:
10 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
Hello!
Firts of all, I'm sorry for my English.
My problem:
I have text file with few Form Feed symbols (FF, ASCII code =12) inside (for example - some report, consists of some pages for
printing).
I want to split this text by pages - each page (until FF symbol)
in single file.
I... (2 Replies)
Discussion started by: ranri
2 Replies
2. Shell Programming and Scripting
Can an expert kindly write an efficient Linux ksh script that will split a large 2 GB text file into two?
Here is a couple of sample record from that text file:
"field1","field2","field3",11,22,33,44
"TG","field2b","field3b",1,2,3,4
The above rows are delimited by commas.
This script is to... (2 Replies)
Discussion started by: ihot
2 Replies
3. Shell Programming and Scripting
Hi all
I have written Perl script to swap the strings in the second a third column from a text file.
My input file format is :
the|empty|the|det lake|empty|lake|conj_and was|empty|was|auxpass drained|empty|drained|conj_and birds|empty|bird|s|nn
The expected output file format is... (11 Replies)
Discussion started by: my_Perl
11 Replies
4. Shell Programming and Scripting
Hi
I am using shell script where I am calling SQLPLUS and executing one PL/SQL block.
This PL/SQL block generates the spool file for example splfile.txt.
After successful generation of spool file I use nawk to split this file into 2 different files. Till here no issues.
nawk... (1 Reply)
Discussion started by: shekharjchandra
1 Replies
5. Shell Programming and Scripting
Hi,
I have a fixed width text file without any header row. One of the columns contains a date in YYYYMMDD format.
If the original file contains 3 dates, I want my shell script to split the file into 3 small files with data for each date.
I am a newbie and need help doing this. (14 Replies)
Discussion started by: bhanja_trinanja
14 Replies
6. Shell Programming and Scripting
Hi all,
I am very new to shell scripting and some help is greatly appreciated.
I have 10 column based text files, i would like to split each of them into 6 files ; the 1st one having columns 1, 2 ,3,4 , the second one having columns 1,2,8,9 etc.
Is there a way I could get 60 files out my... (3 Replies)
Discussion started by: shreymuk
3 Replies
7. Shell Programming and Scripting
chr1 412573 . A C 2758.77 . AC=2;AF=1.00;AN=2;DP=71;Dels=0.00;FS=0.000;HaplotypeScore=2.8822;MLEAC=2;MLEAF=1.00;MQ=58.36;MQ0=0;QD=38.86;resource.EFF=INTERGENIC(MODIFIER||||||||) GT:AD:DP:GQ:PL 1/1:0,71:71:99:2787,214,0 GATKSAM
chr1 602567 rs21953190 A ... (9 Replies)
Discussion started by: mehar
9 Replies
8. Shell Programming and Scripting
I have a text file with entries like
1186
5556
90844
7873
7722
12
7890.6
78.52
6679
3455
9867
1127
5642
..N so many records like this.
I want to split this file into multiple files like cluster1.txt, cluster2.txt, cluster3.txt, ..... clusterN.txt. (4 Replies)
Discussion started by: sammy777
4 Replies
9. Shell Programming and Scripting
Hi,
I have a text file (attached the sample). I have also, attached the way the way the files need to be split.
We get this file, that will either have 24 Jurisdictions, or will miss some and retain some.
Like in the attached sample file, there are only Jurisdictions 03,11,14,15, 20 and 30.... (3 Replies)
Discussion started by: ebsus
3 Replies
10. Shell Programming and Scripting
Hi, all.
I have an input file. I would like to generate 3 types of output files.
Input:
LG10_PM_map_19_LEnd_1000560
LG10_PM_map_6-1_27101856
LG10_PM_map_71_REnd_20597718
LG12_PM_map_5_chr_118419232
LG13_PM_map_121_24341052
LG14_PM_1a_456799
LG1_MM_scf_5a_opt_abc_9029993
... (5 Replies)
Discussion started by: huiyee1
5 Replies
paste(1) General Commands Manual paste(1)
NAME
paste - merge same lines of several files or subsequent lines of one file
SYNOPSIS
file1 file2 ...
list file1 file2 ...
list] file1 file2 ...
DESCRIPTION
In the first two forms, concatenates corresponding lines of the given input files file1, file2, etc. It treats each file as a column or
columns in a table and pastes them together horizontally (parallel merging). In other words, it is the horizontal counterpart of cat(1)
which concatenates vertically; i.e., one file after the other. In the option form above, replaces the function of an older command with
the same name by combining subsequent lines of the input file (serial merging). In all cases, lines are glued together with the tab char-
acter, or with characters from an optionally specified list. Output is to standard output, so can be used as the start of a pipe, or as a
filter if is used instead of a file name.
recognizes the following options and command-line arguments:
Without this option, the new-line characters
of all but the last file (or last line in case of the option) are replaced by a tab character. This option allows replac-
ing the tab character by one or more alternate characters (see below).
list One or more characters immediately following replace the default tab as the line concatenation character. The list is
used circularly; i.e., when exhausted, it is reused. In parallel merging (that is, no option), the lines from the last
file are always terminated with a new-line character, not from the list. The list can contain the special escape
sequences: (new-line), (tab), (backslash), and (empty string, not a null character). Quoting may be necessary if charac-
ters have special meaning to the shell. (For example, to get one backslash, use ).
Merge subsequent lines rather than one from each input file.
Use tab for concatenation, unless a list is specified with the option. Regardless of the list, the very last character of
the file is forced to be a new-line.
Can be used in place of any file name
to read a line from the standard input (there is no prompting).
EXTERNAL INFLUENCES
Environment Variables
determines the locale for the interpretation of text as single- and/or multi-byte characters.
determines the language in which messages are displayed.
If or is not specified in the environment or is set to the empty string, the value of is used as a default for each unspecified or empty
variable. If is not specified or is set to the empty string, a default of "C" (see lang(5)) is used instead of
If any internationalization variable contains an invalid setting, behaves as if all internationalization variables are set to "C". See
environ(5).
International Code Set Support
Single- and multi-byte character code sets are supported.
RETURN VALUE
These commands return the following values upon completion:
Completed successfully.
An error occurred.
EXAMPLES
List directory in one column:
List directory in four columns
Combine pairs of lines into lines
Notes
works similarly, but creates extra blanks, tabs and new-lines for a nice page layout.
DIAGNOSTICS
Except for the option, no more than - 3 input files can be specified (see limits(5)).
AUTHOR
was developed by OSF and HP.
SEE ALSO
cut(1), grep(1), pr(1).
STANDARDS CONFORMANCE
paste(1)