Sponsored Content
Top Forums Shell Programming and Scripting Split strings based on length Post 302333435 by summer_cherry on Monday 13th of July 2009 05:33:38 AM
Old 07-13-2009
Code:
while(<DATA>){
	my @tmp=split;
	for(my $i=1;$i<=4;$i++){
		$arr[$i]=substr($tmp[0],0,$i);
		$arr[$i+4]=substr($tmp[0],-$i);
	}
	
	print $tmp[0]," ";
	print join " ", @arr;
	print " ",$tmp[1],"\n";
}
__DATA__
International NOUN 
Corp. NOUN
's POS
Tulsa NOUN

 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Need find a file based length

Can some please help me? Want to find files over 35 characters in length? I am running HPUX. Would it be possible with find? Thanks in advance (8 Replies)
Discussion started by: J_ang
8 Replies

2. Shell Programming and Scripting

how to sort strings by length?

I'm trying to find the longest word in /usr/share/dict/words The first thing I can think of is to sort the content by length then it would be easy to find out, but then i realize theres no option of sort to sort by length. Could you guys please give me some help?:confused: (7 Replies)
Discussion started by: rockbike
7 Replies

3. Shell Programming and Scripting

remove strings of lowercase characters (with minimum length requirement)

Hi all, I want to delete all lowercase characters from my file, but only strings of length 7 and more. For example, how can I go from: JHGEFigeIGDUIirfyfiyhgfoiyfKJHGuioyrDHG To: JHGEFigeIGDUIKJHGuioyrDHG There should be a trick to add to sed 's///g', but I can't figure it out.... (2 Replies)
Discussion started by: elbuzzo
2 Replies

4. Shell Programming and Scripting

Replace strings based on position and length?

Suppose i have a file which contains thousands of records. e.g adjgmptjadmwpgjmwmd i need to replace the string from 3rd to 8th position using awk script in entire file. And also the positions will be passed as parameter. (3 Replies)
Discussion started by: laknar
3 Replies

5. UNIX for Dummies Questions & Answers

Delete strings in file1 based on the list of strings in file2

Hello guys, should be a very easy questn for you: I need to delete strings in file1 based on the list of strings in file2. like file2: word1_word2_ word3_word5_ word3_word4_ word6_word7_ file1: word1_word2_otherwords..,word3_word5_others... (7 Replies)
Discussion started by: roussine
7 Replies

6. Shell Programming and Scripting

Split strings

Hello I started to learn bash about 1 or 2 weeks, please help me. I have about 200000 strings like these: ATGCCAGGGGAGCCCAGAAGGTAAAACTTGATCTGAAATGTATGTTTATATATAATTTAGGTAATCAATTGGCATGTGAA and I need to split each letter to get: A T G C C A G G G G A G C C C A G A A G G T A A A A C T T... (9 Replies)
Discussion started by: geparada88
9 Replies

7. Shell Programming and Scripting

Split a fixed length file bases on last occurence of string

Hi, I need to split a file based on last occurece of a string. PFB the explanation I have a file in following format aaaaaaaaaaaaaaaaaaaaaaaaaaa bbbbbbbbbbbbbbbbbbbbbbbbbbb ccccccccccccccccccccccccccc ddddddddddddddddddddddddddd 3186rrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr... (4 Replies)
Discussion started by: Neelkanth
4 Replies

8. Shell Programming and Scripting

awk to sum a column based on duplicate strings in another column and show split totals

Hi, I have a similar input format- A_1 2 B_0 4 A_1 1 B_2 5 A_4 1 and looking to print in this output format with headers. can you suggest in awk?awk because i am doing some pattern matching from parent file to print column 1 of my input using awk already.Thanks! letter number_of_letters... (5 Replies)
Discussion started by: prashob123
5 Replies

9. Shell Programming and Scripting

Split certain strings in a line for a specific column.

Hi, i need help to extract certain strings/words from lines with different length. I have 3 columns separated by tab delimiter. like below Probable arabinan endo-1,5-alpha-L-arabinosidase A (EC 3.2.1.99) (Endo-1,5-alpha-L-arabinanase A) (ABN A) abnA Ady3G14620 Probable arabinan... (5 Replies)
Discussion started by: redse171
5 Replies

10. Shell Programming and Scripting

Append 0's based on length

I'm having data like this, "8955719","186497034","0001","M","3" "8955719","186497034","0002","M","10" "8955719","186497034","0003","M","10" "8955719","186497034","0004","M","3" "8955723","186499034","0001","M","3" "8955723","186499034","0002","M","10" "8955723","186499034","0003","M","10"... (3 Replies)
Discussion started by: Artlk
3 Replies
mkmsgs(1)						      General Commands Manual							 mkmsgs(1)

NAME
mkmsgs - create message files for use by gettxt() SYNOPSIS
locale] textfile msgfile DESCRIPTION
The command takes as input a file of localized text strings and generates a message file that can be accessed by the gettxt(3C) routine. textfile is the name of the file that contains the text strings. msgfile is the name of the output message file. appends the suffix to the message file name. The combined length of the file name should be less than 14 bytes for short file name file system. The msgfile file should not contain a colon since it will confuse the formatting routines. The textfile file contains the localized text strings. The text strings are separated by a newline character. The text strings are pro- cessed sequentially and copied to the msgfile message file. An empty line in the input results in a corresponding empty message written to the msgfile message file. Options The command supports the following options: Overwrite the msgfile message file if it exists. The msgfile message file is installed in the system-wide localization directory corresponding to the specified locale. Only a user with the appropriate privileges can create or overwrite the message file in that directory. The direc- tory will be created if it does not exist. EXTERNAL INFLUENCES
Environment Variables determines the interpretation of messages as single- and/or multibyte characters. Messages are issued in if it is set to a valid language and messages are available. Otherwise "C" locale messages are issued. If is not specified in the environment or is set to the empty string, the value of is used as a default for each unspecified or empty vari- able. If is not specified or is set to the empty string, a default of "C" (see lang(5)) is used instead of If any internationalization variable contains an invalid setting, behaves as if all internationalization variables are set to "C". See environ(5). International Code Set Support Single- and multibyte character code sets are supported. EXAMPLES
The following example shows the format of the input text strings: WARNINGS
is provided for SVID3 compatibility only. The user is encouraged to use the NLS mechanism developed by HP and the X/Open Company, Ltd. SEE ALSO
gencat(1), gettxt(3C), setlocale(3C). STANDARDS COMPLIANCE
mkmsgs(1)
All times are GMT -4. The time now is 04:59 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy