Shell script to split data with a delimiter having chars and special chars


 
# 8  
Old 08-26-2019
Code:
dlm="<SelectStatement modified='1' type='string'><![CDATA["
dlm2="]]><ReadStatementFromFile type"
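awk '
# Line 1 only: print the text that follows the opening delimiter.
NR==1 && index($0, dlm) {print substr($0, index($0, dlm) + length(dlm))}
# Any line: print the text that precedes the closing delimiter.
index($0, dlm2) {print substr($0, 1, index($0, dlm2)-1)}
' dlm="$dlm" dlm2="$dlm2" a1.txt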
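The snippet above can be tried on a small sample. This is only a sketch: the contents of a1.txt are not shown in this post, so the two-line input below is hypothetical, and the helper name extract_between is made up for illustration. It assumes the opening delimiter sits on line 1 and the closing delimiter on a later line. Because index() does a plain string search rather than a regex match, the brackets and angle brackets in the delimiters need no escaping.

```shell
#!/bin/sh
# Hypothetical sample input; the real a1.txt is not shown in the thread.
tmp=$(mktemp)
cat > "$tmp" <<'EOF'
<Prop><SelectStatement modified='1' type='string'><![CDATA[SELECT id,
name FROM emp]]><ReadStatementFromFile type='bool'>0</ReadStatementFromFile>
EOF

dlm="<SelectStatement modified='1' type='string'><![CDATA["
dlm2="]]><ReadStatementFromFile type"

# extract_between OPEN CLOSE FILE (illustrative helper, not from the thread)
extract_between() {
    awk '
    # Line 1 only: text after the opening delimiter.
    NR==1 && index($0, dlm) {print substr($0, index($0, dlm) + length(dlm))}
    # Any line: text before the closing delimiter.
    index($0, dlm2) {print substr($0, 1, index($0, dlm2)-1)}
    ' dlm="$1" dlm2="$2" "$3"
}

result=$(extract_between "$dlm" "$dlm2" "$tmp")
printf '%s\n' "$result"
rm -f "$tmp"
```

With this sample the script prints "SELECT id," and then "name FROM emp". Note that if both delimiters were on line 1, each rule would fire on that line independently, so the output would still contain the other delimiter; lines lying strictly between the two delimiters are not printed at all.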


Last edited by rdrtx1; 02-18-2020 at 08:24 PM..

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

shell scripting to determine special chars in file

Hi, I need all your help to achieve the below functionality. I have a big 2 GB file and inside the file we need to identify, whether having a comma(,) or pipe(|) or tab or fixed position or semicolon(;) delimiter. If any of those delimiter found need to replace the file with pipe(|)... (1 Reply)
Discussion started by: lkeswar
1 Replies

2. UNIX for Advanced & Expert Users

Inserting delimiter after a specific number of chars

Hello guys, I have a problem where I need to add a delimiter, that can be | for example, after each 28000 chars. The problem is that sometimes 1 row, which should contain 28000 chars is split in 2, so I want to put the delimiter after each 28000 so I will know the end of each row. Please... (2 Replies)
Discussion started by: Diogo R Jesus
2 Replies

3. Shell Programming and Scripting

If condition matching with special chars

Hi, I have file #cat drivers.txt fcs0 fcs1 vscsi1 vscsi2 In this i need to check the availabality of "fcs" or "vscsi" alone not vscsi0,fcs1 I tried with "if condition" but it is not working. cat drivers.txt| while read ADAP do echo "Checking for $ADAP" if ;then echo "FC... (9 Replies)
Discussion started by: ksgnathan
9 Replies

4. Shell Programming and Scripting

All strings within two special chars

I have a file with multiple lines. From each line I want to get all strings that starts with '+' and ends with '/'. Then I want the strings to be separated by ' + ' Example input: +$A$/NOUN+At/NSUFF_FEM_PL+K/CASE_INDEF_ACC Sample output: $A$ + At + K (20 Replies)
Discussion started by: Viernes
20 Replies

5. UNIX for Dummies Questions & Answers

Strings with Special chars in IF condition

I was trying to run a code to check if a fax number is empty or not. for that, I've written the following code which is throwing an error. #!/bin/ksh fax= "999-999-9999" if ; then fax_no="000-000-0000" else fax_no=$fax fi echo $fax_no And I get the... (7 Replies)
Discussion started by: hooaamai
7 Replies

6. Shell Programming and Scripting

find 4 chars on 2nd line, 44 chars over

I know this should be simple, but I've been manning sed awk grep and find and am stupidly stumped :( I'm trying to use sed (or awk, find, etc) to find 4 characters on the second line of a file.txt 44-47 characters in. I can find lots of sed things for lines, but not characters. (4 Replies)
Discussion started by: unclecameron
4 Replies

7. Shell Programming and Scripting

How to convert C source from 8bit chars to 16bit chars?

I was using the following bash command inside the emacs compile command to search C++ source code: grep -inr --include='*.h' --include='*.cpp' '"' * | sed "/include/d" | sed "/_T/d" | sed '/^ *\/\//d' | sed '/extern/d' Emacs will then position me in the correct file and at the correct line... (0 Replies)
Discussion started by: siegfried
0 Replies

8. Shell Programming and Scripting

special chars arrangement in code

here is my simple script to show process and owners except me: ps `-ef |grep xterm |grep -v aucar` | while read a1 a2 a3 a4 a5 a6 a7 a8 do echo KILL..\($a1\).. $a2 |more done how can I pass values from command "ps -ef |grep xterm|grep -v aucar" to ? because above command... (2 Replies)
Discussion started by: xramm
2 Replies

9. Shell Programming and Scripting

treating special chars

Hi, I need some advise on treating non printable chars over ascii value 126 Case 1 : On some fields in the text , I need to retiain then 'as-is' and load to a database.I understand it also depends on database codepage. but i just wanna know how do i ensure it do not change while loading... (1 Reply)
Discussion started by: braindrain
1 Replies

10. UNIX for Advanced & Expert Users

Supress special chars in vi

Hi, One of our application is producing log files. But if we open the log file in vi or less or view mode, it shows all the special characters in it. The 'cat' shows correctly but it shows only last page. If I do 'cat' <file_name> | more, then again it shows special characters. ... (1 Reply)
Discussion started by: divakarp
1 Replies
textutil(n)						      Texts and strings utils						       textutil(n)

NAME
       textutil - Procedures to manipulate texts and strings.

SYNOPSIS
       package require Tcl 8.2

       package require textutil ?0.5?

       textutil::adjust string args

       textutil::splitx string ?regexp?

       textutil::tabify string ?num?

       textutil::tabify2 string ?num?

       textutil::trim string ?regexp?

       textutil::trimleft string ?regexp?

       textutil::trimright string ?regexp?

       textutil::untabify string ?num?

       textutil::untabify2 string ?num?

       textutil::strRepeat text num

DESCRIPTION
       The textutil package provides commands that manipulate strings or texts (a.k.a. long strings or strings with embedded newlines or paragraphs). The complete set of procedures is described below.

       textutil::adjust string args
              Do a justification on the string according to args. The string is taken as one big paragraph, ignoring any newlines. Then the line is formatted according to the options used, and the command returns a new string with enough lines to contain all the printable chars in the input string. A line is a set of chars between the beginning of the string and a newline, or between 2 newlines, or between a newline and the end of the string. If the input string is small enough, the returned string won't contain any newlines.

              By default, any occurrence of space characters or tabulations is replaced by a single space so each word in a line is separated from the next one by exactly one space char, and this forms a real line. Each real line is placed in a logical line, which has exactly a given length (see -length option below). The real line may have a lesser length. Again by default, any trailing spaces are ignored before returning the string (see -full option below). The following options may be used after the string parameter, and change the way the command places a real line in a logical line.

              -full boolean
                     If set to false, any trailing space chars are deleted before returning the string. If set to true, any trailing space chars are left in the string. Defaults to false.

              -justify (center|left|plain|right)
                     Set the justification of the returned string to center, left, plain or right. By default, it is set to left. The justification means that any line in the returned string but the last one is built according to the value.
                     If the justification is set to plain and the number of printable chars in the last line is less than 90% of the length of a line (see -length), then this line is justified with the left value, avoiding the expansion of this line when it is too small. The meaning of each value is:

                     center
                            The real line is centered in the logical line. If needed, a set of space chars is added at the beginning (half of the needed set) and at the end (half of the needed set) of the line if required (see the option -full).

                     left
                            The real line is set on the left of the logical line. It means that there are no space chars at the beginning of this line. If required, all needed space chars are added at the end of the line (see the option -full).

                     plain
                            The real line is exactly set in the logical line. It means that there are no leading or trailing space chars. All the needed space chars are added in the real line, between 2 (or more) words.

                     right
                            The real line is set on the right of the logical line. It means that there are no space chars at the end of this line, and there may be some space chars at the beginning, regardless of the -full option.

              -length integer
                     Set the length of the logical line in the string to integer. integer must be a positive integer value. Defaults to 72.

              -strictlength boolean
                     If set to false, a line can exceed the specified -length if a single word is longer than -length. If set to true, words that are longer than -length are split so that no line exceeds the specified -length. Defaults to false.

       textutil::splitx string ?regexp?
              Split the string and return a list. The string is split according to the regular expression regexp instead of a simple list of chars. Note that if you add parentheses to the regexp, the parenthesized part of the separator is added to the list as an additional element. If the string is empty the result is the empty list, like for split. If regexp is empty the string is split at every character, like split does.
              The regular expression regexp defaults to "[ ]+".

       textutil::tabify string ?num?
              Tabify the string by replacing any substring of num space chars by a tabulation and return the result as a new string. num defaults to 8.

       textutil::tabify2 string ?num?
              Similar to textutil::tabify, this command tabifies the string and returns the result as a new string. A different algorithm is used, however. Instead of replacing any substring of num spaces, this command works more like an editor. num defaults to 8.

              Each line of the text in string is treated as if there are tabstops every num columns. Only sequences of space characters containing more than one space character and found immediately before a tabstop are replaced with tabs.

       textutil::trim string ?regexp?
              Remove in string any leading and trailing substring according to the regular expression regexp and return the result as a new string. This applies to any line in the string, that is any substring between 2 newline chars, or between the beginning of the string and a newline, or between a newline and the end of the string, or, if the string contains no newline, between the beginning and the end of the string. The regular expression regexp defaults to "[ ]+".

       textutil::trimleft string ?regexp?
              Remove in string any leading substring according to the regular expression regexp and return the result as a new string. This applies to any line in the string, that is any substring between 2 newline chars, or between the beginning of the string and a newline, or between a newline and the end of the string, or, if the string contains no newline, between the beginning and the end of the string. The regular expression regexp defaults to "[ ]+".

       textutil::trimright string ?regexp?
              Remove in string any trailing substring according to the regular expression regexp and return the result as a new string.
              This applies to any line in the string, that is any substring between 2 newline chars, or between the beginning of the string and a newline, or between a newline and the end of the string, or, if the string contains no newline, between the beginning and the end of the string. The regular expression regexp defaults to "[ ]+".

       textutil::untabify string ?num?
              Untabify the string by replacing any tabulation char by a substring of num space chars and return the result as a new string. num defaults to 8.

       textutil::untabify2 string ?num?
              Untabify the string by replacing any tabulation char by a substring of at most num space chars and return the result as a new string. Unlike textutil::untabify, each tab is not replaced by a fixed number of space characters. Instead, the command overlays each line in the string with tabstops every num columns and replaces tabs with just enough space characters to reach the next tabstop. This is the complement of the actions taken by textutil::tabify2. num defaults to 8.

              There is one asymmetry though: A tab can be replaced with a single space, but not the other way around.

       textutil::strRepeat text num
              The implementation depends on the core executing the package. Uses string repeat if it is present, or a fast Tcl implementation if it is not. Returns a string containing the text repeated num times. The repetitions are joined without characters between them. A value of num <= 0 causes the command to return an empty string.

SEE ALSO
       regexp(n), split(n), string(n)

KEYWORDS
       string, regular expression

textutil							    0.5								       textutil(n)