Hope this works for you... Slight modification from your solution:
Explanation :
1. s/\(<[^>]*>\)\([^>]*\)\(<[^>]*>\)/\1\n\2\3/g
This replaces like <MY_BIG_TAG>\nThis_is_a_test<MY_BIG_TAG>
2. starts loop
3. After \n till < arrives, substitute all underscore to _Q
4. Again checks if the same pattern appears, if it is, go through the loop again.
5. Atlast replace \n with the empty ( which we replaced in line 1).
I built a 12 million record file and made a mistake, one field is 1 character too long.
The record is 40 bytes and ends always in 999. I am trying to delete the 37 character in each record. Is this possible without doing a cut and paste. (1 Reply)
It's all in the subject. I try to figure out how to repeat a character a number of time with printf.
For example to draw a line in a script output.
Thks (13 Replies)
Hi
I need to write a script that read a input file that had same statement repeatedly to replace only 2nd & 5th time repeated statements (ex: This is UNIX forum) with another statement ( UNIX forum threads in Shell programming) with out modifying 1st,3,4th repeated statements. I am planning to do... (2 Replies)
Hi all
I am trying to get my head around doing the following....
I have an input field that could contain either a number a blank field or a whitespace field.
What I want to do is delete a 0 (zero) if it's on its own or leading the number.
So:-
\t0 delete the zero
0 delete the... (8 Replies)
Hi All,
Was wondering how I can do the following....
I have a String as follows
"ACCTRL000005022RRWDKKEEDKDD...."
This string can be in a file called tail.out or in a Variable called $VAR2
Now I have another variable called $VAR1="000004785" (9 bytes long), I need the content of... (5 Replies)
I am a newbie and would like some help with the following -
Trying to search fileA for a string similar to -
AS11000022010 30.4 31.7 43.7 53.8 60.5 71.1 75.2 74.7 66.9 56.6 42.7 32.5 53.3
I then want to replace that string with a string from fileB - ... (5 Replies)
Hi there,
A total sed noob here. Is there a way using sed to delete everything before a character AND after another character on each line in a file? The deletion should also delete the indicating characters(here: an opening and a closing parenthesis).
The original file would look like... (3 Replies)
Hi,
Anyone can help using SED searches a character string for a specified delimiter character, and returns a leading or trailing space/blank.
Text file :
"1"|"ExternalClassDEA519CF5"|"Art1"
"2"|"ExternalClass563EA516C"|"Art3"
"3"|"ExternalClass305ED16B8"|"Art9"
...
...
... (2 Replies)
Hello is it possible with awk or sed to replace any white space with the previous line characters in the same position?
I am asking this because the file I have doesn't always follow a pattern.
For example the file I have is the result of a command to obtain windows ACLs:
icacls C:\ /t... (5 Replies)
I have a csv dataset like this :
C,rs18768
G,rs13785
GA,rs1065
G,rs1801279
T,rs9274407
A,rs730012
I'm thinking of use like awk, sed to covert the dataset to this format: (if it's two character, then keep the same)
CC,rs18768
GG,rs13785
GA,rs1065
GG,rs1801279
TT,rs9274407... (7 Replies)
Discussion started by: nengcheng
7 Replies
LEARN ABOUT DEBIAN
cw
CW(7) Miscellaneous Information Manual CW(7)NAME
CW - the international Morse code
DESCRIPTION
CW is an abbreviation for "continuous wave", the commonly used technical term for Morse code communication. A basic knowledge or under-
standing of Morse code is a requirement for Radio Amateurs and Marine Radio Operators in many parts of the world.
MORSE CODE TIMINGS
In Morse code, a dot or dash is referred to as an element. The basic timing unit is the dot period. This is the time taken to send a dot,
not including any space before or after the dot. The lengths of all other elements are then derived from this basic unit, using the fol-
lowing rules:
The duration of a dash is three dots.
The time between each element (dot or dash) is one dot length.
The space between characters is three dot lengths.
The space between words is seven dot lengths.
The following formula calculates the dot period in microseconds from the Morse code speed in words per minute:
dot period = ( 1200000 / speed )
This formula arises from the use of the word PARIS as a 'standard' word for calibrating Morse code speed. PARIS is 50 units long when sent
in Morse code. Analysis of English plain-text indicates that the average word is 50 units, including spaces.
MORSE CODE CHARACTERS
The following list shows the IS0 8859-1 (Latin-1) characters that have commonly understood representations in Morse code:
ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789"$()+-./:;=?_@ and space
In addition, following ISO 8859-1 and ISO 8859-2 accented characters are also part of the generally accepted international Morse code:
UACOEEANS (S with cedilla), (Z with caron/hacek),
Finally, libcw adds the following ASCII characters as extensions to single character procedural signals:
<>!&^~
MORSE CODE CHARACTER TABLES
The following table shows the Morse code equivalents for the ISO 8859-1, accented ISO 8859-1, and accented ISO 8859-2 characters above.
The ASCII portion of this table is taken from the ARRL Handbook, and the accented extensions from various other sources:
Ch Code Ch Code Ch Code Ch Code
-------------------------------------------------------
A .- B -... C -.-. D -..
E . F ..-. G --. H ....
I .. J .--- K -.- L .-..
M -- N -. O --- P .--.
Q --.- R .-. S ... T -
U ..- V ...- W .-- X -..-
Y -.-- Z --..
0 ----- 1 .---- 2 ..--- 3 ...--
4 ....- 5 ..... 6 -.... 7 --...
8 ---.. 9 ----.
" .-..-. ' .----. $ ...-..- ( -.--.
) -.--.- + .-.-. , --..-- - -....-
. .-.-.- / -..-. : ---... ; -.-.-.
= -...- ? ..--.. _ ..--.-
Ch Code Ch Code
-------------------------------------------------
U ..-- A .-.-
C -.-.. O ---.
E ..-.. A .-..-
A .--.- N --.--
S (S+cedilla) ---- (Z+caron/hacek) --..-
In addition to the above standard characters, the following characters are conventionally used for punctuation and procedural signals as
follows:
Ch Code Ch Code Ch Code Ch Code
-------------------------------------------------------
" .-..-. ' .----. $ ...-..- ( -.--.
) -.--.- + .-.-. , --..-- - -....-
. .-.-.- / -..-. : ---... ; -.-.-.
= -...- ? ..--.. _ ..--.- @ .--.-.
and the following are non-conventional extensions implemented by libcw:
Ch Code Ch Code Ch Code Ch Code
-----------------------------------------------------
< ...-.- > -...-.- ! ...-. & .-...
^ -.-.- ~ .-.-..
An alternative view of punctuation and procedural signals is as combination Morse characters:
Ch Prosig Ch Prosig Ch Prosig Ch Prosig
---------------------------------------------------------
" [AF] ' [WG] $ [SX] ( [KN]
) [KK] + [AR] , [MIM] - [DU]
. [AAA] / [DN] : [OS] ; [KR]
= [BT] ? [IMI] _ [IQ] @ [AC]
< [VA],[SK] > [BK] ! [SN] & [AS]
^ [KA] ~ [AL]
NOTES
Despite the fact that this manual page constantly and consistently refers to Morse code elements as dots and dashes, DO NOT think in these
terms when trying to learn Morse code. Always think of them as 'dit's and 'dah's.
SEE ALSO
Man pages for libcw(3,LOCAL), cw(1,LOCAL), cwgen(1,LOCAL), cwcp(1,LOCAL), and xcwcp(1,LOCAL).
CW Tutor Package CW(7)