I see quite cool! However, I have a couple of problems undertanding your script. First, I do not get anything if I use "/-*/. Second, let suppose I only want to print the sequences that do not contain an "*". Then, I use your code:
This is what I get:
suppose
fileA
kanika123ABC 1222222222222222
raciat5678ty 1221123333331121
jessica78ulllo 2233243223333333
so output shud be print only first 10 characters in series and rest remain same
kanika123A 1222222222222222
raciat5678 1221123333331121
jessica78u ... (1 Reply)
Hi I made a post earlier but now my problem has become a lot more complicated.
So I have a file that looks like this:
Name 1 13 94 1 AGGTT
Name 1 31 44 1 TTCCG
Name 1 13 94 2 AAAAATTTT
Name 1 41 47 2 GGGGGGGGGGG So the file is tab delimited and what I want to do is find... (8 Replies)
Dear friends,
hello to everyone. I am new to this forum.
I have a set of data where I need to find the repitition of series as below
data format:
0001230000456000001230000456
each digit can be separated by any delimeter
I need to find out the starting point (index) of '123' and '456'
I... (2 Replies)
I have two LARGE files of data more than 20,000 line each, file-1 and file-2, and I wish to do the following if possible:
file-1
1 2 5 7 9
2 4 6 3 8 9
4 6 8 9 3 2 1 3
1 2
.
.
.
file-2
1 2 3
2 5 7
5 7 3
7 9 4
. (5 Replies)
sed -e "s// /g" old.txt > new.txt
While I do know some control characters need to be escaped, can normal characters also be escaped and still work the same way? Basically I do not know all control characters that have a special meaning, for example, ?, ., % have a meaning and have to be escaped... (11 Replies)
Hi all.
I have a .txt file that I need to sort it
My file is like:
1- 88 chain0 MASTER (FF-TE) FFFF 1962510 /TCK T FD2TQHVTT1 /jtagc/jtag_instreg/updateinstr_reg_1 dff1 (TI,SO)
2- ... (10 Replies)
I am trying to find a specific set of characters in a long file. I only want to find the characters in column 265 for 4 bytes.
Is there a search for that? I tried cut but couldn't get it to work.
Ex. I want to find '9999' in column 265 for 4 bytes. If it is in there, I want it to print... (12 Replies)
Hi all, I need help.
I have an input text file (input.txt) like this:
21 GTGCAACACCGTCTTGAGAGG 50
21 GACCGAGACAGAATGAAAATC 73
21 CGGGTCTGTAGTAGCAAACGC 108
21 CGAAAAATGAACCCCTTTATC 220
21 CGTGATCCTGTTGAAGGGTCG 259
Now I need to count A/T/G/C numbers at each character location in column... (2 Replies)
Hi All,
Assuming i have got a file test.dat which has contains as follows:
Unix = abc def fgt jug
111 2222 3333
Linux = gggg pppp qqq
C# = ccc ffff llll
I would like to traverse through the file, get the 1st occurance of "=" and then need to get the sting... (22 Replies)
Hi
I have a file which is tab-delimited. Now, I'd like to print the lines which have "chr6" string in both first and second columns. Could anybody help? (3 Replies)
Discussion started by: a_bahreini
3 Replies
LEARN ABOUT REDHAT
encoding
encoding(n) Tcl Built-In Commands encoding(n)
__________________________________________________________________________________________________________________________________________________NAME
encoding - Manipulate encodings
SYNOPSIS
encoding option ?arg arg ...?
_________________________________________________________________INTRODUCTION
Strings in Tcl are encoded using 16-bit Unicode characters. Different operating system interfaces or applications may generate strings in
other encodings such as Shift-JIS. The encoding command helps to bridge the gap between Unicode and these other formats.
DESCRIPTION
Performs one of several encoding related operations, depending on option. The legal options are:
encoding convertfrom ?encoding? data
Convert data to Unicode from the specified encoding. The characters in data are treated as binary data where the lower 8-bits of
each character is taken as a single byte. The resulting sequence of bytes is treated as a string in the specified encoding. If
encoding is not specified, the current system encoding is used.
encoding convertto ?encoding? string
Convert string from Unicode to the specified encoding. The result is a sequence of bytes that represents the converted string.
Each byte is stored in the lower 8-bits of a Unicode character. If encoding is not specified, the current system encoding is used.
encoding names
Returns a list containing the names of all of the encodings that are currently available.
encoding system ?encoding?
Set the system encoding to encoding. If encoding is omitted then the command returns the current system encoding. The system encod-
ing is used whenever Tcl passes strings to system calls.
EXAMPLE
It is common practice to write script files using a text editor that produces output in the euc-jp encoding, which represents the ASCII
characters as singe bytes and Japanese characters as two bytes. This makes it easy to embed literal strings that correspond to non-ASCII
characters by simply typing the strings in place in the script. However, because the source command always reads files using the ISO8859-1
encoding, Tcl will treat each byte in the file as a separate character that maps to the 00 page in Unicode. The resulting Tcl strings will
not contain the expected Japanese characters. Instead, they will contain a sequence of Latin-1 characters that correspond to the bytes of
the original string. The encoding command can be used to convert this string to the expected Japanese Unicode characters. For example,
set s [encoding convertfrom euc-jp "xA4xCF"]
would return the Unicode string "u306F", which is the Hiragana letter HA.
SEE ALSO Tcl_GetEncoding(3)KEYWORDS
encoding
Tcl 8.1 encoding(n)