Blasted data inputters :mad: they always have to screw my data up... My comma-delimited file with three fields (firstname, surname and address) has been screwed up by people entering addresses like this (putting a comma between the house number and the street name):
142,Stonewall Avenue
... (8 Replies)
I want to add the delimiter in particular positions in one txt file.
My file is :
123450000000000testing
898983920202020testfil
.
.
.
1-5 -- after 5th position add ,
6-10 -- after 10th position add ,
11-7 -- like wise..
Expected output is:
12345,0000000000,testing... (5 Replies)
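A minimal sketch of one way to get the output shown above with sed. It assumes the field widths implied by the expected output (5 characters, then 10, then the rest of the line), which is an inference from the sample and not stated in the post. The function name add_delims is hypothetical.

```shell
# Hypothetical helper: insert a comma after the 5th and the 15th character
# of every line read from standard input (widths 5 and 10 are assumptions
# taken from the sample output, not from a stated layout).
add_delims() {
    sed 's/^\(.\{5\}\)\(.\{10\}\)/\1,\2,/'
}

printf '%s\n' 123450000000000testing 898983920202020testfil | add_delims
```

Lines shorter than 15 characters do not match the pattern and pass through unchanged.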
Hi,
We have a file with a unit separator as the delimiter.
Here are the Sample lines from the file:
ASIA/PACIFICHong KongFX2007071080900
ASIA/PACIFICHong KongFX2007071080900/ 800129HK
This delimiter has the ASCII value of \037.
I have to... (4 Replies)
I have a file containing about 5 million rows. Some records in the file have an extra delimiter at a random position (we don't know the positions). We have to count the delimiters in each row, and if the count does not match, I want to delete those rows from the... (5 Replies)
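A rough sketch of the row filter described above, using awk. The post names neither the delimiter nor the expected count, so the comma and the field count of 3 used here are assumptions, and keep_rows_with_fields is a hypothetical name. awk reads the input line by line, so this should cope with millions of rows.

```shell
# Hypothetical helper: keep only rows whose number of comma-separated
# fields equals $1. A row with N fields contains N-1 delimiters.
# The comma delimiter is an assumption; change -F for other files.
keep_rows_with_fields() {
    awk -F',' -v n="$1" 'NF == n'
}

printf 'a,b,c\na,b,c,d\nx,y,z\n' | keep_rows_with_fields 3
```

This counts raw delimiters only; it does not treat quoted fields containing the delimiter specially.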
Hi,
I have a file below
I want to attach an end delimiter '{}' after the time stamp
input file
22113350444356|Status:Assigned,Notes:
APP PRD |Sep 28 2011 12:12:55:660PM
22113350398356|Status:Assigned,Notes:
APP PRD |Sep 28 2011 12:12:55:660AM
22113350621356|Status:Assigned,Notes:... (6 Replies)
hi
If a comma delimiter is missing in the columns, treat the file as a bad file; if not, treat it as a good file. Only the columns are checked, not the data.
id,name,sal,deptno =======> gudfile
1,awa,200,10
2,aba,100,20
3,cdc,300,30
idname,sal,deptno ========> badfile since its missing (.)... (8 Replies)
Hi Friends,
I have a file as below
source.txt
12345JackYKing32N
1235 JulyYoig 31N
I am using the cut command to cut the fields
cut -c 1-5 source.txt
12345
1235
As above, I have to run cut each time to extract all the fields manually. I have a file (pre.txt) which tells... (3 Replies)
Hi All,
I have an awk command that uses the substr function. At the moment I know the length of the values, so I can use the example below, i.e. substr(i,0,1)
However, in future these lengths may change, so I wondered if you can use a delimiter within a substr? Like in bash you could use cut -d ';',... (6 Replies)
Hi,
Need help replacing every second instance of a delimiter.
Scenario:
var="Name1,Value1,Name2,Value2,Name3,Value3,Name4,Value"
I want to replace every second "," with "|"
I tried the following
echo $var| sed 's/,/|/2'
But, it's not working.
Expected output:
... (4 Replies)
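The sed attempt above changes only one comma because a numeric flag on the s command selects just that single occurrence (here the second one). A possible sketch that rewrites every second comma is shown below. It assumes the goal is name/value pairs separated by "|", since the expected output in the post is cut off, and the function name every_second_comma is hypothetical.

```shell
# Hypothetical helper: match two commas at a time and replace only the
# second one of each pair, repeating across the whole line (g flag).
every_second_comma() {
    sed 's/,\([^,]*\),/,\1|/g'
}

var="Name1,Value1,Name2,Value2,Name3,Value3,Name4,Value"
printf '%s\n' "$var" | every_second_comma
```

Each match consumes a comma, the text up to the next comma, and that next comma, so the commas at odd positions are kept and those at even positions become "|".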
Hi Team
The test file below is tab-delimited, and I am expecting the number of fields to be 3.
File : test.txt
a b c
awk -vFPAT='\t' -vOFS="\t" -v a="0" -v b="10" ' NR>a {if (NF != b ) print NR"@"NF }' test.txt
current output is
1@2
required output is
1@3
Could you please help... (7 Replies)
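A likely explanation for the count of 2: in gawk, FPAT describes what a field looks like, not what separates fields, so FPAT='\t' makes each of the two tabs a field. A sketch that splits on tabs with -F is shown below. The function name report_bad_field_counts is hypothetical, and the expected count of 10 is taken from the b variable in the command above.

```shell
# Hypothetical helper: print "line@fieldcount" for every line whose number
# of tab-separated fields differs from the expected count given as $1.
report_bad_field_counts() {
    awk -F'\t' -v want="$1" 'NF != want { print NR "@" NF }'
}

printf 'a\tb\tc\n' | report_bad_field_counts 10
```

For the single sample line this prints 1@3, because the line has three tab-separated fields and 3 differs from 10.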
Discussion started by: bmk123
utf8trans
utf8trans(1)                         docbook2X                         utf8trans(1)
NAME
utf8trans - Transliterate UTF-8 characters according to a table
SYNOPSIS
utf8trans charmap [file]...
DESCRIPTION
utf8trans transliterates characters in the specified files (or standard input, if they are not specified) and writes the output to standard
output. All input and output is in the UTF-8 encoding.
This program is usually used to render characters in Unicode text files as some markup escapes or ASCII transliterations. (It is not
intended for general charset conversions.) It provides functionality similar to the character maps in XSLT 2.0 (XML Stylesheet Language -
Transformations, version 2.0).
OPTIONS
-m, --modify
Modifies the given files in-place with their transliterated output, instead of sending it to standard output.
This option is useful for efficient transliteration of many files at once.
--help Show brief usage information and exit.
--version
Show version and exit.
USAGE
The translation is done according to the rules in the 'character map', named in the file charmap. It has the following format:
1. Each line represents a translation entry, except for blank lines and comment lines, which are ignored.
2. Any amount of whitespace (space or tab) may precede the start of an entry.
3. Comment lines begin with #. Everything on the same line is ignored.
4. Each entry consists of the Unicode codepoint of the character to translate, in hexadecimal, followed by one space or tab, followed by the
translation string, up to the end of the line.
5. The translation string is taken literally, including any leading and trailing spaces (except the delimiter between the codepoint and
the translation string), and all types of characters. The newline at the end is not included.
The above format is intended to be restrictive, to keep utf8trans simple. But if an XML-based format is desired, there is an
xmlcharmap2utf8trans script that comes with the docbook2X distribution, that converts character maps in XSLT 2.0 format to the utf8trans
format.
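The format rules above can be illustrated with a small character map. This is only a sketch: the three entries are made-up examples, and writing the codepoints as bare hexadecimal digits with no prefix is an assumption about how the "in hexadecimal" rule is meant. The invocation follows the SYNOPSIS (utf8trans charmap [file]...) and is skipped when the program is not installed.

```shell
# Write an example character map to a temporary file.
charmap=$(mktemp)
cat > "$charmap" <<'EOF'
# codepoint in hex, one space or tab, then the literal translation string
a9 (c)
2014 --
e9 e
EOF

# Transliterate standard input if utf8trans is available.
# The printf octal escapes emit the UTF-8 bytes for U+00E9 and U+00A9.
if command -v utf8trans >/dev/null 2>&1; then
    printf 'caf\303\251 \302\251 2007\n' | utf8trans "$charmap"
fi
```

Because the translation string is taken literally up to the end of the line, trailing spaces in an entry would become part of the output.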
LIMITATIONS
o utf8trans does not work with binary files, because malformed UTF-8 sequences in the input are substituted with U+FFFD characters.
However, null characters in the input are handled correctly. This limitation may be removed in the future.
o There is no way to include a newline or null in the substitution string.
AUTHOR
Steve Cheng <stevecheng@users.sourceforge.net>.
docbook2X 0.8.8 3 March 2007 utf8trans(1)