I've a csv file in which the record is getting break into 1 line or more than one line. I want to combine those splits into one line and remove the unwanted character existing in the record i.e. double quote symbol ("). The line gets break only when the record contains double quote(") symbol. Please find the following current and expected csv file format.
Current CSV file: Expected CSV file:
I know its bit difficult to find the solution but it would be great if anyone can help me out in getting the script prepared to fix this issue.
Thanks in Advance.
Last edited by Franklin52; 06-14-2012 at 08:05 AM..
Reason: Please use code tags for data and code samples
Hi,
I got some log files which print the whole xml message in separate lines:
e.g.
2008-10-01 14:21:44,561 INFO do something
2008-10-01 14:21:44,561 INFO print xml : <?xml version="1.0" encoding="UTF-8"?>
<a>
<b>my data</b>
</a>
2008-10-01 14:21:44,563 INFO do something again
I want... (3 Replies)
I have an issue where I need to append a value at the last of the csv, I have created a shell script and it is appending the columns at the last but it is appending at all lines, and my requirement is specific to just append at the 1st line.
Have a look and suggest, (7 Replies)
Using ls input as example..
ls | sed 's/\n/ /'outputs with line breaks, where I was expecting the \n to disappear. I've tried \r as well wondering if terminal output used different breaks.
Is there a way to remove the line breaks without saving to file and then working from there?
----------... (2 Replies)
Hi,
I am pasring a file line by line. I need to check each field in line and remove particular line.
input file lines are,
02;ABC;PQR
03;aaa;rrr
04;ABC;ggg
09;eee;ABC
04;lmn;stu
I am looking for line containing "ABC" as field value. Now How can I remove this line from input file... (7 Replies)
I need to remove all line breaks in a document after a match, until there is a blank line.
Example below, after the match "THE GREEN TABLE" remove line breaks until a blank line. Then, after the match "THE BLUE TABLE" do the same.
Before:
THE GREEN TABLE
Lorem ipsum dolor sit amet,... (14 Replies)
I have below file. I want to remove space at begining of every line and then after also remove blank line from file.
I use below code for each operation.
sed -e 's/^*//' < check.txt > check1.txt
sed '/^\s*$/d' < check1.txt > check2.txt
above code not remove all the space... (12 Replies)
i have data as below
123,"paul phiri",paul@yahoo.com,"po.box 23, BT","Eco Bank,Blantyre,Malawi"
i need an output to be
123,"paul phiri",paul@yahoo.com,"po.box 23 BT","Eco Bank Blantyre Malawi" (5 Replies)
Hi,
I want to remove all extra spaces, line breaks .
Need a new line entry only for term starting"array"
For eg: my input is
array(),
array(),
array(),
and my expected output is
array(),
array(),
array(),
Is it possible using awk? (5 Replies)
Regarding copy/pasted text of copyright-free book from archive.org (link below), in attempt to expand single-line-break paragraph text (not section headings or paragraph breaks) to wider right margin, Justify or Wrap in LIbreOffice is not working, and Find/Replace the paragraph mark ($) wraps all... (2 Replies)
Discussion started by: p1ne
2 Replies
LEARN ABOUT DEBIAN
ocr4gamera
OCR4GAMERA(1)OCR4GAMERA(1)NAME
ocr4gamera - OCR system using the Gamera framework
USAGE
ocr4gamera -x <traindata> [options] <imagefile>
OPTIONS -v <int>, --verbosity=<int>
Set verbosity level to <int>. Possible values are 0 (default): silent operation; 1: information on progress; >2: segmentation info
is written to PNG files with prefix debug_.
-h, --help
Display help and exit.
-d, --deskew
Do a skew correction (recommended).
-f, --filter
Filter out very large (images) and very small components (noise).
-a, --automatic-group
Autogroup glyphs with classifier.
-x <file>, --xmlfile=<file>
Read training data from <file>.
-o <xml>, --output=<xml>
Write recognized text to file <xml> (otherwise it is written to stdout).
-c <csv>, --extra_chars_csvfile=<csv>
Read additional class name conversions from file <csv>. <csv> must contain one conversion per line.
-R <rules>, --heuristic_rules=<rules>
Apply heuristic rules <rules> for disambiguation of some chars. <rules> can be roman (default) or none (for no rules).
-D, --dictionary-correction
Correct words using a dictionary (requires aspell or ispell).
-L <lang>, --dictionary-language=<lang>
Use <lang> as language for aspell (when option -D is set).
-e <int>, --edit-distance=<int>
Correct words only when edit distance not more than <int>.
OCR4GAMERA(1)