Pinot 0.89 (Default branch)
Posted by Linux Bot on Sunday 21st of September 2008 11:30:08 AM (post 302238689)

Pinot is a D-Bus service that crawls and indexes your documents and monitors them for changes. It is also a GTK-based user interface that lets you query the index built by the service, or your favorite Web engine, and display and analyze the results. It makes full use of the advanced indexing and search facilities offered by Xapian, and features language detection, dynamic document summaries, easy labelling of documents, and internal support for common file types. The D-Bus interface allows easy integration with other applications.

License: GNU General Public License (GPL)

Changes:
Indexing and searching are now diacritics-insensitive by default, thanks to Unac. Stopwords can now be removed at query time. Short queries get better abstracts. Two features broken in 0.88 were fixed: indexing of plain text and XML files, and the use of boolean operators in spelling suggestions. Queries are de-hyphenated on line breaks. Spelling suggestions no longer repeat the same suggestion. The Simplified Chinese and Brazilian Portuguese translations have been updated.
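
To picture what the query-side changes listed above amount to, here is a minimal Python sketch. It is not Pinot's code and does not use Unac: diacritics are folded with the standard `unicodedata` module as a stand-in, and the stopword list and all function names are hypothetical, chosen only for illustration.

```python
import re
import unicodedata

# Hypothetical stopword list for illustration; the post does not show Pinot's lists.
STOPWORDS = {"the", "a", "an", "of", "and"}


def dehyphenate(text):
    # Rejoin a word that was split by a hyphen at a line break.
    return re.sub(r"(\w)-\s*\n\s*(\w)", r"\1\2", text)


def fold_diacritics(text):
    # Decompose characters, then drop the combining marks (accents).
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def normalize_query(query):
    # De-hyphenate, fold diacritics, lowercase, split, then drop stopwords.
    terms = fold_diacritics(dehyphenate(query)).lower().split()
    return [t for t in terms if t not in STOPWORDS]
```

With this sketch, a query typed as "résumé" and a document containing "resume" reduce to the same term, which is the practical effect of diacritics-insensitive search.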

VILISTEXTUM(1)						      General Commands Manual						    VILISTEXTUM(1)

NAME
       vilistextum - html to ascii converter

SYNOPSIS
       vilistextum [OPTIONS] [inputfile | -] [outputfile | -]

DESCRIPTION
       vilistextum is a html to ascii converter specifically programmed to get the best out of incorrect html.

OPTIONS
       inputfile, - resp. outputfile, -
              replace inputfile with '-' for reading from standard input, likewise outputfile with '-' for writing to standard output.

       -a, --no-alt
              don't output anything for IMG tags even if they have an ALT attribute. Implies --no-image.

       -c, --convert-tags
              some tags will be converted to special characters.

       -e, --errorlevel NUMBER
              increase level of verbosity for error messages (0: No error messages).

       -i, --defimage STRING
              IMG tags without alt attribute are output as [STRING].

       -l, --links
              numbers the links in the document and creates footnotes of each link at the end of the file.

       -k, --links-inline
              print the links directly after the html tag.

       -m, --dont-convert-characters
              don't convert the entities from windows1252 (&#128;-&#159; and their proper entity names)

       -n, --no-image
              don't output [Image] for IMG tags that have no ALT attribute.

       -p, --palm
              output text more suitable for reading on a PDA.

       -r, --remove-empty-alt
              if there is an empty ALT attribute in a IMG tag (eg <IMG href="..." alt="">), don't output '[]'.

       -s, --shrink-lines [NUMBER]
              if there are more than NUMBER empty lines, output only NUMBER. Default: 1.

       -t, --no-title
              don't output title.

       -w, --width NUMBER
              maximum line width.

       -h, --help
              display this help and exit

       -v, --version
              output version information and exit

MULTIBYTE OPTIONS
       (Only available if compiled with multibyte support)

       -u, --output-utf-8
              instead of the character set of the html document, everything will be output as utf-8.

       -x, --translit
              use the //TRANSLIT feature of libiconv. Consult the iconv manual for details.

       -y, --charset CHARSET
              if the HTML document doesn't provide a character set in the meta tags, use CHARSET.

LIMITATIONS
       The rendering of tables is not very good. The handling of OL is incomplete. The program treats it as UL and more than 10 nested lists confuse it. Text is never justified.

REPORTING BUGS
       Please report bugs to <bhaak@gmx.net>.

AUTHOR
       Vilistextum was written by Patric Mueller <bhaak@gmx.net> and may be freely distributed under the terms of the GNU General Public License Version 2. There is ABSOLUTELY NO WARRANTY for this program.

SEE ALSO
       iconv(3), lynx(1), links(1), w3m(1)

22 OCT 2006                                                                                                            VILISTEXTUM(1)
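
As a rough illustration of how the options documented above combine, the following Python sketch assembles a vilistextum command line. It only builds the argument list and does not run the program. The helper name `vilistextum_cmd`, its parameters, and the file name `page.html` are assumptions made for this example; the flags themselves are taken from the man page.

```python
import shlex


def vilistextum_cmd(inputfile="-", outputfile="-", width=None,
                    links=False, utf8=False, charset=None):
    # '-' means standard input / standard output, as described under OPTIONS.
    cmd = ["vilistextum"]
    if links:
        cmd.append("-l")             # number links and add footnotes
    if width is not None:
        cmd += ["-w", str(width)]    # maximum line width
    if utf8:
        cmd.append("-u")             # multibyte builds only
    if charset:
        cmd += ["-y", charset]       # multibyte builds only
    cmd += [inputfile, outputfile]
    return cmd


# Hypothetical usage: convert page.html to text on standard output.
print(shlex.join(vilistextum_cmd("page.html", "-", width=72, links=True)))
```

The resulting list can be passed to `subprocess.run` on a system where vilistextum is installed. The `-u` and `-y` flags should be used only if the binary was compiled with multibyte support.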