TV_EXTRACTINFO_EN(1p)					User Contributed Perl Documentation				     TV_EXTRACTINFO_EN(1p)

NAME
       tv_extractinfo_en - read English-language listings and extract info from programme descriptions.

SYNOPSIS
       tv_extractinfo_en [--help] [--output FILE] [FILE...]

DESCRIPTION
       Read XMLTV data and attempt to extract information from English-language programme descriptions, putting it into machine-readable form. For example the human-readable text '(repeat)' in a programme description might be replaced by the XML element <previously-shown>.

       --output FILE write to FILE rather than standard output

       This tool also attempts to split multipart programmes into their constituents, by looking for a description that seems to contain lots of times and titles. But this depends on the description following one particular style and is useful only for some listings sources (Ananova).

       If some text is marked with the 'lang' attribute as being some language other than English ('en'), it is ignored.

SEE ALSO
       xmltv(5).

AUTHOR
       Ed Avis,

BUGS
       Trying to parse human-readable text is always error-prone, more so with the simple regexp-based approach used here. But because TV listing descriptions usually conform to one of a few set styles, tv_extractinfo_en does reasonably well. It is fairly conservative, trying to avoid false positives (extracting 'information' which isn't really there) even though this means some false negatives (failing to extract information and leaving it in the human-readable text).

       However, the leftover bits of text after extracting information may not form a meaningful English sentence, or the punctuation may be wrong.

       On the two listings sources currently supported by the XMLTV package, this program does a reasonably good job. But it has not been tested with every source of anglophone TV listings.

perl v5.14.2						    2011-05-07					     TV_EXTRACTINFO_EN(1p)
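
The conservative, regexp-based extraction that tv_extractinfo_en performs can be illustrated with a minimal sketch. This is not the tool's own code: the function name extract_repeat, the pattern, and the case-insensitive matching are assumptions made for illustration. It recognises only the exact parenthesised marker '(repeat)', leaves text in any language other than 'en' untouched, and reports whether a <previously-shown> element would be warranted.

```python
import re

# Hypothetical pattern: only the exact parenthesised marker is recognised,
# so ordinary uses of the word "repeat" are not treated as information.
REPEAT_RE = re.compile(r"\s*\(repeat\)", re.IGNORECASE)


def extract_repeat(desc, lang="en"):
    """Return (cleaned_description, previously_shown).

    lang stands in for the XMLTV 'lang' attribute; None means unmarked.
    """
    # Text marked as some language other than English is ignored.
    if lang not in (None, "en"):
        return desc, False
    cleaned, hits = REPEAT_RE.subn("", desc)
    if hits == 0:
        # Nothing recognised: leave the human-readable text alone.
        return desc, False
    return cleaned.strip(), True
```

Matching only the full marker trades recall for precision: a description such as 'Repeat offenders are profiled.' is left unchanged, which is the kind of false negative the BUGS section accepts in order to avoid false positives.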

TV_REMOVE_SOME_OVERLAPPING(1p)				User Contributed Perl Documentation			    TV_REMOVE_SOME_OVERLAPPING(1p)

NAME
       tv_remove_some_overlapping - Remove some overlapping programmes from XMLTV data.

SYNOPSIS
       tv_remove_some_overlapping [--help] [--output FILE] [FILE...]

DESCRIPTION
       Read one or more XMLTV files and write a file to standard output containing the same data, except that some 'magazine' programmes which seem to contain two or more other programmes are removed.

       For example, if 'Schools TV' runs from 10:00 to 12:00, and there are two programmes 'History' from 10:00 to 11:00 and 'Geography' from 11:00 to 12:00 on the same channel, then 'Schools TV' could be removed. A programme is removed only if there are two or more other programmes which partition its timeslot, which implies that it and these other programmes must have stop times specified.

       To avoid throwing away any real programmes, no programme will be discarded if it has content data other than title and URL.

       Filtering with this tool won't remove all overlapping programmes, but it will deal with the 'big magazine programme containing smaller programmes' data commonly seen from listings sources.

       --output FILE write to FILE rather than standard output

SEE ALSO
       xmltv(5).

AUTHOR
       Ed Avis,

perl v5.14.2						    2004-01-01				    TV_REMOVE_SOME_OVERLAPPING(1p)
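
The removal rule of tv_remove_some_overlapping (a programme is dropped only when two or more other programmes on the same channel exactly partition its timeslot, and it carries nothing beyond a title and URL) can be sketched as follows. This is an illustrative sketch, not the tool's implementation: the Programme class, its field names, and the use of minutes since midnight for times are all assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Programme:
    """Hypothetical stand-in for an XMLTV <programme> element."""
    channel: str
    title: str
    start: int                       # minutes since midnight (assumption)
    stop: Optional[int]              # None when no stop time is given
    has_extra_content: bool = False  # anything beyond title and URL


def is_partitioned(prog, programmes):
    """True if two or more other programmes exactly tile prog's timeslot."""
    if prog.stop is None:
        return False
    # Candidates: same channel, known and positive duration, inside the slot.
    inside = [o for o in programmes
              if o is not prog and o.channel == prog.channel
              and o.stop is not None and o.stop > o.start
              and o.start >= prog.start and o.stop <= prog.stop]

    def walk(t, used):
        # Follow programmes end to end from time t until the slot is filled.
        if t == prog.stop:
            return used >= 2
        return any(walk(o.stop, used + 1) for o in inside if o.start == t)

    return walk(prog.start, 0)


def remove_some_overlapping(programmes):
    """Keep every programme except content-free, fully partitioned ones."""
    return [p for p in programmes
            if p.has_extra_content or not is_partitioned(p, programmes)]
```

With 'Schools TV' from 600 to 720 and 'History' and 'Geography' covering 600 to 660 and 660 to 720 on the same channel, only 'Schools TV' is dropped. If 'Geography' had no stop time, or 'Schools TV' carried extra content, all three would be kept.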
