scrapy(1) [debian man page]

SCRAPY(1)						      General Commands Manual							 SCRAPY(1)

NAME
       scrapy - the Scrapy command-line tool

SYNOPSIS
       scrapy [command] [OPTIONS] ...

DESCRIPTION
       Scrapy is controlled through the scrapy command-line tool. The script
       provides several commands for different purposes. Each command supports
       its own syntax, that is, its own set of arguments and options.

OPTIONS
   fetch [OPTION] URL
       Fetch a URL using the Scrapy downloader

       --headers
              Print response HTTP headers instead of body

   runspider [OPTION] spiderfile
       Run a spider

       --output=FILE
              Store scraped items to FILE in XML format

   settings [OPTION]
       Query Scrapy settings

       --get=SETTING
              Print raw setting value

       --getbool=SETTING
              Print setting value, interpreted as a boolean

       --getint=SETTING
              Print setting value, interpreted as an integer

       --getfloat=SETTING
              Print setting value, interpreted as a float

       --getlist=SETTING
              Print setting value, interpreted as a list

       --init Print initial setting value (before loading extensions and
              spiders)

   shell URL | file
       Launch the interactive scraping console

   startproject projectname
       Create new project with an initial project template

   --help, -h
       Print command help and options

   --logfile=FILE
       Log file. If omitted, stderr will be used

   --loglevel=LEVEL, -L LEVEL
       Log level (default: None)

   --nolog
       Disable logging completely

   --spider=SPIDER
       Always use this spider when arguments are URLs

   --profile=FILE
       Write Python cProfile stats to FILE

   --lsprof=FILE
       Write lsprof profiling stats to FILE

   --pidfile=FILE
       Write process ID to FILE

   --set=NAME=VALUE, -s NAME=VALUE
       Set/override setting (may be repeated)

AUTHOR
       Scrapy was written by the Scrapy Developers
       <scrapy-developers@googlegroups.com>.

       This manual page was written by Ignace Mouzannar <mouzannar@gmail.com>,
       for the Debian project (but may be used by others).

October 17, 2009                                                     SCRAPY(1)
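
The synopsis above (scrapy [command] [OPTIONS] ...) can be illustrated with a small Python sketch that assembles such command lines. This is only an illustration under stated assumptions: the helper name scrapy_argv, the URL http://example.com/ and the setting name USER_AGENT are hypothetical choices made for the example, not part of this manual page. The sketch builds the argument list and prints it; it does not run scrapy.

```python
import shlex


def scrapy_argv(command, *args, settings=None, logfile=None, loglevel=None):
    """Build an argv list of the form: scrapy command [OPTIONS] ...

    Hypothetical helper for illustration only. It uses just the options
    listed in the OPTIONS section (-s NAME=VALUE, --logfile, -L LEVEL).
    """
    argv = ["scrapy", command]
    # -s NAME=VALUE may be repeated, once per overridden setting.
    for name, value in (settings or {}).items():
        argv += ["-s", f"{name}={value}"]
    if logfile:
        argv.append(f"--logfile={logfile}")
    if loglevel:
        argv += ["-L", loglevel]
    # Command-specific options and arguments (e.g. --headers, a URL) go last.
    argv += list(args)
    return argv


if __name__ == "__main__":
    # Fetch a page and print the response headers instead of the body.
    print(shlex.join(scrapy_argv("fetch", "--headers", "http://example.com/")))
    # Query one setting while overriding it on the command line.
    print(shlex.join(scrapy_argv(
        "settings", "--get=USER_AGENT",
        settings={"USER_AGENT": "demo"}, loglevel="DEBUG",
    )))
```

The resulting list could be passed to subprocess.run() on a system where scrapy is installed; building a list rather than a single string avoids shell quoting problems with URLs and setting values.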
