That wasn't much fun. Took me a while to clue into the fact that awk arrays are associative and not numeric. Sorting was a pain until I made the element name a combo of the last field and a record number to guarantee uniqueness.
Without GAWK the function asorti() is unavailable, so I'm hoping you have GAWK. I'm a little unsure about where you want the records to end up, but I think you mean: all the records that match the last field go into unique files.
Your data generates the following files...
Each file contains records that match the last field.
There's an easier way to do this and you can skip the sort. Here's the pseudo code...
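A minimal awk sketch of that no-sort idea, assuming whitespace-separated records and a made-up input file name (data.txt):

    # write each record into a file named after its last field; no sorting needed
    # (gawk keeps the output files open; plain awk may need close() with many distinct keys)
    awk '{ print > ($NF ".txt") }' data.txt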
I know this has been asked before, but I just can't parse the syntax as explained. I have a set of files with user information spread over two lines that I wish to merge into one:
User1NameLast User1NameFirst User1Address
E-Mail:User1email
User2NameLast User2NameFirst User2Address... (11 Replies)
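Assuming each user really is exactly two lines, with the second one starting with "E-Mail:", a minimal awk sketch (users.txt is a placeholder name):

    # glue every "E-Mail:" line onto the end of the line that precedes it
    awk '/^E-Mail:/ { printf "%s %s\n", prev, $0; next } { prev = $0 }' users.txt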
I'll try to explain this as best I can. Let me know if it is not clear.
I have large text files that contain data as such:
143593502 09-08-20 09:02:13 xxxxxxxxxxx xxxxxxxxxxx 09-08-20 09:02:11 N line 1 test
line 2 test
line 3 test
143593503 09-08-20 09:02:13... (3 Replies)
Hi,
I have written a shell script to get the previous line based on the pattern.
For example if a file has below lines:
----------------------------------------------
#UNBLOCK_As _per
#As per
205.162.42.92
#BLOCK_As_per
#-----------------------
#input checks
abc.com... (5 Replies)
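One way to do that in awk, assuming the pattern is something like abc.com from the sample (the input file name is a placeholder):

    # remember each line; when the pattern matches, print the line before it
    awk '/abc\.com/ { print prev } { prev = $0 }' input_file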
Hi all,
I have made a small script to make a simple calculation on a file which is formatted in this way:
I want to create a new file in which the value of a particular line minus the value of the previous line is printed.
So my wanted output is:
I have made the following program to do the job... (5 Replies)
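A minimal awk sketch of that current-minus-previous calculation, assuming one numeric value per line and made-up file names:

    # from line 2 onward, print this line's value minus the previous line's value
    awk 'NR > 1 { print $1 - prev } { prev = $1 }' values.txt > diffs.txt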
this is my issue.
4 parameters are passed from korn shell to sql script.
parameter_1= varchar2 datatype or no value entered by the user.
parameter_2= number datatype or no value entered by the user.
parameter_3= number datatype or no value entered by the user.
parameter_4= number datatype or no... (5 Replies)
I need to read the contents of a file. Then I need to grep for a keyword and replace part of the grepped line based on the condition of previous and present line.
Example input file:
V {
port1 = P;
port2 = 0;
shift_port = P0; /* if next shift_port is P0 I need... (9 Replies)
I have a LOG file which looks like this
Import started at: Mon Jul 23 02:13:01 EDT 2012
Initialization completed in 2.146 seconds.
--------------------------------------------------------------------------------
--
Import summary for Import item: PolicyInformation... (8 Replies)
Hi all,
From a while loop I am reading a sorted file, and I want to print only the lines whose $1 matches the previous line's $1 and whose $2 differs from the previous line's $2 by more than 30.
Input would be like ...
AN237 010 193019 0502 1 CSU Amoxycillin
AN237 080 ... (2 Replies)
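A minimal awk sketch of that filter, assuming the file is already sorted on $1 and $2 and that "match" means the same $1 as on the previous line (sorted.txt is a placeholder):

    # print a line only when $1 repeats and $2 has jumped by more than 30
    awk '$1 == prev1 && ($2 - prev2) > 30 { print } { prev1 = $1; prev2 = $2 }' sorted.txt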
Hello,
I am trying to return a time value multiple times from a file that has varying output just before each time instance, i.e.
cat jumped
cat jumped
cat jumped
time = 1.1
cat jumped
cat jumped
time = 1.2
cat jumped
cat jumped
time = 1.3
In this case I would like to output a time.txt... (6 Replies)
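If the goal is simply to collect the time values into time.txt, a minimal awk sketch (input.log is a placeholder, and the "time = <value>" layout is assumed from the sample):

    # pull the numeric value out of every "time = ..." line
    awk '/^time =/ { print $3 }' input.log > time.txt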
WWW::RobotRules(3)          User Contributed Perl Documentation          WWW::RobotRules(3)
NAME
WWW::RobotRules - database of robots.txt-derived permissions
SYNOPSIS
use WWW::RobotRules;
my $rules = WWW::RobotRules->new('MOMspider/1.0');
use LWP::Simple qw(get);
{
my $url = "http://some.place/robots.txt";
my $robots_txt = get $url;
$rules->parse($url, $robots_txt) if defined $robots_txt;
}
{
my $url = "http://some.other.place/robots.txt";
my $robots_txt = get $url;
$rules->parse($url, $robots_txt) if defined $robots_txt;
}
# Now we can check if a URL is valid for those servers
# whose "robots.txt" files we've gotten and parsed:
if($rules->allowed($url)) {
$c = get $url;
...
}
DESCRIPTION
This module parses /robots.txt files as specified in "A Standard for Robot Exclusion", at <http://www.robotstxt.org/wc/norobots.html>
Webmasters can use the /robots.txt file to forbid conforming robots from accessing parts of their web site.
The parsed files are kept in a WWW::RobotRules object, and this object provides methods to check if access to a given URL is prohibited.
The same WWW::RobotRules object can be used for one or more parsed /robots.txt files on any number of hosts.
The following methods are provided:
$rules = WWW::RobotRules->new($robot_name)
This is the constructor for WWW::RobotRules objects. The first argument given to new() is the name of the robot.
$rules->parse($robot_txt_url, $content, $fresh_until)
The parse() method takes as arguments the URL that was used to retrieve the /robots.txt file, and the contents of the file.
$rules->allowed($uri)
Returns TRUE if this robot is allowed to retrieve this URL.
$rules->agent([$name])
Get/set the agent name. NOTE: Changing the agent name will clear the robots.txt rules and expire times out of the cache.
ROBOTS.TXT
The format and semantics of the "/robots.txt" file are as follows (this is an edited abstract of
<http://www.robotstxt.org/wc/norobots.html>):
The file consists of one or more records separated by one or more blank lines. Each record contains lines of the form
<field-name>: <value>
The field name is case insensitive. Text after the '#' character on a line is ignored during parsing. This is used for comments. The
following <field-names> can be used:
User-Agent
The value of this field is the name of the robot the record is describing access policy for. If more than one User-Agent field is
present the record describes an identical access policy for more than one robot. At least one field needs to be present per record. If
the value is '*', the record describes the default access policy for any robot that has not matched any of the other records.
The User-Agent fields must occur before the Disallow fields. If a record contains a User-Agent field after a Disallow field, that
constitutes a malformed record. This parser will assume that a blank line should have been placed before that User-Agent field, and
will break the record into two. All the fields before the User-Agent field will constitute a record, and the User-Agent field will be
the first field in a new record.
Disallow
The value of this field specifies a partial URL that is not to be visited. This can be a full path, or a partial path; any URL that
starts with this value will not be retrieved.
Unrecognized records are ignored.
ROBOTS.TXT EXAMPLES
The following example "/robots.txt" file specifies that no robots should visit any URL starting with "/cyberworld/map/" or "/tmp/":
User-agent: *
Disallow: /cyberworld/map/ # This is an infinite virtual URL space
Disallow: /tmp/ # these will soon disappear
This example "/robots.txt" file specifies that no robots should visit any URL starting with "/cyberworld/map/", except the robot called
"cybermapper":
User-agent: *
Disallow: /cyberworld/map/ # This is an infinite virtual URL space
# Cybermapper knows where to go.
User-agent: cybermapper
Disallow:
This example indicates that no robots should visit this site further:
# go away
User-agent: *
Disallow: /
This is an example of a malformed robots.txt file.
# robots.txt for ancientcastle.example.com
# I've locked myself away.
User-agent: *
Disallow: /
# The castle is your home now, so you can go anywhere you like.
User-agent: Belle
Disallow: /west-wing/ # except the west wing!
# It's good to be the Prince...
User-agent: Beast
Disallow:
This file is missing the required blank lines between records. However, the intention is clear.
SEE ALSO
LWP::RobotUA, WWW::RobotRules::AnyDBM_File
COPYRIGHT
Copyright 1995-2009, Gisle Aas
Copyright 1995, Martijn Koster
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
perl v5.16.3 2012-02-18 WWW::RobotRules(3)