Sponsored Content
Full Discussion: Noob trying to improve
Operating Systems OS X (Apple) Noob trying to improve Post 302988248 by Ardzii on Thursday 22nd of December 2016 11:17:34 AM
Old 12-22-2016
Code Noob trying to improve

Hi everyone!

This is my first post here, I hope that I will not already be violating any rule! I also would like to apologize in advance as my post will definitely be a noob post... please have patience and faith Smilie!

Now that I have set the ground rules SmilieSmilie, my objective is trying to understand how to write a sort of script which could search and extract information out of the web and put that information into a CSV file.

To be completely transparent, I sort of already have a specific idea of what kind of information I'd like to get: The prices and details of some asset listings on the web.

I have checked the web and found out that with a while loop with a bunch of curl and grep commands could do the trick. Is there anyone who can help me in building this sort of "web-crawler".

Thanks in advance to you all!

Ardzii
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Can I improve this script ???

Hi all, Still a newbie and learning as I go ... as you do :) Have created this script to report on disc usage and I've just included the ChkSpace function this morning. It's the first time I've read a file (line-by-bloody-line) and would like to know if I can improve this script ? FYI - I... (11 Replies)
Discussion started by: Cameron
11 Replies

2. UNIX for Advanced & Expert Users

improve performance by using ls better than find

Hi , i'm searching for files over many Aix servers with rsh command using this request : find /dir1 -name '*.' -exec ls {} \; and then count them with "wc" but i would improve this search because it's too long and replace directly find with ls command but "ls *. " doesn't work. and... (3 Replies)
Discussion started by: Nicol
3 Replies

3. Shell Programming and Scripting

improve this?

Wrote this script to find the date x days before or after today. Is there any way that this script can be speeded up or otherwise improved? #!/usr/bin/sh check_done() { if then daysofmth=31 elif then if ... (11 Replies)
Discussion started by: blowtorch
11 Replies

4. UNIX for Dummies Questions & Answers

Improve Performance

hi someone tell me which ways i can improve disk I/O and system process performance.kindly refer some commands so i can do it on my test machine.thanks, Mazhar (2 Replies)
Discussion started by: mazhar99
2 Replies

5. Shell Programming and Scripting

Any way to improve performance of this script

I have a data file of 2 gig I need to do all these, but its taking hours, any where i can improve performance, thanks a lot #!/usr/bin/ksh echo TIMESTAMP="$(date +'_%y-%m-%d.%H-%M-%S')" function showHelp { cat << EOF >&2 syntax extreme.sh FILENAME Specify filename to parse EOF... (3 Replies)
Discussion started by: sirababu
3 Replies

6. IP Networking

How to improve throughput?

I have a 10Gbps network link connecting two machines A and B. I want to transfer 20GB data from A to B using TCP. With default setting, I can use 50% bandwidth. How to improve the throughput? Is there any way to make throughput as close to 10Gbps as possible? thanks~ :) (3 Replies)
Discussion started by: andrewust
3 Replies

7. Shell Programming and Scripting

Want to improve the performance of script

Hi All, I have written a script as follows which is taking lot of time in executing/searching only 3500 records taken as input from one file in log file of 12 GB Approximately. Working of script is read the csv file as an input having 2 arguments which are transaction_id,mobile_number and search... (6 Replies)
Discussion started by: poweroflinux
6 Replies

8. AIX

improve sulog

I just wrote a very small script that improves readability on system sulog. The problem with all sulog is there is lack of clarity whether the info you are looking at is the most current. So if you just need a simple soution instead of going thru the trouble of writing a script that rotate logs and... (0 Replies)
Discussion started by: sparcguy
0 Replies

9. Shell Programming and Scripting

How to improve an script?

Gents. I have 2 different scripts for the same purpose: raw2csv_1 Script raw2csv_1 finish the process in less that 1 minute raw2csv_2 Script raw2csv_2 finish the process in more that 6 minutes. Can you please check if there is any option to improve the raw2csv_2. To finish the job... (4 Replies)
Discussion started by: jiam912
4 Replies

10. Shell Programming and Scripting

Improve script

Gents, Is there the possibility to improve this script to be able to have same output information. I did this script, but I believe there is a very short code to get same output here my script awk -F, '{if($10>0 && $10<=15) print $6}' tmp1 | sort -k1n | awk '{a++} END { for (n in a )... (23 Replies)
Discussion started by: jiam912
23 Replies
Arch::LiteWeb(3pm)					User Contributed Perl Documentation					Arch::LiteWeb(3pm)

NAME
Arch::LiteWeb - simple way to access web pages SYNOPSIS
my $web = Arch::LiteWeb->new; my $content = $web->get("http://some.domain:81/some/path"); die $web->error . " while processing " . $web->request_url unless $content; my $content_type = $web->response_headers->{content_type}; DESCRIPTION
This class provides a basic and easy to use support for the client-side HTTP. It is supplied in order to avoid dependency on LWP. If such dependency is not a problem, consider to use LWP instead that provides much better support for HTTP and other protocols. METHODS
The following class methods are available: get, post, request_url, error, error_with_url, network_error, response_code, response_codestr, response_error, response_headers, response_content. get url [params ...] Execute HTTP get of the given url and return the html string or undef on network/response error. Use other methods to get the details about the error and the response. params is key-value hash, the following keys are supported: url_host - only used if url is none url_port - only used if url is none(80) url_path - only used if url is none endl - default is "1512" timeout - default is 20 seconds user_agent - default is "Arch::LiteWeb/0.1" nocache - add a no-cache header noredirect - don't follow redirect responses max_redirect_depth - default is 5 use_proxy - default is false proxy_url - proxy url ($http_proxy supported too) proxy_host - only used if proxy_url is none proxy_port - only used if proxy_url is none(80) post url input [params] Not implemented yet. request_url Actual url of the last issued request or undef. If partial redirect responses are enabled, then the result is the last (non-redirect) url. error If the last request resulted in error (i.e. get/post returned undef), then this method returns the error message, otherwise it returns undef. This is just a shortcut for network_error || response_error. error_with_url Like error, but with " while fetching request_url " text appended if non undef. network_error The network error message for the last request or undef. response_error The response error message for the last request or undef. response_code The last response code (integer) or undef. response_codestr The last response code (string) or undef. response_headers The last response headers (hashref of HTTP headers) or undef. response_content The last response content or undef. This is the same thing that the last get/post returns. BUGS
Not intended for use in mission-critical applications. AUTHORS
Mikhael Goikhman (migo@homemail.com--Perl-GPL/arch-perl--devel). SEE ALSO
For more information, see LWP, LWP::Simple. perl v5.10.1 2005-03-25 Arch::LiteWeb(3pm)
All times are GMT -4. The time now is 06:51 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy