Extracting the column containing URL from a text file
I have the file like this:
Timestamp URL Text 1331635241000 http://example.com Peoples footage at www.test.com,http://example4.com 1331635231000 http://example1.net crack the nuts http://example6.com 1331635280000 http://example2.net Loving thisEach column is tab separated. I need to extract only the URLs from column 2 and column 3 if in case of the no URLs then leave it empty for example to get the result like this:
URL Text http://example.com www.test.com,http://example4.com http://example1.net http://example6.com http://example2.net
I tried this script
This script can give me the all URLS in a single column without the header. Any suggestion to resolve this.
Hi All,
I have some HTML files and my requirement is to extract all the anchor text words from the HTML files along with their URLs and store the result in a separate text file separated by space. For example, <a href="/kid/stay_healthy/">Staying Healthy</a>
which has /kid/stay_healthy/ as... (3 Replies)
I have a tab delimited text file where the first column can take on three different values : 100, 150, 250. I want to extract all the rows where the first column is 100 and put them into a separate text file and so on. This is what my text file looks like now:
100 rs3794811 0.01 0.3434... (1 Reply)
I have a tab delimited text file where the first column can take on three different values : 100, 150, 250. I want to extract all the rows where the first column is 100 and put them into a separate text file and so on. This is what my text file looks like now:
100 rs3794811 0.01 0.3434
100... (1 Reply)
I have a text file where the second column is a list of numbers going from small to large. I want to extract the rows where the second column is smaller than or equal to 0.0001.
My input:
rs10082730 9e-08 12 46002702
rs2544081 1e-07 12 46015487
rs1425136 1e-06 7 35396742
rs2712590... (1 Reply)
I have a space delimited text file. I want to extract rows where the third column has 0 as a value and write those rows into a new space delimited text file. How do I go about doing that? Thanks! (2 Replies)
I would like to extract the last column of a text file but different rows of the text file have different numbers of columns. How do I go about doing that? Thanks! (1 Reply)
Hello Everyone,
I am trying to write a shell script(or Perl Script) that would do the following:
I have a file that contains the following lines:
File:
https://ims-svnus.com/dev/DB/trunk/feeds/templates/shell_script.txt -r860... (5 Replies)
I have the file like this:
Timestamp URL Text 1331635241000 http://example.com Peoples footage at www.test.com,http://example4.com 1331635231000 http://example1.net crack the nuts http://example6.com 1331635280000 http://example2.net ... (0 Replies)
I have the file like this:
Timestamp URL Text 1331635241000 http://example.com Peoples footage at www.test.com,http://example4.com 1331635231000 http://example1.net crack the nuts http://example6.com 1331635280000 http://example2.net ... (3 Replies)
Discussion started by: csim_mohan
3 Replies
LEARN ABOUT DEBIAN
bti-shrink-urls
BTI-SHRINK-URLS(1) bti-shrink-urls BTI-SHRINK-URLS(1)NAME
bti-shrink-urls - convert URLs to a shorter form using a web service
SYNOPSIS
bti [--escaped] [--help] [URL]
DESCRIPTION
bti-shrink-urls converts URLs to a shorter form using a web service.
Currently http://2tu.us/ (default) and http://bit.ly / http://j.mp are supported.
OPTIONS --escaped
Don't escape special characters in the URL any more, they are already percent encoded.
--help
Print help text.
URL
Specify the URL to be converted. If no URL is given bti-shrink-urls waits for input on stdin.
CONFIGURATION
bti-shrink-urls is configured by setting some values in ~/.bti:
shrink_host
Possible values: 2tu.us (default), bit.ly, j.mp
shrink_bitly_login
API login for bit.ly, j.mp, required if shrink_host is set to bit.ly or j.mp. See
https://code.google.com/p/bitly-api/wiki/ApiDocumentation
shrink_bitly_key
API key for bit.ly, j.mp, required if shrink_host is set to bit.ly or j.mp. See
https://code.google.com/p/bitly-api/wiki/ApiDocumentation
AUTHOR
Written by Bart Trojanowski bart@jukie.net.
COPYRIGHT AND LICENSE
Copyright (C) 2009 Bart Trojanowski bart@jukie.net.
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by
the Free Software Foundation version 2 of the License.
bti-shrink-urls March 2009 BTI-SHRINK-URLS(1)