Post 302537467 by shoaibjameel123 on Friday 8th of July 2011 09:37:51 AM
Extract certain patterns from file.

Hi All,

I tried extracting this pattern using grep but it did not work.

What I have is a file which has contents like this:


Code:
file:///channel/add-adhd.html
file:///channel/allergies.html
file:///channel/arthritis.html
http://mail.yahoo.com/
http://messenger.yahoo.com/
http://news.yahoo.com/
http://shine.yahoo.com/
https://twitter.com/YahooHealth
IMG
My Yahoo!
Yahoo! Finance
Yahoo! News
Yahoo! Sports

Now what I want to do is extract all the http, https, and file lines and get rid of IMG, My Yahoo!, Yahoo! News, etc. This means my final output should look like this:

Code:
file:///channel/add-adhd.html
file:///channel/allergies.html
file:///channel/arthritis.html
http://mail.yahoo.com/
http://messenger.yahoo.com/
http://news.yahoo.com/
http://shine.yahoo.com/
https://twitter.com/YahooHealth

Doing:

Code:
more file_name | grep http

extracts lines containing "http". But I want to extract all three patterns by somehow nesting grep. I even tried this:
Code:
more file_name | grep http | grep https | grep file

But it did not work. I am using Linux with Bash.
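
A likely reason the chained version prints nothing: each grep in a pipeline filters the output of the one before it, so the chain behaves like a logical AND, and no line in the sample contains "http", "https", and "file" all at once. What is wanted is a logical OR, which a single grep with alternation can express. The following is only a sketch under that reading of the problem; the temporary file and the variable names (sample_file, and_result, or_result) are made up for illustration, and the pattern assumes every wanted line starts with one of the three schemes followed by "://".

```shell
#!/usr/bin/env bash
# Recreate a small sample of the input from the post.
# sample_file is a hypothetical name used only for this sketch.
sample_file=$(mktemp)
cat > "$sample_file" <<'EOF'
file:///channel/add-adhd.html
http://mail.yahoo.com/
https://twitter.com/YahooHealth
IMG
My Yahoo!
Yahoo! News
EOF

# Chained greps act as a logical AND: a line must contain "http" AND
# "https" AND "file" to pass all three filters. No sample line does.
# grep exits with status 1 when nothing matches, hence the "|| true".
and_result=$(grep http "$sample_file" | grep https | grep file) || true

# One extended regex with alternation acts as a logical OR.
# ^ anchors the match at the start of the line; "https?" covers both
# http and https, so only two alternatives are needed.
or_result=$(grep -E '^(https?|file)://' "$sample_file")

printf '%s\n' "$or_result"
rm -f "$sample_file"
```

On the real file this reduces to grep -E '^(https?|file)://' file_name, with no need for more. If the schemes can appear anywhere in the line rather than at the start, dropping the ^ anchor would be the corresponding adjustment.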
 
