Remove html tags with particular string inside the tags
Post 302634073 by georgi58 on Wednesday 2nd of May 2012 11:10:57 PM
05-03-2012
thanks for your help


10 More Discussions You Might Find Interesting

1. Linux

How to remove only html tags inside a file?

Hi All, I have the following example file and I want to remove only the html tags. Input File: <html> <head> <title>Software Solutions Inc., </title> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> </head> <body bgcolor=white leftmargin="0" topmargin="0"... (2 Replies)
Discussion started by: btech_raju
2 Replies

2. Shell Programming and Scripting

html tags

Hi, I'm new to the forum, so hi everyone, hope you are all well. I am attempting to write a bash script; at the moment it's a scraper/grabber using wget to download webpages related to the user's query. That part is no problem. When I have the page, I need to strip all the useless (to me) data out of the html... (3 Replies)
Discussion started by: dunryc
3 Replies

3. Shell Programming and Scripting

Remove html tags with bash

Hello, is there a way to go through a file and remove certain html tags with bash? If it needs sed or awk, that'll do too. The reason why I want this is, because I have a monitor script which generates a logfile in HTML and every time it generates a logfile, the tags are reproduced. The tags... (4 Replies)
Discussion started by: dejavu88
4 Replies

4. Shell Programming and Scripting

How to use sed to remove html tags including text between them

How to use sed to remove html tags including text between them? Example: User <b> rolvak </b> is stupid. It does not using <b>OOP</b>! and should output: User is stupid. It does not using ! Thank you.. (2 Replies)
Discussion started by: alphagon
2 Replies

5. Shell Programming and Scripting

remove html tags,consecutive duplicate lines

I need help with a script that will remove all HTML tags from an HTML document and remove any consecutive duplicate lines, and save it as a text document. The user should have the option of including the name of an html file as an argument for the script, but if none is provided, then the script... (7 Replies)
Discussion started by: clicstic
7 Replies

6. Shell Programming and Scripting

Parsing HTML, get text between 2 HTML tags

Hi there, I'm quite new to the forum and shell scripting. I want to filter out the "166.0 points". The results that I found in Google and the forum search didn't help me :( <a href="/user/test" class="headitem menu" style="color:rgb(83,186,224);">test</a><a href="/points" class="headitem... (1 Reply)
Discussion started by: Mysthik
1 Reply

7. Shell Programming and Scripting

How to remove string inside html tag <a>

Does anybody know how I can remove a string from an <a> tag? There are several hundred posts in a few forums that need to be cleaned up. The precise situation is ---------- <a href="http://mydomain.com/cgi-bin/anyboard.cgi?fvp=/family/sexuality_and_spirituality/&cmd=rA&cG=43"> ------------- my... (6 Replies)
Discussion started by: georgi58
6 Replies

8. Shell Programming and Scripting

Removing all except couple of html tags from html file

I tried to find elegant (or at least simple) way to remove all but couple of html tags from html file, but all examples I found dealt with removing all the tags. The logic of the script would be: - if there is <li> or <ul> on the line, do nothing (=write same line to output) - if there is:... (0 Replies)
Discussion started by: juubuntu
0 Replies

9. Shell Programming and Scripting

How to remove the values inside the html tags?

Hi, I have a txt file which contains this: <a href="linux">Linux</a> <a href="unix">Unix</a> <a href="oracle">Oracle</a> <a href="perl">Perl</a> I'm trying to extract the text in between these anchor tags and ignore everything else using grep. I managed to ignore the tags but unable to... (6 Replies)
Discussion started by: KCApple
6 Replies

10. Shell Programming and Scripting

How to remove multiline HTML tags from a file?

I am trying to remove a multiline HTML tag and its contents from a few HTML files following the same basic pattern. So far using regex and sed have been unsuccessful. The HTML has a basic structure like this (with the normal HTML stuff around it): <div id="div1"> <div class="div2"> <other... (4 Replies)
Discussion started by: threesixtyfive
4 Replies
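
Several of the discussions listed above ask how to pull text out from between tags, and the sample in discussion 9 (four `<a>` links) is complete enough to try. The following is only a rough sketch of one way to do it with Python's standard `html.parser` module instead of grep or sed. The class name `AnchorTextParser` and the helper `anchor_texts` are made up for this illustration and do not come from any of the threads.

```python
from html.parser import HTMLParser


class AnchorTextParser(HTMLParser):
    """Collect the text found between <a> and </a> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self._depth = 0      # > 0 while we are inside an <a> element
        self._buffer = []    # text pieces of the anchor currently being read
        self.texts = []      # finished anchor texts, in document order

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == "a" and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                self.texts.append("".join(self._buffer).strip())
                self._buffer = []

    def handle_data(self, data):
        # Text outside of anchors is ignored.
        if self._depth > 0:
            self._buffer.append(data)


def anchor_texts(html):
    """Return the list of texts enclosed by <a>...</a> in the given HTML."""
    parser = AnchorTextParser()
    parser.feed(html)
    parser.close()
    return parser.texts


# Sample input copied from discussion 9 above.
sample = ('<a href="linux">Linux</a> <a href="unix">Unix</a> '
          '<a href="oracle">Oracle</a> <a href="perl">Perl</a>')
print(anchor_texts(sample))  # ['Linux', 'Unix', 'Oracle', 'Perl']
```

A parser is used here rather than a regular expression because it copes with attributes and tags that span several lines, which is where the sed attempts described in these threads tend to break.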
PYTAGS(1)                         [FIXME: manual]                        PYTAGS(1)

NAME
       pytags - set and remove tags on media files from filename and options

SYNOPSIS
       pytags [OPTIONS] file [file...]

DESCRIPTION
       pytags is a simple, general-purpose tool for setting and removing media
       file tags. With no options, pytags prints a tag summary for all files
       specified on the command line.

       Options can be used to set and remove tags on files. Using --format, it
       is easy to change or create tags by parsing metadata from filenames
       according to an arbitrary format string. This makes it simple to tag
       files that are consistently named.

       Additionally, tags can be manipulated directly with the --add, --set,
       and --remove options. These options are processed in order, so options
       specified later on the command line may cause values set by earlier
       options to be overwritten. These options must appear after --format,
       if it is used.

OPTIONS
       --add=EXPR
              Append a tag value to all files according to EXPR. EXPR should
              be an expression like "artist=Foo". May be specified multiple
              times to add multiple values.

       --format=FORMAT
              Tag files with metadata extrapolated from filenames using format
              string FORMAT; see the section called "FORMAT STRINGS". This
              option must be specified before --add, --set, or --remove.

       --set=EXPR
              Set a single tag on all files according to EXPR. EXPR should be
              an expression like "artist=Foo". May be specified multiple times
              to set multiple tags. --set cannot be used to specify multiple
              values for a single tag. To do that, use --remove followed by
              multiple --add options.

       --remove=TAG
              Remove tags named TAG. May be specified more than once to remove
              multiple tags.

       -h, --help
              Show summary of options and exit.

       -v, --version
              Show version of program and exit.

FORMAT STRINGS
       The format string expected by --format is the same as that for
       pytagsfs. See the pytagsfs manual page for more information.

EXAMPLES
       Remove the genre tag from all .ogg files in the current directory.

              $ pytags --remove genre *.ogg

       Set the artist and album tag on all .mp3 files, and remove the genre
       tag.

              $ pytags --set artist=Foo --set album=Bar --remove genre *.mp3

       Tag all of the .flac files in the current directory. All such files are
       assumed to be in the format "tracknum artist - trackname [album].flac".

              $ pytags --format '%n %a - %t [%l].flac' *.flac

BUGS
       Please report bugs on launchpad at
       http://launchpad.net/products/pytagsfs/+bugs.

       pytags relies on mutagen (the underlying Python tags library) to choose
       an appropriate tag format for the files being tagged. This cannot
       currently be overridden.

SEE ALSO
       pytagsfs(1)

AUTHOR
       Forest Bond
              Author.

COPYRIGHT
       Copyright (C) 2007, 2008, 2009

[FIXME: source]                     2009-12-04                           PYTAGS(1)
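
To make the --format idea from the man page above more concrete, here is a minimal sketch of how a format string such as '%n %a - %t [%l].flac' could be matched against a filename. This is not how pytags or pytagsfs actually implement it. The placeholder-to-name mapping in `FIELDS` is an assumption read off the EXAMPLES text ("tracknum artist - trackname [album]"), and `format_to_regex` and `parse_filename` are hypothetical helper names.

```python
import re

# Assumed meaning of the placeholders, inferred from the EXAMPLES text only.
FIELDS = {"n": "tracknum", "a": "artist", "t": "trackname", "l": "album"}


def format_to_regex(fmt):
    """Translate a format string into a compiled regex with named groups.

    Each known placeholder may appear at most once, because a regex group
    name cannot be repeated.
    """
    parts = []
    i = 0
    while i < len(fmt):
        if fmt[i] == "%" and i + 1 < len(fmt) and fmt[i + 1] in FIELDS:
            # Known placeholder: capture lazily under the assumed field name.
            parts.append("(?P<%s>.+?)" % FIELDS[fmt[i + 1]])
            i += 2
        else:
            # Any other character must match literally.
            parts.append(re.escape(fmt[i]))
            i += 1
    return re.compile("^" + "".join(parts) + "$")


def parse_filename(fmt, filename):
    """Return a dict of extracted fields, or None if the name does not fit."""
    match = format_to_regex(fmt).match(filename)
    return match.groupdict() if match else None


# The filename below is invented for the example.
print(parse_filename("%n %a - %t [%l].flac", "03 Foo - Some Song [Bar].flac"))
# {'tracknum': '03', 'artist': 'Foo', 'trackname': 'Some Song', 'album': 'Bar'}
```

The lazy `.+?` groups rely on the literal separators (" - ", " [", "]") to split the name, so a filename whose artist itself contains " - " would be split at the first occurrence. For the real placeholder set and matching rules, the pytagsfs manual page is the reference.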
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.