Sponsored Content
Top Forums UNIX for Beginners Questions & Answers Multiline html tag parse shell script Post 303040321 by RudiC on Saturday 26th of October 2019 12:57:47 PM
Old 10-26-2019
Why don't you paint the whole picture with your requirements (including but not limited to "get other texts out of the html", "get rid of the new lines", "replacing the commas with semicolon") and input data so people could work towards a final, optimal solution? E.g. the sed, awk, and dual tr invocations could be combined into a single run of one of the tools,
 

10 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

How do I extract text only from html file without HTML tag

I have a html file called myfile. If I simply put "cat myfile.html" in UNIX, it shows all the html tags like <a href=r/26><img src="http://www>. But I want to extract only text part. Same problem happens in "type" command in MS-DOS. I know you can do it by opening it in Internet Explorer,... (4 Replies)
Discussion started by: los111
4 Replies

2. Shell Programming and Scripting

how to use html tag in shell scripting

Hai friends I have a small doubt.. how can we use html tag in shell scripting code : echo "<html>" echo "<body>" echo " welcome to peace world " echo "</body>" echo "</html>" output displayed like this: <html> <body> welcome to peace world </body> </html> (5 Replies)
Discussion started by: jrex1983
5 Replies

3. UNIX for Advanced & Expert Users

shell script to parse html file

hi all, i have a html file something similar to this. <tr class="evenrow"> <td class="data">added</td><td class="data">xyz@abc.com</td> <td class="data">filename.sql</td><td class="modifications-data">08/25/2009 07:58:40</td><td class="data">Added TK prof script</td> </tr> <tr... (1 Reply)
Discussion started by: sais
1 Replies

4. Shell Programming and Scripting

Parse HTML tag parameters and text

Hi! I have a bunch of HTML files, which I want to parse to CSV files. Every page has a table in it, and I need to parse each row into a csv record. With awk and sed, I managed to put every table row in separate lines. So my file looks like this: <TR> .... </TR> <TR> .... </TR> ...One... (1 Reply)
Discussion started by: senszey
1 Replies

5. Shell Programming and Scripting

Script to delete HTML tag

Guys, I have a little script that I got of the internet and that I use in Squid to block ads. I used that script with linux but now i have moved my servers to freebsd. I have a step learning curve there but it is fun: Back to the script issue. The script used to work i with linux but... (15 Replies)
Discussion started by: zongo
15 Replies

6. Shell Programming and Scripting

awk Script to parse a XML tag

I have an XML tag like this: <property name="agent" value="/var/tmp/root/eclipse" /> Is there way using awk that i can get the value from the above tag. So the output should be: /var/tmp/root/eclipse Help will be appreciated. Regards, Adi (6 Replies)
Discussion started by: asirohi
6 Replies

7. Shell Programming and Scripting

Search for a html tag and print the entire tag

I want to print from <fruits> to </fruits> tag which have <fruit> as mango. Also i want both <fruits> and </fruits> in output. Please help eg. <fruits> <fruit id="111">mango<fruit> . another 20 lines . </fruits> (3 Replies)
Discussion started by: Ashik409
3 Replies

8. Shell Programming and Scripting

Using shell command need to parse multiple nested tag value of a XML file

I have this XML file - <gp> <mms>1110012</mms> <tg>988</tg> <mm>LongTime</mm> <lv> <lkid>StartEle=ONE, Desti = Motion</lkid> <kk>12</kk> </lv> <lv> <lkid>StartEle=ONE, Source = Velocity</lkid> <kk>2</kk> </lv> <lv> ... (3 Replies)
Discussion started by: NeedASolution
3 Replies

9. Shell Programming and Scripting

XML Parse between to tag with upper tag

Hi Guys Here is my Input : <?xml version="1.0" encoding="UTF-8"?> <xn:MeContext id="01736"> <xn:VsDataContainer id="01736"> <xn:attributes> <xn:vsDataType>vsDataMeContext</xn:vsDataType> ... (12 Replies)
Discussion started by: pareshkp
12 Replies

10. Shell Programming and Scripting

How to remove html tag which has multiple lines in SHELL?

I want to clean a html file. I try to remove the script part in the html and remove the rest of tags and empty lines. The code I try to use is the following: sed '/<script/,/<\/script>/d' webpage.html | sed -e 's/<*>//g' | sed '/^\s*$/d' > output.txt However, in this method, I can not... (10 Replies)
Discussion started by: YuhuiFeng
10 Replies
BINS(1) 						      General Commands Manual							   BINS(1)

NAME
bins_edit -- Set fields in XML picture description files for BINS SYNOPSIS
bins_edit [-a | --album ] [-m | --html ] [-t title | --title title ] [-e event | --event event ] [-l location | --location loca- tion ] [-p people | --people people ] [-y date | --date date ] [-d description | --description description ] [--longdesc longDe- scription] [--shortdesc shortDescription] [--sample pictureFileName] [-g tag=value | --generic tag=value ] [-h | --help ] [-v | --verbose ] [-q | --quiet ] [file] [files ...] DESCRIPTION
This manual page documents briefly the bins_edit command. This manual page was written for the Debian distribution because the original program does not have a manual page. Instead, it has docu- mentation in HTML in /usr/share/doc/bins/index.html as well as a --help option. bins_edit sets values in the XML picture description files that bins(1) uses to generate galleries. OPTIONS
This program follows the usual GNU command line syntax, with long options starting with two dashes (`-'). A summary of options is included below. file By default, file is the filename of the XML file with the image properties. If the argument has no .xml suffix, it is added, so you can directly give picture names on the command line. Spaces and other special characters (even newlines) can be used in val- ues given as parameters as long as they are enclosed between quotes. If the --album option is given, the filename refers to the directory of images, and the album.xml in that directory will be modi- fied instead. -a --album edit album description (instead of the default of editing the image description.) In this case, the file parameter must be the source directory of the album. Only the --title, --longdesc, --shortdesc and --sample switches make sense with this option. -m --html input value will be interpreted as HTML code, thus, no HTML encoding or quoting will be done. -t title --title title Set the title (of an image.) -e event --event event Set the event name (of an album or image.) -l location --location location Set the location (of an image.) -p people --people people Set the list of people (of an image.) -y date --date date Set the date (of an image.) -d description --description description Set the description (of an image.) --longdesc longDescription Set the long description (of an album.) --shortdesc shortDescription Set the short description (of an album.) --sample pictureFileName Select the sample picture, within this album, to be used on the album list (template subalbum.html.) Note that the filename is relative to the album directory, and thus doesn't have a directory component. -g tag=value --generic tag=value This lets you set arbitrary fields in the relevant XML file (of the image or the album.) Generic tags appear inside description which appears inside image; the tag appears as the name attribute of a field element, and the value appears as the content of the element. -h --help Gives quick help (which this man page is based on.) -v --verbose This switch can appear several times to increase verbosity level. -q --quiet Suppress output. EXAMPLES
Set the title of the Image.jpg file to "My picture": bins_edit -t "My picture" Image.jpg Set the title and location of all JPEG pictures in the directory: bins_edit --title Holiday --location Paris *.jpg Use of HTML values: bins_edit --html --description '<b>BINS</b> is cool' file.jpg Set the title short description and sample image of the album in the current directory (note the dot as final parameter): bins_edit -a -t "My Album" --sample image.jpg --shortdesc "This is my album" . SEE ALSO
bins (1). AUTHOR
This manual page was written by Mark W. Eichin eichin@thok.org for the Debian system (but may be used by others). As it is mostly derived from the bins_edit program itself, it is covered by the GNU General Public License. BINS(1)
All times are GMT -4. The time now is 10:33 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy