09-29-2014
Quote: Originally Posted by cmccabe
Is it possible in awk to parse a webpage (EDAR Gene Sequencing - Genetic Testing Company | The DNA Diagnostic Experts | GeneDx)? The source code is attached.
HTML Code:
<title> EDAR Gene Sequencing
<dt>Test Code:</dt>
<dd>156 </dd>
<dt>Turnaround Time:</dt>
<dd>6-8 weeks </dd>
<dt>Preferred Specimen:</dt>
<dd>2-5 mL Blood - Lavender Top Tube </dd>
<dt>CPT Codes:</dt>
<dd>81479x1</dd>
<ul id="clinical-utility">
<li>Confirmation of a clinical diagnosis </li>
<li>Differentiation between X-linked and autosomal forms of the disease </li>
<li>Prenatal diagnosis in at-risk pregnancies</li>
<ol id="references">
<li>Bal, E et al. Hum Mutat. 28:703-709, 2007.</li>
<li>Headon et al. Nature. 414:913-916, 2001.</li>
<li>Monreal et al. Nat Genet 22:366-369, 1999.</li>
<li>Chassaing et al. Hum Mutat. 27(3):255-259, 2006</li>
The <.....> tags are not needed, only the text, if that is possible. Thanks.
Can you post what the desired output should look like...
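In the meantime, here is a minimal sketch, assuming the goal is simply one line of text per element with the <.....> tags removed. The function name strip_tags is made up for illustration, and it only handles tags that open and close on the same line, as they do in the excerpt above; awk is not a real HTML parser, so treat this as a starting point rather than a general solution.

```shell
# strip_tags (hypothetical helper): print the text of each line with markup removed.
# Assumption: every <...> tag starts and ends on the same line.
strip_tags() {
  awk '{
    gsub(/<[^>]*>/, "")          # drop every <...> tag on the line
    gsub(/^[ \t]+|[ \t]+$/, "")  # trim leading and trailing blanks
    if ($0 != "") print          # skip lines that held only markup
  }' "$@"
}
```

It reads the files given as arguments, or standard input if there are none, so it could be run as strip_tags edar.html (edar.html being a placeholder name for the saved page source). Lines such as <ul id="clinical-utility"> contain no text and are dropped entirely.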