process ma.gnolia_bookmarks for Diigo so that multiwords tags are preserved


 
Thread Tools Search this Thread
Top Forums Shell Programming and Scripting process ma.gnolia_bookmarks for Diigo so that multiwords tags are preserved
# 1  
Old 02-08-2009
process ma.gnolia_bookmarks for Diigo so that multiwords tags are preserved

Hi everyone.

I was lucky enough to backup all my bookmarks last xmas before the
recent complete meltdown of ma.gnolia.com

Unfortunately, when submitting the file to my new diigo account, all
multiword tags are converted to multiple single word tags which is not very
helpful for anyone searching through my tags.

Now I'm wondering how I can process the html file that contains
the bookmarks I managed to salvage (6500) using a combination
of akw and sed I suppose so I can adapt
the inner metadata as to preserve the tags integrity during the import into diigo.

Here is a sample of what the file contains:
Code:
<dt><A HREF="http://max.jsrhost.com/ajaxify/demo.php"
ADD_DATE="1230944239" LAST_MODIFIED="1230944239" 
TAGS="jquery, ajax, web devel tools">Ajaxify - jQuery plugin</A></dt>
<dd>Ajaxify is a jquery plugin that manage your ajax request</dd>

Using the previous code example, in TAGS="(.*?)"
  1. I need to replace the strings of 2 caracters: comma space
    with a single comma.
    Code:
    TAGS="jquery, ajax, web devel tools"

    becomes
    Code:
    TAGS="jquery,ajax,web devel tools"

  2. Then, in the same field, replace the remaning spaces with the underscore caracter.
    Code:
    TAGS="jquery,ajax,web devel tools"

    becomes
    Code:
    TAGS="jquery,ajax,web_devel_tools"

How would I do this using my linux shell?
I'd the like to share my findings with my fellow ma.gnoliers

Many thanks in advance

Jeunium
___________________________________
Amanita Muscaria
Is a man eater that must scare ya
Smilie
# 2  
Old 02-08-2009
Try sed:
Code:
sed 's/, /,/g' file > newfile

# 3  
Old 02-08-2009
Thanks for your answer danmero but wouldn't this do the replacement in all the file?

I want the replacements to occur only within the TAGS tag of each hyperlinks of my input file.

Thanks again for your time danmero

Jeunium
___________________________________
Amanita Muscaria
Is a man eater that must scare ya
Smilie
# 4  
Old 02-08-2009
Sorry , I overseen the second requirement Smilie , try awk:
Code:
awk '{FS=OFS="\"";gsub(", ",","$2);gsub(" ","_",$2)}1' file > newfile

# 5  
Old 02-08-2009
You are very generous denmero

I greatly appreciate your time and efforts on this but,

Where, in your command does it

limit the "search and replace" to only the TAGS=",,,,," part?

Code:
<dt><A HREF="http://max.jsrhost.com/ajaxify/demo.php"
ADD_DATE="1230944239" LAST_MODIFIED="1230944239" 
TAGS="jquery, ajax, web devel tools">Ajaxify - jQuery plugin</A></dt>
<dd>Ajaxify is a jquery plugin that manage your ajax request</dd>

Jeunium
___________________________________
Amanita Muscaria
Is a man eater that must scare ya
Smilie
# 6  
Old 02-08-2009
Smilie
Code:
awk  '{FS=OFS="\""}/TAGS/{gsub(", ",",",$2);gsub(" ","_",$2)}1' file > newfile


Last edited by danmero; 02-09-2009 at 10:01 AM.. Reason: Fix code
# 7  
Old 02-09-2009
I think we're on to something here but it doesn't quite work yet.

Quote:
<A HREF="http://cssjuice.com/13-online-generators-for-web-20-design/" ADD_DATE="1186424504" LAST_MODIFIED="1186424477"
TAGS="web devel tools, online generators">CSS Juice - Design, Tutorial, Showcase and more » 13 Online Generators for Web 2.0 Design</A>
becomes

Code:
<A HREF="http://cssjuice.com/13-online-generators-for-web-20-design/" 
ADD_DATE="1186424504" LAST_MODIFIED="1186424477" 
TAGS="web devel tools,http://cssjuice.com/13-online-generators-for-web-20-design/online generators">
CSS Juice - Design,http://cssjuice.com/13-online-generators-for-web-20-design/Tutorial,
http://cssjuice.com/13-online-generators-for-web-20-design/Showcase and more » 13 Online Generators for Web 2.0 Design</A>

Let's remember that both substitutions must occur within the same
Quote:
TAGS="web devel tools, online generators"
which in this case should become
Quote:
TAGS="web_devel_tools,online_generators"
The substitions must occur ONLY within the TAGS="...." AND NOT on the whole line !!!

Thanks again danmero for your time.

Jeunium
___________________________________
Amanita Muscaria
Is a man eater that must scare ya
Smilie

Last edited by jeunium; 02-09-2009 at 12:46 AM..
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Command to get exact tomcat process I am running ignoring other java process

Team, I have multiple batchjobs running in VM, if I do ps -ef |grep java or tomcat I am getting multiple process list. How do I get my exact tomcat process running and that is unique? via shell script? (4 Replies)
Discussion started by: Ghanshyam Ratho
4 Replies

2. Shell Programming and Scripting

Monitoring processes in parallel and process log file after process exits

I am writing a script to kick off a process to gather logs on multiple nodes in parallel using "&". These processes create individual log files. Which I would like to filter and convert in CSV format after they are complete. I am facing following issues: 1. Monitor all Processes parallelly.... (5 Replies)
Discussion started by: shunya
5 Replies

3. UNIX for Advanced & Expert Users

Process remians in Running state causing other similar process to sleep and results to system hang

Hi Experts, I am facing one problem here which is one process always stuck in running state which causes the other similar process to sleep state . This causes my system in hanged state. On doing cat /proc/<pid>wchan showing the "__init_begin" in the output. Can you please help me here... (6 Replies)
Discussion started by: naveeng
6 Replies

4. UNIX for Advanced & Expert Users

Process remians in Running state causing other similar process to sleep and results to system hang

Hi Experts, I am facing one problem here which is one process always stuck in running state which causes the other similar process to sleep state . This causes my system in hanged state. On doing cat /proc/<pid>wchan showing the "__init_begin" in the output. Can you please help me here... (1 Reply)
Discussion started by: naveeng
1 Replies

5. BSD

Process remians in Running state causing other similar process to sleep and results to system hang

Hi Experts, I am facing one problem here which is one process always stuck in running state which causes the other similar process to sleep state . This causes my system in hanged state. On doing cat /proc/<pid>wchan showing the "__init_begin" in the output. Can you please help me here... (0 Replies)
Discussion started by: naveeng
0 Replies

6. UNIX for Dummies Questions & Answers

Formatting not preserved when using mailx

I tried to send email through mailx, the text file is well formatted when i open in unix, but when received the mail in outlook, the column do not align with the heading. The text file contain the tablespace details which I spool out from database. Please advise. Do I need to specify any parameter... (4 Replies)
Discussion started by: fenocean
4 Replies

7. Shell Programming and Scripting

script to monitor the process system when a process from user takes longer than 15 min run.

get email notification from from system when a process from XXXX user takes longer than 15 min run.Let me know the time estimation for the same. hi ,any one please tell me , how to write a script to get email notification from system when a process from as mentioned above a xxxx user takes... (1 Reply)
Discussion started by: kirankrishna3
1 Replies

8. Shell Programming and Scripting

Shell Script to Kill Process(number of process) Unix/Solaris

Hi Experts, we do have a shell script for Unix Solaris, which will kill all the process manullay, it used to work in my previous env, but now it is throwing this error.. could some one please help me to resolve it This is how we execute the script (and this is the requirement) ... (2 Replies)
Discussion started by: jonnyvic
2 Replies

9. Shell Programming and Scripting

script to monitor process running on server and posting a mail if any process is dead

Hello all, I would be happy if any one could help me with a shell script that would determine all the processes running on a Unix server and post a mail if any of the process is not running or aborted. Thanks in advance Regards, pradeep kulkarni. :mad: (13 Replies)
Discussion started by: pradeepmacha
13 Replies

10. Shell Programming and Scripting

Script - How to automatically start another process when the previous process ends?

Hi all, I'm doing automation task for my team and I just started to learn unix scripting so please shed some light on how to do this: 1) I have 2 sets of datafiles - datafile A and B. These datafiles must be loaded subsequently and cannot be loaded concurrently. 2) So I loaded datafile A... (10 Replies)
Discussion started by: luna_soleil
10 Replies
Login or Register to Ask a Question