Zombie hyperlinks keep returning from the dead

mpalotay@gmail.com - 16 Jun 2006 00:27 GMT
Hi,

I'm converting an html file into a Word doc by the simple expedient of
opening it in Word and doing a Save As. (Word 2000 on Windows 98, if
anyone cares.)

The trouble I'm having is with removing the hyperlinks from images. In
the html file, there are thumbnail images that link to the actual,
larger, image. In the word doc, I don't want the thumbnails to link to
anything. So I right-click the image and select Hyperlink - Remove
Hyperlink.

Then I change the wrapping from inline to square, and Ta-da! the
hyperlink is back.

I've tried using some of the "remove all hyperlinks" code, both that
provided by Microsoft and stuff I found on the Word MVP site. This
helps some, in that the hyperlinks don't always come back, at least not
immediately; but every fifth image or so will have re-acquired its
hyperlink when I get to it.

Has anybody encountered something like this? Is there anything I can
do, other than keep re-running the 'remove all hyperlinks' code?

The html file is very basic - I've posted a schematic of it below. It's
created by a magic spel[ahem, sorry] script of some sort; it doesn't
come anywhere near Word until I get to it, so Word's notoriously bad
html-writing-ability can't be faulted.

Please help...

Martha Palotay
mpalotay at yahoo dot com

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html><head><title>This is the title</title>
<style type=text/css> img {
    border-top-width: 0px; border-left-width: 0px; float: right;
border-bottom-width: 0px; border-right-width: 0px}
p.bib {
    font-size: 85%}
hr.bib {
    color: #000000; height: 4px}
</style>
<meta http-equiv=Content-Type content="text/html; charset=iso-8859-1">
<meta content="MSHTML 6.00.2900.2873" name=GENERATOR></head>
<body>
<p>Intro stuff</p>
<table>
 <tbody>
 <tr>
   <td>
     <hr>
     <a name=1></a><a name=fred></a>
     <p><a href="[path to full-size image]"><img
     alt="Fred Smith" src="[path to thumbnail image]"></a><b>1 Fred
Smith</b></p>
     <p>Stuff about Fred Smith's image</p> </td> </tr>
 <tr>
    <td>
      <hr>
       <a name=2></a><a name=joe></a>
       <p><a href="[path to full-size image]"><img alt="Joe Blow"
        src="[path to thumbnail image]"></a><b>2 Joe Blow</b></p>
        <p>Stuff about Joe Blow's image</p> </td> </tr> </tbody>
</table>
<hr class=bib>
<p><b>Bibliography</b></p>
<p class=bib><!--019--><b>Author; <em>"Article Title"</em>; (<a
href="[link to article]">link to article</a>).</b></p> </body> </html>
Klaus Linke - 17 Jun 2006 02:10 GMT
Hi Martha,

I don't know why the hyperlinks keep coming back, but if you open the HTML
source as a text file (as in the sample you posted), you could delete the
hyperlinks with a wildcard replacement:

Find what: \<a[^9-^13^32]{1;2}href=*\>
Replace with:

(Check "More > Match wildcards", and type nothing in "Replace with".)

Since the angle brackets < and > have a special meaning in wildcard
searches, you need to put a backslash in front of them.
The wildcard * matches any text.
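The [^9-^13^32]{1;2} allows one or two whitespace characters at that place:
a space, or a return (which might consist of the two characters ^13 and ^10).
If the list separator in your regional settings is a comma rather than a
semicolon, type {1,2} instead of {1;2}.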

Regards,
Klaus
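
For anyone who would rather clean the HTML before Word ever opens it, the same idea can be sketched with Python's re module. This is only an illustration under assumptions, not something from the thread: the helper name unlink_images and the sample paths are made up, and the pattern assumes the simple markup shown in the original post. It differs from the wildcard replacement above in two ways: it also removes the matching </a>, and it only touches links whose sole content is an image, so the <a name=...> anchors and the bibliography link stay as they are.

```python
import re

# An <a ... href=...> element whose only content is a single <img> tag.
# \s+ after "<a" allows a space or a line break, much like
# [^9-^13^32]{1;2} in the Word wildcard pattern.
LINKED_IMG = re.compile(
    r"<a\s+[^>]*?href\s*=[^>]*>\s*(<img\b[^>]*>)\s*</a>",
    re.IGNORECASE,
)


def unlink_images(html: str) -> str:
    """Replace each linked thumbnail with the bare <img> tag (hypothetical helper)."""
    return LINKED_IMG.sub(r"\1", html)


# Made-up sample modelled on the schematic in the original post.
sample = (
    '<a name=1></a><a name=fred></a>\n'
    '<p><a href="full/fred.jpg"><img\n'
    'alt="Fred Smith" src="thumb/fred.jpg"></a><b>1 Fred Smith</b></p>\n'
    '<p>(<a\nhref="article.html">link to article</a>)</p>'
)

cleaned = unlink_images(sample)
print(cleaned)
```

A regular expression is enough here because the script-generated file is so regular; for less predictable HTML a real parser would be the safer choice.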
