Hi, I wonder if anyone knows the answer to this...
If I use a database to allow users to update their content
and they copy and paste some text from a Word document, the
resulting webpage, when generated using ASP, keeps all of
Word's non-standard (SGML) character entities.
However, I need to validate my webpages, and if Word's
non-standard code is retained this won't be possible.
How do other people get around this problem? I have tried
saving Word files as .htm, .txt, .rtf and even .htm
(filtered), and it still keeps those characters in! I know
I can go online and 'clean' an HTML file created from Word,
but is there an easier way?
I want to allow users to maintain content easily, but not
at the expense of generating non-standard code.
Thanks
Steven
Hi Steve,
Can you provide a URL and some additional information?
I'm not sure what your definition of 'non-standard' is.
If the character entity codes are those in the http://w3.org
HTML v4 spec, then they comply with the standards, and an
HTML webpage validator should not have a problem with them.
--
Hope that helps,
Bob Buckland ?:-) MS Office Products family MVP
Steve - 15 Jul 2003 12:31 GMT
Hi, I can't give you a URL as I am testing internally, but I
am attempting to validate pages as XHTML 1.0 using text
copied and pasted from Word.
http://validator.w3.org/
The main problem is with punctuation such as apostrophes,
commas, hyphens, etc., which come up as non-SGML entities.
Such punctuation appears as strange rectangular characters.
Thanks
Steven
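
The symptoms described above (apostrophes, dashes and similar punctuation
flagged by the validator and shown as rectangles) are consistent with Word's
typographic punctuation arriving as Windows-1252 bytes in the range 128-159,
which the HTML 4 SGML declaration leaves unassigned. That diagnosis is an
assumption, since the thread never confirms the encoding. Under that
assumption, one possible clean-up step before the text is stored or rendered
is sketched below. It is written in Python purely for illustration (the pages
in question are generated with ASP, so the same mapping would need porting),
and the names clean_word_text and WORD_PUNCTUATION are hypothetical.

```python
# Illustrative sketch only, assuming the pasted text arrives as
# Windows-1252 (cp1252) bytes. Names here are hypothetical.

# Typographic characters Word commonly inserts, mapped to plain ASCII.
# The selection is an assumption; extend it to suit your content.
WORD_PUNCTUATION = {
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark / apostrophe
    "\u201a": ",",    # single low-9 quotation mark
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2013": "-",    # en dash
    "\u2014": "-",    # em dash
    "\u2026": "...",  # horizontal ellipsis
}


def clean_word_text(raw: bytes) -> str:
    """Decode pasted bytes as Windows-1252 and return ASCII-only text."""
    # errors="replace" turns the few undefined cp1252 bytes into U+FFFD.
    text = raw.decode("cp1252", errors="replace")
    for fancy, plain in WORD_PUNCTUATION.items():
        text = text.replace(fancy, plain)
    # Anything still outside ASCII becomes a numeric character reference
    # (for example U+2122 becomes &#8482;), which HTML 4 and XHTML 1.0 accept.
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


if __name__ == "__main__":
    print(clean_word_text(b"It\x92s \x93quoted\x94 text \x96 done\x85"))
```

Note that this sketch does not escape markup characters such as & or <; that
would be a separate step. If the typographic characters should be preserved
rather than flattened, the dictionary loop can be dropped so that every
non-ASCII character is emitted as a numeric reference instead.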