Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
DiscussionsAccessExcelInfoPathOutlookPowerPointPublisherWord
DirectoryUser Groups
Related Topics
Outlook ExpressInternet ExplorerWindowsMS Server ProductsMore Topics ...

MS Office Forum / Outlook / Programming Add-Ins / May 2006

Tip: Looking for answers? Try searching our database.

duplicate emails

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
Martin - 18 May 2006 17:56 GMT
Hi all,

I noticed some relevant posts in this group, so thought I'd post my
query here.

I have two MSG files, both exported from PST files. They are the same
message, same sent time, same sent date, To, From, CC, BCC, Importance
etc. In fact all the fields are the same.

However, I am trying to de-duplicate these two messages using an MD5's
based partially on the body of the email.

Viewing the source of the messages (the examples are HTML messages) and
comparing them using a diffing tool reveals no differences, but viwing
the text version of the message, in one there is an extra carriage
return.

I think this could be due to different versions of outlook, or
something that has happened to the messages along the way, but I am
unsure how to tell. Does anyone have any experience of this particular
problem?

Many thanks,

Martin.
Dmitry Streblechenko - 18 May 2006 19:50 GMT
Don't use any kind of hash to compare MSG files. Would you consider two
message different if one MSG file stores first the sender name, then the
subject and the second MSG file first stores the subject then the sender
name? To an end user, the order in which thee properties are stored is
irrelevant, but changing the order throws any hashing off.
You need to define "sameness" of 2 MSG files. If I were you, I'd extract the
properties that you care about (minus trailing carriage returns, etc) then
compare them either separately or as a concatenated string.

Dmitry Streblechenko (MVP)
http://www.dimastr.com/
OutlookSpy  - Outlook, CDO
and MAPI Developer Tool

> Hi all,
>
[quoted text clipped - 21 lines]
>
> Martin.
Brana Bujenovic - 21 May 2006 13:14 GMT
> However, I am trying to de-duplicate these two messages using an MD5's
> based partially on the body of the email.
[quoted text clipped - 8 lines]
> unsure how to tell. Does anyone have any experience of this particular
> problem?

Hello:

Try to compare just the message-ids, if present.

HTH

Signature

Brana's E-Mail Gadgets
http://brana.bujenovic.ns.users.sbb.co.yu/

 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2008 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.