Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
DiscussionsAccessExcelInfoPathOutlookPowerPointPublisherWord
DirectoryUser Groups
Related Topics
Outlook ExpressInternet ExplorerWindowsMS Server ProductsMore Topics ...

MS Office Forum / Word / Page Layout / October 2009

Tip: Looking for answers? Try searching our database.

How to extract raw text from columns

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
Dave Miles - 27 Oct 2009 16:26 GMT
I have a word doc that the author created columns in and I need to get
the raw text.  If I save it as txt the formatting gets messed up.

When I look at the page (or print it) I see somthing like:

Date: xx/xx/xx           Name: Fred
Time: xx:xx               Occupation: Tech Support

When I save as text, or select, copy & paste in notepad, I see
something like:

Date: xx/xx/xx           Name:
Time: xx:xx               Occupation:
Fred
Tech Support

I have a lot more info on the page which makes it impossible for me to
parse it out.  Is there a way to just remove the columns and preserve
the same text on the same line?

Thanks!

Signature

Dave Miles

macropod - 27 Oct 2009 17:58 GMT
Hi Dave,

Have you tried Tabel|Convert|Table to Text?

Signature

Cheers
macropod
[Microsoft MVP - Word]

> I have a word doc that the author created columns in and I need to get
> the raw text.  If I save it as txt the formatting gets messed up.
[quoted text clipped - 17 lines]
>
> Thanks!
Dave Miles - 28 Oct 2009 00:24 GMT
It's not a table so the option is not avail to me :(

> Hi Dave,
>
[quoted text clipped - 23 lines]
> >
> .
Doug Robbins - Word MVP - 28 Oct 2009 00:54 GMT
Send me a copy of the document to look at.

Signature

Hope this helps

Doug Robbins - Word MVP
Please reply only to the newsgroups unless you wish to avail yourself of my
services on a paid, professional basis.

> It's not a table so the option is not avail to me :(
>
[quoted text clipped - 25 lines]
>> >
>> .
macropod - 28 Oct 2009 06:54 GMT
So what sort of column arrangement are you using? And how do you keep the items aligned?

Signature

Cheers
macropod
[Microsoft MVP - Word]

> It's not a table so the option is not avail to me :(
>
[quoted text clipped - 25 lines]
>> >
>> .
Doug Robbins - Word MVP - 28 Oct 2009 07:15 GMT
Hi Paul,

Dave sent me one of the documents and I believe that it may have been
produced via OCR.

I am sending him the following response:

You can clean up the document a lot by using Edit>Replace to first replace
^b with nothing to remove all of the Section Breaks, then ^n with nothing to
remove the column breaks, then use Ctrl+A to select everything and use the
Format Paragraph dialog to set the paragraph indents to 0 and the Special
Indent to None.  Then use Edit>Replace again to replace ^t with ^p.

A macro could be written to perform all of the above and to further process
the documents (assuming that you have many to do), you could create a list
of the attributes for which you want to extract the values, and then use
this in a macro that iterated through that list and then inserted a tab
after each attribute.  If you then used Convert Text to Table, you would
have most of the information in a two column table with the attributes in
the first column and the values in the second column.  There would be a few
exceptions such as the addresses and a bit more attention would need to be
paid to the Loan Details section

With a bit of work however, and depending upon how similar the documents are
and what you want as the final result, it should be possible to create some
code that would do a fairly complete job of parsing the data from the
document.

Signature

Hope this helps

Doug Robbins - Word MVP
Please reply only to the newsgroups unless you wish to avail yourself of my
services on a paid, professional basis.

> So what sort of column arrangement are you using? And how do you keep the
> items aligned?
[quoted text clipped - 27 lines]
>>> >
>>> .
Dave Miles - 29 Oct 2009 03:21 GMT
Hey Doug & Paul,

I think the docs may be generated by Access.  I understand that the source
comes in in Excel and the reports are generated from that.  Yes, the simple
answer would be to work from the Excel sheets  but they contain more data
than I license so I have to take what I get......sad but true :(

> Hi Paul,
>
[quoted text clipped - 57 lines]
>
> .
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2010 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.