[comp.sys.mac] Microsoft Word parser?

dae@shire.cs.psu.edu (Dave Eckhardt) (05/24/89)

I am faced with a large file in Microsoft Word format.  I need to be
able to turn this into a text file, but with some degree of control.
In other words, I need to be able to do something like turning all
italicized letters into upper case.  Best would be a piece of code that
would call a user-provided routine with various tokens (like
enter_italic or footnote) so that I could write back ends for various
purposes.

Since I don't have any version of C or Pascal for the Mac (I'm used to
Unix, where the compilers are free, sigh), I'd need either pointers to
a good cheap langauge implementation or code that could be made to run
under a non-Mac environment.

Any suggestions would be useful; if you have any or want to know what
I've found out, please send me mail directly.  Thanks.

--Daemon

chuq@Apple.COM (Chuq Von Rospach) (05/25/89)

>I am faced with a large file in Microsoft Word format.  I need to be
>able to turn this into a text file, but with some degree of control.
>In other words, I need to be able to do something like turning all
>italicized letters into upper case.

there's nothing I know of that'll do this for you. To do it quickly and
inexpensively, I'd save the Word file in RTF format and then put together a
Hypercard Script to massage it. It should be simple to look at the RTF 
output and decide what information you need to manipulate and what you want
to throw away.



Chuq Von Rospach      =|=     Editor,OtherRealms     =|=     Member SFWA/ASFA
         chuq@apple.com   =|=  CI$: 73317,635  =|=  AppleLink: CHUQ
      [This is myself speaking. No company can control my thoughts.]

This is....The Voice....of USENET....in special English. 1300UTC on 11525. 

gdavis@primate.wisc.edu (Gary Davis) (05/26/89)

From article <31580@apple.Apple.COM>, by chuq@Apple.COM (Chuq Von Rospach):
>>I am faced with a large file in Microsoft Word format.  I need to be
>>able to turn this into a text file, but with some degree of control.
>>In other words, I need to be able to do something like turning all
>>italicized letters into upper case.
> 
> there's nothing I know of that'll do this for you. To do it quickly and
> inexpensively, I'd save the Word file in RTF format and then put together a
> Hypercard Script to massage it. It should be simple to look at the RTF 
> output and decide what information you need to manipulate and what you want
> to throw away.
> 
> 
> 
> Chuq Von Rospach      =|=     Editor,OtherRealms     =|=     Member SFWA/ASFA
>          chuq@apple.com   =|=  CI$: 73317,635  =|=  AppleLink: CHUQ
>       [This is myself speaking. No company can control my thoughts.]
> 
The word processor Nisus will do this kind of thing quickly and easily.
Of course, it won't be inexpensive if you have to buy Nisus. But if you
do this kind of thing often, it might be worth the expense. Besides its
a great word processor, the best in my opinion.


Gary Davis













> This is....The Voice....of USENET....in special English. 1300UTC on 11525.