Re: [Thinkpad] Word Viewer

From: Julian Thomas <jt_at_jt-mj.net>
Date: Tue Aug 31 2004 - 14:40:42 EDT

In <cyLNB5mipJOe092yn@islandnet.com>, on 08/31/04
   at 09:59 AM, jberry@islandnet.com (Jonathan Berry) typed:

>--Open the file in a hex editor or a programmer's editor or maybe even
>Notepad. There will be huge gobs of formatting 8-bit stuff at beginning
>and end, but in all likelihood
>the plain text will be right there in the middle.

It is. Depending on which version of Word produced the document, the meat
begins at either 0600 hex or 0A00 hex and ends when there are a number of
null characters in a row.

In between, there may be isolated null characters that I replace with
spaces, and hex '0d' should be replaced with '0d0a' (CRLF).

I have a rexx program (os2) that does the job; I would need to modify it
because it also converts a bunch of special codes (>128) into the html
equivalent - for instance:

ascii 233=>"&eacute;" /* Ú lowercase e, acute accent é */ which is
of interest only if the word document is being made into a web page.

If anyone wants it, send me email off list; if there's a lot of demand
I'll put it on my web site.

 

-- 
 Julian Thomas:   jt_at_jt-mj.net    http://jt-mj.net
 In the beautiful Finger Lakes Wine Country of New York State!
 Boardmember of POSSI.org - Phoenix OS/2 Society, Inc  http://www.possi.org
 -- --
 COFFEE.EXE Missing - Insert Cup and Press Any Key
_______________________________________________
Thinkpad mailing list
Thinkpad@stderr.org
http://stderr.org/cgi-bin/mailman/listinfo/thinkpad
Received on Tue Aug 31 14:53:47 2004

This archive was generated by hypermail 2.1.8 : Fri May 26 2006 - 16:03:26 EDT