Date: Mon, 6 May 2002 16:17:11 -0400
Reply-To: "Huang, Ya" <ya.huang@PFIZER.COM>
Sender: "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From: "Huang, Ya" <ya.huang@PFIZER.COM>
Subject: Re: Adobe as Input
Content-Type: text/plain
If you mean an Adobe PDF file, try opening that file with
a text editor such as Notepad or TextPad. If you are lucky
and the PDF was created by an application that did not use
encryption, you may find your "static information" buried
in the text. If you are luckier still and that static
information happens to be a string with a fixed prefix,
then you can just read in the file with a single
data _null_ step and parse the text to find the data you want.
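The read-and-parse idea can be sketched as follows. This is only an illustration in Python rather than a SAS data step, and it assumes the wanted values sit in readable text right after a fixed prefix. The function names and the "Account:" prefix are hypothetical, chosen for the example; the same line-by-line search would carry over to a data _null_ step.

```python
import re
from pathlib import Path


def read_raw_lines(path):
    """Read the file as bytes and decode as Latin-1, so every byte
    maps to some character and binary sections cannot raise errors."""
    data = Path(path).read_bytes()
    return data.decode("latin-1").splitlines()


def find_prefixed_values(lines, prefix):
    """Return the text that follows `prefix` on each line, up to the
    next ')' (the end of a PDF string literal) or the end of the line."""
    pattern = re.compile(re.escape(prefix) + r"([^)]*)")
    values = []
    for line in lines:
        for match in pattern.finditer(line):
            values.append(match.group(1).strip())
    return values


# Hypothetical sample line; "Account:" is an assumed prefix, not from the post.
sample = ["BT /F1 8 Tf 36 522 Td (Account: 12345) Tj ET"]
print(find_prefixed_values(sample, "Account:"))  # prints ['12345']
```

For a real file, the lines would come from read_raw_lines("yourfile.pdf"), where the file name is a placeholder. If nothing is found, the text is probably encrypted or otherwise not stored in readable form.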
A non-encrypted PDF file, when opened with a text editor,
should look something like this:
%PDF-1.0
1 0 obj << /Type /Catalog /Pages 6 0 R /Outlines 2 0 R >> endobj
2 0 obj << /Type /Outlines /Count 0 >> endobj
3 0 obj << /Type /Font /Subtype /Type1 /Name /F1
/BaseFont /Courier /Encoding /StandardEncoding >> endobj
4 0 obj << /Type /Font /Subtype /Type1 /Name /F2
/BaseFont /Helvetica-Bold /Encoding /StandardEncoding >> endobj
5 0 obj [ /PDF /Text ] endobj
6 0 obj << /Type /Pages /Count 6 /Kids [
7 0 R
9 0 R
11 0 R
13 0 R
15 0 R
17 0 R
] >> endobj
7 0 obj << /Type /Page /Parent 6 0 R /Resources
<< /Font << /F1 3 0 R /F2 4 0 R>> /ProcSet 5 0 R >>
/MediaBox [ 0 0 792 612 ] /Contents 8 0 R >> endobj
8 0 obj << /Length 5067 >>
stream
BT /F2 128 Tf 200 300 Td 0 Tr 0.9 g (DRAFT)' ET BT /F1 8 Tf 8 TL 36 522 Td 0
Tr 0 g
.....
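In a dump like the one above, the visible text sits in parentheses between the BT and ET operators, as with (DRAFT). A rough Python sketch of pulling those strings out is given here; it assumes the content stream is readable as plain text, and the names TEXT_BLOCK, LITERAL and text_strings are made up for illustration. It does not handle nested unescaped parentheses, and it returns escape sequences unprocessed.

```python
import re

# A text object runs from the BT operator to the matching ET operator.
TEXT_BLOCK = re.compile(r"\bBT\b(.*?)\bET\b", re.DOTALL)
# A string literal: "(", then escaped characters or non-parenthesis characters, then ")".
LITERAL = re.compile(r"\(((?:\\.|[^\\()])*)\)")


def text_strings(content):
    """Return the raw string literals found inside BT ... ET blocks."""
    found = []
    for block in TEXT_BLOCK.finditer(content):
        for literal in LITERAL.finditer(block.group(1)):
            found.append(literal.group(1))
    return found


# First text object from the dump above.
sample = "BT /F2 128 Tf 200 300 Td 0 Tr 0.9 g (DRAFT)' ET"
print(text_strings(sample))  # prints ['DRAFT']
```

Restricting the search to BT ... ET blocks avoids picking up parenthesized values that appear elsewhere in the file, outside the page text.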
HTH
Ya Huang
-----Original Message-----
From: Jenine Eason [mailto:jenine.eason@AUTOTRADER.COM]
Sent: Monday, May 06, 2002 10:39 AM
To: SAS-L@LISTSERV.UGA.EDU
Subject: Adobe as Input
Hello you wonderful SAS-L folks!
I'm needing to find a way to read in Adobe files as input. Specifically, I
have a 900 page Adobe file that I need to extract a few bits of static
information off each page and store it as SAS data. I'm not finding much
information on Adobe as an input. Any suggestions?