I wasn't sure what category to post this in.
So I downloaded and opened this file, and was informed it had a form in it that wasn't supported in my reader:
I ran it through peepdf and it shows:
/Names: [1143, 1146]
/JS: [1147, 1148, 1149]
Now, I've not done any further analysis to know if there's actually anything malicious in there (although it is a little suspicious for real scans of a book and not e.g. pirated ebooks that might come with some kind of tracking or DRM), but it got me wondering: What's the policy here when it comes to potential PDF exploits?
I didn't see a way to report files (if there is one), and archive.org provides JP2 images in a zip file for anyone wanting to avoid PDF altogether (although many readers you'd have to convert the images first). I've gotten plenty of books from the generous uploaders here that are clean (either in safe formats or running through peepdf shows nothing suspicious, and the last discovery of malware on one of my machines was a long time ago). Are PDF files scanned in some way before uploading, or is anyone allowed to dump PDF files onto the site like if you were to look elsewhere?
Sorry if this is a somewhat strange topic. This is a problem I see all over when it comes to downloading PDF files, and probably a hard one to address. I guess it is just download at your own risk, keep your software up-to-date, and run a trusted anti-virus? I never worried about PDF exploits until recently, since I wasn't aware of what they could do. I should probably look into converting my collection to a safer format in the future, but PDF seems like such a complicated format to me that I would be worried about loss of quality or unnecessary bloating of file size.
Not trying to scare anyone by the way! Just curious what you all think. Like I said, I don't know a lot about the PDF format.