Jochen Hayek's Blog: "pdftohtml" vs. DRM

Tuesday, February 8, 2011

"pdftohtml" vs. DRM

A project of mine involves extracting strings and other details from PDF files using "pdftohtml -xml".

A plain "pdftohtml -xml" refuses to read PDF files that have copy-protection bits set. If you add "-nodrm" on the command line, it reads them anyway, though it still reports the problem on STDERR.
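For scripting this, here is a minimal Python sketch of the two invocations. It assumes a "pdftohtml" binary that understands "-nodrm" is on the PATH; the helper names build_command and convert are hypothetical and only serve this illustration.

```python
import subprocess


def build_command(pdf_path, ignore_drm=False):
    """Build the argument list for a "pdftohtml -xml" run (hypothetical helper)."""
    cmd = ["pdftohtml", "-xml"]
    if ignore_drm:
        # Ask pdftohtml to read the file despite its copy-protection bits.
        cmd.append("-nodrm")
    cmd.append(str(pdf_path))
    return cmd


def convert(pdf_path, ignore_drm=False):
    """Run pdftohtml and return its exit code plus whatever it wrote to STDERR."""
    result = subprocess.run(
        build_command(pdf_path, ignore_drm),
        capture_output=True,
        text=True,
    )
    return result.returncode, result.stderr
```

Calling convert("protected.pdf", ignore_drm=True) would run the "-nodrm" variant; the returned STDERR text is worth logging, since that is where the copy-protection complaint shows up.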

