How to convert a web page into a PDF?
I found wkhtmltopdf project, which seems to do the trick. It's command line, so there is a bit of a learning curve, but not too bad.
Specifically to convert a web page, open a command window in the directory where wkhtmltopdf
was installed and execute the following:
wkhtmltopdf.exe http://www.yourpage.com/index.htm c:\misc\cnn.pdf
The application has a ridiculous amount of options designed to tweak the output as needed, but the defaults give a pretty good result.
CutePDF writer uses GhostScript for text processing and then ps2pdf to create searchable PDFs of web pages. This will of course not work if the text on the page is an image to start out with.
What's the problem with Print to PDF solutions? I've two virtual printers installed in my system: PDF Creator & Virtual Printer from Adobe Acrobat X. Both work fine. I can search text in generated PDFs easily as long as my PDF viewer has OCR capabilities (which is common these days).
If you are thinking about creating something like text and word document, you can't. Its limitation of PDF format or says its how PDF format work. Due to this, we can embed fonts etc. in this format without any dependency to make it universal.
And, I don't think wkhtmltopdf project can generate a PDF file in which text can be searched without using OCR technology (because it'll violate PDF specification).