On 09/11/2024 22:36, Paul wrote:
On Sat, 11/9/2024 2:33 PM, Mickey D wrote:
https://artsandculture.google.com/story/the-last-voyage-of-sir-john-franklin-derbyshire-record-office/jAVxEd0YGOfbJQ?hl=en
All I wanted to do was print the web page to PDF.
But nothing results except an empty page with a header & footer.
What am I doing wrong?
The page is not designed to be printed.
I'd put it the other way round. The page is designed NOT to be printed.
The "venetian-blind" presentation of text and image involves Javascript
that cannot be represented in print (which is what PDF tries to do).
The only solution is to download the HTML, Tidy it to XML, and then
write some XSLT to pull it apart and isolate
image-text-image-text-image-tex, etc, then reformat it for (eg) LaTeX to
print it.
Even Vivaldi's "Capture Page" feature fails to work.
The page was created by Google for Derbyshire Record Office. They should
be ashamed of themselves. Maybe a polite email (or better, a physical
paper letter sent by Recorded Delivery) would get a reply from them.
Peter
"Save As, Web Page Complete" will not work. If you watch,
the transfer during that, even has some trouble with the
static files, let alone anything else.
Some of the pages are delivered by AJAX or Fetch. They put
up a background image first, as part of "Discovery" and
"Making the user wait until the foreground image appears".
Then the foreground image is acquired via AJAX.
The PDF printer is intolerant of these behaviors and
every time you print, the page will be missing different
things. The PDF print will be a shambles, and not even
remotely the correct length.
You can't cheat by using a large-sized virtual screen.
Won't help.
If you open in Seamonkey Composer, using the URL, all the
pictures are missing.
Don't waste your time, in other words.
Paul
--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)