• Re: Debian repository -- any well documented PDF to HTML converters?

    From Yassine Chaouche@21:1/5 to All on Wed Mar 19 15:00:02 2025
    Le 3/19/25 à 14:52, Richard Owlett a écrit :
    [...]
    I has a *problem*. The PDF can collapse sub-points. The HTML *cannot*.

    I think I saw a javascript code on emacswiki that does that for html exports from org-mode documents,
    but sorry, I didn't take any notes about it...

    Best,

    --
    yassine -- sysadm
    http://about.me/ychaouche
    Looking for side gigs.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Owlett@21:1/5 to All on Wed Mar 19 15:00:02 2025
    I've just received a 68 page PDF document. I don't know proper
    terminology but visually it resembles an outline in many of it's items
    are multiple paragraphs. The points are "collapsible". When everything
    is collapsed, the document index is 9 lines.

    Due to vision/perception issues I find it more comfortable to work wit
    HTML documents than PDF. As my default OS is Debian 12.8, I used
    Synaptic to search for a conversion utility.
    Found that poppler-utils { supplying pdftohtml } was installed.
    Used a sparse man-page [ https://manpages.debian.org/bookworm/poppler-utils/pdftohtml.1.en.html ]
    to make a generally good test run.

    The Poppler Wiki describes the situation as:
    There is currently very little documentation for the intenal library.

    Is there a Debian package whose HTML output can collapse sub-points when
    the input PDF does so?

    TIA




    I has a *problem*. The PDF can collapse sub-points. The HTML *cannot*.
    Went looking for more complete documentation than the man-page.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Owlett@21:1/5 to Yassine Chaouche on Wed Mar 19 16:00:01 2025
    On 3/19/25 8:59 AM, Yassine Chaouche wrote:
    Le 3/19/25 à 14:52, Richard Owlett a écrit :
    [...]
    I has a *problem*. The PDF can collapse sub-points. The HTML *cannot*.

    I think I saw a javascript code on emacswiki that does that for html
    exports from org-mode documents,
    but sorry, I didn't take any notes about it...

    Best,


    A quick DuckDuckGo search of https://www.emacswiki.org didn't find it.
    However a more general search found hints at https://www.w3schools.com .

    I keep saying its time to increase my HTML skill set ;}

    Thanks

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Yassine Chaouche@21:1/5 to All on Thu Mar 20 11:40:01 2025
    Le 3/19/25 à 15:50, Richard Owlett a écrit :
    A quick DuckDuckGo search of https://www.emacswiki.org didn't find it.

    I changed the search parameters a bit: javascript org-mode folding
    First google hit: https://orgmode.org/manual/JavaScript-support.html
    First link in that page: https://orgmode.org/org-info.js

    But again,
    this is for org documents...
    A possible workflow is to transform the PDF into text,
    then define headers in the generated text,
    then use org-mode with the provided script to generate a foldable HTML out of that (org-mode) text document.


    Or you could feed the js file to AI and ask it to adapt it in ways that suite your needs.
    I hear blackbox and copilot are good at dealing with code,
    but YMMV as usual.


    Best,


    --
    yassine -- sysadm
    http://about.me/ychaouche
    Looking for side gigs.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)