• Curious about "AI" PCs, and NPMs

    From Philip Herlihy@21:1/5 to All on Sun Jun 22 12:32:59 2025
    Although my 12?-year-old Dell PC is still performing beautifully, I
    figure I'll need to replace it around October. (Yes, I'll likely
    convert it to Linux, but I do want a Windows PC.)

    I'm seeing a lot of references to "AI" PCs and laptops, which I
    understand have a Neural Processing Module (NPM). I'm finding it hard
    to get these in perspective. I do increasingly use AI - Perplexity
    generated a spectacularly good Formal Objection to a complex planning application recently, and it's a good place to start with a lot of
    queries. But I'm happy with the online services (good fibre broadband!)
    so I'm not clear what benefit an NPM would bring me. Of course I'm an
    end user rather than a developer (mostly) these days, but I sense I'm
    just scratching the surface.

    However, I don't want to be left out! I had been thinking based on my
    great experience with my current Dell that I'd buy a beefy workstation
    for the next ten years or so, but this technology is just emerging, and
    I'm rather wondering why NPMs seem to be built-in rather than
    replaceable. I'd certainly like decent graphics capability - I
    sometimes use a 3D CAD package which renders architectural and interior designs, though I don't need "real-time" rendering (happy to wait while
    it chugs away).

    Any thoughts, wise friends?


    --
    --
    Phil, London

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Andy Burns@21:1/5 to Philip Herlihy on Sun Jun 22 13:26:03 2025
    Philip Herlihy wrote:

    I'm seeing a lot of references to "AI" PCs and laptops, which I
    understand have a Neural Processing Module (NPM).
    I make only very light use of AI, it's not bad for searching API
    documentation, and throwing together example code snippets, though it
    makes all the same bugs as a wet behind the ears programmer.

    Beware if you do get an "AI PC" it will likely enable "Windows Recall"
    taking and OCR'ing screenshots of your machine every few seconds.

    I'm happy enough running AI chats on someone else's tin, so don't have
    any NPU-equipped machines, even so I've disabled/uninstalled that
    feature, juuuust in case ...

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Philip Herlihy@21:1/5 to All on Sun Jun 22 14:10:46 2025
    In article <1038us1$hnq6$[email protected]>, [email protected] says...

    It really depends if you want to run AI - like Microsoft's CoPilot -
    through dedicated desktop applications or as part of other tools (i.e. >photoshop).



    That's the bit I don't get. What software is there (yet?) which can
    exploit this capability?

    --
    --
    Phil, London

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Andy Burns@21:1/5 to Philip Herlihy on Sun Jun 22 14:15:34 2025
    Philip Herlihy wrote:

    That's the bit I don't get. What software is there (yet?) which can
    exploit this capability?

    A lot of the AI models that hit the headlines seemingly several times
    per week, can be downloaded and run locally with something like "Ollama" provided you've got tons of memory and GPU/NPU

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Theo@21:1/5 to Philip Herlihy on Sun Jun 22 15:51:10 2025
    Philip Herlihy <[email protected]> wrote:
    In article <1038us1$hnq6$[email protected]>, [email protected] says...

    It really depends if you want to run AI - like Microsoft's CoPilot - >through dedicated desktop applications or as part of other tools (i.e. >photoshop).



    That's the bit I don't get. What software is there (yet?) which can
    exploit this capability?

    A lot of 'AI' software doesn't use the NPU: https://www.youtube.com/watch?v=9Y5FzHUxvS8

    - it either uses the cloud or the GPU instead.
    CoPilot is cloud-based, Photoshop is GPU based.

    There are several challenges with using the NPU - one is that there's enough memory of the right kind for the model to run on-device. Another is that
    the model is set up to run on the NPU - possible that it doesn't fit for
    some reason and runs better on the GPU.

    Not to say an NPU might not be useful in future, but it's a bit like saying
    20 years ago 'I'll buy a GPU to make my graphics go faster' - yes it might,
    but only if apps are programmed in the right way - back then many apps
    weren't, and still some apps aren't suitable for the GPU.

    So I think at the moment you need to identify the apps you want to run and
    dig into what kind of hardware will run them. Don't assume that the NPU is automatically going to help. It may be preferable to get a better GPU
    instead.

    Theo

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Philip Herlihy@21:1/5 to All on Mon Jun 23 12:35:39 2025
    In article <1038us1$hnq6$[email protected]>, [email protected] says...

    https://www.tomshardware.com/laptops/what-is-an-ai-pc


    Very useful article - thanks!

    --
    --
    Phil, London

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Philip Herlihy@21:1/5 to All on Thu Jun 26 12:37:34 2025
    In article <[email protected]>, [email protected] says...

    Although my 12?-year-old Dell PC is still performing beautifully, I
    figure I'll need to replace it around October. (Yes, I'll likely
    convert it to Linux, but I do want a Windows PC.)

    I'm seeing a lot of references to "AI" PCs and laptops, which I
    understand have a Neural Processing Module (NPM). I'm finding it hard
    to get these in perspective. I do increasingly use AI - Perplexity
    generated a spectacularly good Formal Objection to a complex planning >application recently, and it's a good place to start with a lot of
    queries. But I'm happy with the online services (good fibre broadband!)
    so I'm not clear what benefit an NPM would bring me. Of course I'm an
    end user rather than a developer (mostly) these days, but I sense I'm
    just scratching the surface.

    However, I don't want to be left out! I had been thinking based on my
    great experience with my current Dell that I'd buy a beefy workstation
    for the next ten years or so, but this technology is just emerging, and
    I'm rather wondering why NPMs seem to be built-in rather than
    replaceable. I'd certainly like decent graphics capability - I
    sometimes use a 3D CAD package which renders architectural and interior >designs, though I don't need "real-time" rendering (happy to wait while
    it chugs away).

    Any thoughts, wise friends?

    Thanks to everyone for interesting and informative observations.

    Just spotted this article, which does seem to address directly why you
    might (?) want an NPM on your fancy new PC: https://www.thurrott.com/windows/windows-11/322465/the-settings-agent- in-windows-11-has-its-own-ai-model

    --
    --
    Phil, London

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Andy Burns@21:1/5 to Philip Herlihy on Thu Jun 26 14:20:54 2025
    Philip Herlihy wrote:

    Just spotted this article, which does seem to address directly why you
    might (?) want an NPM on your fancy new PC: <https://www.thurrott.com/windows/windows-11/322465/the-settings-agent-in-windows-11-has-its-own-ai-model>
    Yeah great, wouldn't want "SystemSettings.exe" chewing up zillions of
    CPU cycles just to tell me what settings I should change ...

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Daniel James@21:1/5 to Philip Herlihy on Sat Jul 5 16:34:16 2025
    On 22/06/2025 12:32, Philip Herlihy wrote:
    I'm seeing a lot of references to "AI" PCs and laptops, which I
    understand have a Neural Processing Module (NPM). I'm finding it hard
    to get these in perspective.

    Back from my holidays and coming to this thread a little late ...

    I'm not an expert ... but I see two ways that people are using AI, at
    present.

    One is very much cloud-based, and requires little or no assistance from
    the client. I'm thinking of things like ChatGPT and its brethren, which
    work using huge models that live on the server. NP capability on the
    server is presumably useful, but not on the client.

    The other is in applications that use pre-built models that can be run
    locally. Things like recognition of human forms in images use this, and
    they can run much better with GPGPU computing of an NPM.

    Aside: I have a cheap (~£20) security camera that has an "AI chip" built
    in to help it recognize people, vehicles, pets, etc., as well as just
    motion in the images from the sensor. It seems to work quite well, but I
    don't know how much the so-called AI chip actually helps, though the CPU
    in a £20 camera can't be all that powerful (and presumably the models
    for recognizing bipedal and quadripedal forms are easy to bake into a
    chip, and simple enough that it doesn't take an expensive chip to run them).

    Chip-makers are jumping on the AI bandwagon by adding some NP capability
    to CPUs, but only to quite high-end CPUs, as yet. It's not clear to me
    whether this is supposed to be helpful with server-side AI applications
    like ChatGPT or with client side applications. In either case there
    won't be any benefit unless there is software support. Ideally you'd
    hope that a standard API for using NP facilities would emerge that would
    work with any NP hardware, but the lack of any such API for GPGPU
    computing (CUDA is nVidia-only) even now suggests that this may be some
    time coming.

    So ... I wouldn't worry at this stage about not having AI capability in
    a new PC. The playing field is still moving around and we can't see
    where the goalposts are going to be, yet. The NP capabilities of a CPU
    you buy today may not be relevant in a year or two's time.

    In any case, there are NPMs (e.g. Hailo) that can attach to PCIe
    (especially M.2) buses. Get one of those, if you're interested enough to
    want to play, and you'll be able to replace it with something else when
    it becomes obsolete without ditching the entire CPU. If PCs with NPMs
    become a thing you can add one of those.

    I think the exciting side of AI will be server-side for some time, yet,
    though - especially as the AI companies want you to upload your data to
    help improve their models (but perhaps I'm a little cynical).

    --
    Cheers,
    Daniel.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Philip Herlihy@21:1/5 to All on Mon Jul 7 14:02:59 2025
    In article <104bgfe$1ilqt$[email protected]>, [email protected]d says...


    In any case, there are NPMs (e.g. Hailo) that can attach to PCIe
    (especially M.2) buses. Get one of those, if you're interested enough to
    want to play, and you'll be able to replace it with something else when
    it becomes obsolete without ditching the entire CPU. If PCs with NPMs
    become a thing you can add one of those.

    I think the exciting side of AI will be server-side for some time, yet,



    That confirms my growing view that the ideal in such an early
    development market (especially for someone whose last PC lasted 11 years
    so far!) that replaceable components would be the way to go. I suspect
    that a "soldered-on" NPU would be obsolete before any software arrived
    that might use it!

    --
    --
    Phil, London

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Andy Burns@21:1/5 to Philip Herlihy on Mon Jul 7 14:14:53 2025
    Philip Herlihy wrote:

    Daniel James says...

    there are NPMs (e.g. Hailo) that can attach to PCIe
    (especially M.2) buses.

    Is this the sort of thing you mean? https://www.amazon.co.uk/Waveshare-Hailo-8-Accelerator-Raspberry- Compatible/dp/B0D9298XL5/
    Win12 might need one of these to run the new AI Paint or Notepad

    <https://etontechnology.com/products/900-21001-0020-000-nvidia-a100-80gb-pcie-gpu-accelerator-new>

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Philip Herlihy@21:1/5 to All on Mon Jul 7 14:07:39 2025
    In article <104bgfe$1ilqt$[email protected]>, [email protected]d says...


    I think the exciting side of AI will be server-side for some time,


    Wondering if Anti-virus tools might be early adopters of NPM
    availability?

    --
    --
    Phil, London

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Philip Herlihy@21:1/5 to All on Mon Jul 7 14:06:25 2025
    In article <104bgfe$1ilqt$[email protected]>, [email protected]d says...


    there are NPMs (e.g. Hailo) that can attach to PCIe
    (especially M.2) buses.


    Is this the sort of thing you mean? https://www.amazon.co.uk/Waveshare-Hailo-8-Accelerator-Raspberry- Compatible/dp/B0D9298XL5/

    --
    --
    Phil, London

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Daniel James@21:1/5 to Philip Herlihy on Mon Jul 7 19:22:27 2025
    On 07/07/2025 14:07, Philip Herlihy wrote:
    I think the exciting side of AI will be server-side for some time,

    Wondering if Anti-virus tools might be early adopters of NPM
    availability?

    That would certainly be a useful thing for them to do ... but I think
    the days of AV tools simply matching virus-typical patterns of opcodes
    in executable binaries are long gone. These days viral code is encrypted
    and hidden inside the image to make it harder for these tools to work.

    It may be that AI can be useful here, but I don't know that todays
    simple parallel pattern-matchers can quite cut the mustard.

    --
    Cheers,
    Daniel.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Daniel James@21:1/5 to Philip Herlihy on Mon Jul 7 19:19:49 2025
    On 07/07/2025 14:06, Philip Herlihy wrote:
    Is this the sort of thing you mean?
    [snipped Amazon link]

    Well, maybe ... that link doesn't work for me (possibly because it's line-wrapped) ... but definitely this sort of thing:

    https://hailo.ai/products/ai-accelerators/hailo-8l-m-2-ai-acceleration-module-for-ai-light-applications/

    or, indeed, one of its bigger brethren with the Hail0-8 (rather than 8L)
    chip, which has twice the processor elements (and about twice the cost).

    Note that the Hailo NPMs are apparently optimized for edge (i.e. on the
    client) video applications, which is perhaps why all the examples are
    for such useful applications as recognizing bananas in video streams!

    --
    Cheers,
    Daniel.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)