Jason Evans <
[email protected]> writes:
Right now with INN, one of the storage options is to save articles
directly in /var/spool/news/articles. What if, in addition to saving
articles there, the articles were also saved in IPFS so they could be retrieved by anyone who wanted a copy.
I'm not sure I understand what benefit IPFS offers for Usenet.
Looking over their front page, my impression of the problems that IPFS is trying to solve is:
* Peer-to-peer distributed hash-addressable content archiving.
* Data versioning and unique identifiers via the hash.
* Tamper-resistence (again because everything is tied to a hash).
Usenet does not have the unique identifier problem; that's the message ID, which already has to be unique for Usenet to work. Usenet doesn't have a
data versioning problem; you're not supposed to ever modify an article
once it has been posted. And everything on Usenet retrieves messages by message ID or by newsgroup name, so the hash-addressable content storage
is kind of useless since no one has or cares about those hashes.
Meanwhile, you have a potential uniqueness problem for IPFS storage
because the Path, Xref, and some other headers vary by Usenet node, so you
need to apply some filter to the article before storing it in IPFS anyway
or you just break IPFS by storing multiple duplicates of every article.
The peer-to-peer thing is arguably interesting, except that this is
essentially what NNTP already does, with roughly equivalent efficiency, so mostly what you're gaining here is adding nodes that are IPFS nodes but
not Usenet nodes to the archive network. Which, okay, sure, and
admittedly setting up IPFS is probably a lot easier than setting up a news server, but then how do you *use* those nodes to retrieve articles in any useful way?
If we were designing Usenet today from scratch, there's a decent argument
to be made that message IDs should just be message hashes over a
canonicalized version of the message, which (mostly) avoids the problem of
how to generate good message IDs and also allows verification that the
message hasn't been modified (against which Usenet currently has no real protection except speed of propagation). And indeed there's nothing that
would stop Usenet software from starting to use hashes as message IDs now, although because it's not guaranteed, it would need a lot of adoption
before you would get any reasonable integrity protection benefits. But
the rest of IPFS feels like a generic implementation of functionality that Usenet already has a more specialized implementation of, but with the
wrong storage keys so you would have to store external metadata to have
any hope of being able to do meaningful message retrieval.
I agree that it would be great to have more independent Usenet archives,
but simplicity says the easiest way to do that would be to find some
volunteers to run ordinary Usenet servers with infinite retention.
--
Russ Allbery (
[email protected]) <
https://www.eyrie.org/~eagle/>
Please post questions rather than mailing me directly.
<
https://www.eyrie.org/~eagle/faqs/questions.html> explains why.
--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)