Franck <
[email protected]d> writes:
What about :
- From
- Subject
Oh, sorry, yes, I knew I forgot something. Those should go into the
"readers are going to have a hard time understanding the message"
category of rejecting duplicates.
- Archived-At
- Bcc
- Cc
- Keywords
- Reply-To
- Sender
- To
Reply-To is probably in the From and Subject territory.
None of the rest are Usenet headers, so I think this is just a question of whether you want to generally impose a uniqueness rule on email headers
that happen to show up in Usenet articles. I think that's a reasonable
thing to do if you want, but you'll be dropping articles that will be
readable and not pose any practical problems.
(A Bcc header showing up in a Usenet post indicates that someone or
someone's software was very confused about how everything works.)
- Nntp-Posting-Date
- Nntp-Posting-Host
- X-Complaints-To
- X-Trace
These are all obsolete trace headers that should be replaced by
Injection-Info, but which are still in use in the wild. They're all informational headers that don't carry any protocol meaning, so this is a
bit like the email header case. Maybe a slightly stronger argument for uniqueness since they are relevant to abuse situations.
- Also-Control
I'm fairly sure that this is thoroughly obsolete and not honored by
anything, although maybe I would be surprised. If it is honored, you'd
want it to be unique, but really nothing should honor it.
- Article-Name
- Article-Updates
- Date-Received
- Posting-Version
- Relay-Version
- See-Also
I think these are all just random headers that nothing uses. (Some of
them, like Relay-Version, are just so incredibly obsolete that nothing
will use them any more.)
- Cancel-Key
- Cancel-Lock
Oh, good call, RFC 8315 says these MUST NOT appear more than once, so may
as well reject messages with more than one of them as well.
In some cases it's arguable that the article is still sensible if the
header is duplicated but both copies of the header have the same value
(the most common duplication error in my experience), but it's still
technically invalid to duplicate them.
Added to my to-list :-)
Note that for relaying you can't fix this (by dropping one of the
duplicates) since that's an unpermitted alteration of the article. You
have to either accept it or reject it.
On injection, it may not be a bad idea to toss duplicate header fields
that have the same content before rejecting articles that still have duplicates. It's just a bit friendlier to broken posting agents; whether
you want to be friendly to such things is a bit of a judgment call.
I do not manage filters with calls to perl or python but they can be set
for injection and/or relay (Some sort of postfilter and cleanfeed) and I
will use this (long) list to initialize a combo in the GUI part of the software.
I'm not sure I would. I don't think having a long list of headers that
you care about was the right design for INN's filters. It's just hard to
fix now.
I'd be more inclined to populate a dropdown with protocol headers that
people are likely to care about and then let people type in the names of additional headers if they care.
--
Russ Allbery (
[email protected]) <
https://www.eyrie.org/~eagle/>
Please post questions rather than mailing me directly.
<
https://www.eyrie.org/~eagle/faqs/questions.html> explains why.
--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)