feat(users/Profpatsch/netencode): ignore earlier record entries r/3822

It turns out that the netencode spec requiring to ignore *later* entries meant that every parser has to do an extra check for each element, instead of just overriding the key in the hash map. This leads to a situation where the simple implementation is the wrong one, which would lead to very subtle problems in parsers (see also the infamous “json duplicate record entry” problem which has been used for various exploits in the past). To be fair, exploits are still possible, but at least a `Map.fromList` will be the right implementation (provided it folds from the left) now instead of the wrong one. Examples of the trivial implementation being now right: Python: > dict([("foo", 1), ("foo", 2)]) {'foo': 2} Rust: > println!("{:?}", HashMap::from([ ("foo", 1), ("foo", 2) ])); {"foo": 2} Haskell: > Data.Map.fromList [ ("foo", 1), ("foo", 2) ] fromList [("foo",2)] Change-Id: Ife9593956f4718e5e720f4f348c227e4f3a71e2d Reviewed-on: https://cl.tvl.fyi/c/depot/+/5108 Tested-by: BuildkiteCI Reviewed-by: Profpatsch <mail@profpatsch.de> Reviewed-by: sterni <sternenseemann@systemli.org> Autosubmit: Profpatsch <mail@profpatsch.de>
author: Profpatsch <mail@profpatsch.de> 2022-01-29T11·50+0100
committer: Profpatsch <mail@profpatsch.de> 2022-02-14T14·12+0000
commit: ed68ba675191bc3da57019b14609deb713247821 (patch)
tree: 222c6fadbfab3427688d89e01680061df83df2a2 /users/Profpatsch/netencode/README.md
parent: 82ba42c4396c35e8bf69535af311e9b9f0cffa17 (diff)
1 files changed, 5 insertions, 1 deletions
diff --git a/users/Profpatsch/netencode/README.md b/users/Profpatsch/netencode/README.md
index 67cb843a58..8dc39f6337 100644
--- a/users/Profpatsch/netencode/README.md
+++ b/users/Profpatsch/netencode/README.md
@@ -73,7 +73,11 @@ A tag (`<`) gives a value a name. The tag is UTF-8 encoded, starting with its le
 ### records (products/records), also maps
 
 A record (`{`) is a concatenation of tags (`<`). It needs to be closed with `}`.
-If tag names repeat the later ones should be ignored. Ordering does not matter.
+
+If tag names repeat the *earlier* ones should be ignored.
+Using the last tag corresponds with the way most languages handle converting a list of tuples to Maps, by using a for-loop and Map.insert without checking the contents first. Otherwise you’d have to revert the list first or remember which keys you already inserted.
+
+Ordering of tags in a record does not matter.
 
 Similar to text, records start with the length of their *whole encoded content*, in bytes. This makes it possible to treat their contents as opaque bytestrings.
author	Profpatsch <mail@profpatsch.de>	2022-01-29T11·50+0100
committer	Profpatsch <mail@profpatsch.de>	2022-02-14T14·12+0000
commit	ed68ba675191bc3da57019b14609deb713247821 (patch)
tree	222c6fadbfab3427688d89e01680061df83df2a2 /users/Profpatsch/netencode/README.md
parent	82ba42c4396c35e8bf69535af311e9b9f0cffa17 (diff)