I was curious how this performed server-side versus protobufjs (what we're currently using). I hastily wired it up to protobufjs's benchmark suite (https://github.com/protobufjs/protobuf.js/tree/master/bench). The suite is pretty ancient, so getting buf-compiled ESM added was a challenge.
Granted, the benchmark is created for protobufjs and they probably optimize against it. Protobuf-ES was about 5.1x slower than protobufjs for encoding and 14.8x slower than protobufjs for decoding.
This was run on my M1 with node 16.14; not particularly scientific, etc, etc.
My gut feeling is anyway that PB in JS is likely a failure against just using JSON (which the JS runtime implements in efficient native code).
No, but anecdotally the others are probably a lot faster. And you can do even better than that with zero-copy serializations.
But protobuf has very wide support and is decent enough in js, at least server-to-server. Having a schema is very valuable, and there are substantial size wins over JSON, even with gzip.
I gave up on protobufs years ago. The protobuf team has no idea how to write PHP and JS libraries. I got segfaults from using the PHP extension. The built-in toJSON would return invalid JSON (missing braces for binary types). Ridiculous stuff.
I really just prefer to use JSON for everything. It's much easier to debug and observe traffic (browser Network tab). I like JSON-RPC, very simple spec (basically one page long). I don't like REST.
All that said, I'm really glad to see the community take things into their own hands.
> It's much easier to debug and observe traffic (browser Network tab).
The DX for JSON things is much better. The UX for protobufs is much better (faster, less data over the wire, etc). Which you optimize for is up to you, but there isn't a straightforward "Use this tech because it's the best one."
I've always wondered about this. Firstly, I'm fairly sure client-side JSON parsing is significantly faster than protobuf decoding, but even for data over the wire: JSON is pretty compressible, so surely the gains there are going to be marginal. Surely never enough benefit to UX to warrant the DX trade-off, right?
protobufs have a great property of having a schema (and then generating code), which means it's pretty easy to set up a system where an accidental change of API fails CI tests for mobile apps and web.
This is doable with JSON, but I've never seen a JSON-based setup actually work well at catching these kinds of regressions.
Assuming your developer time is constrained, improved DX often also leads to better UX (more features). So even if you are optimizing for UX, you may well be better off with JSON.
I don't develop in JS so can't comment on DX there, but I've found the DX to be pretty good when using protobuf in other languages.
That's mostly been down to having IDE autocompletion for data structures and fields once the protobuf code's been generated.
For many JSON APIs I've worked with there's only been human-readable documentation, making them more error-prone to work with (e.g. having to either craft JSON manually for requests, or write a client library if one doesn't already exist).
I think protobuf really works well on the backend, specifically with compiled languages like Go or C++, as seen by the usage at Google and the adoption of gRPC for Go-based cloud tooling. Beyond that it's a huge failure. The generated code and usage for other languages is not idiomatic. In fact it's a hindrance, and you can see that by the lack of adoption except by the largest orgs, who are enforcing it using some sort of grpc-web bridge with types for the frontend. Ultimately you can just convert proto to OpenAPI specs and do a much better job at custom client libs with that.
I'm not a frontend dev. Most of my time was spent on the backend, but what I'll say is I much prefer the fluidity and dynamic nature of JavaScript and its built-in ability to deal with JSON, which naturally becomes objects. All the type stuff is easy to do, but with docs you can get away with not needing it.
My feeling: protobuf lives on for gRPC server-side stuff, but everywhere else OpenAPI is winning.
JSON parsing is a minefield, especially in cross-platform scenarios (language and/or library). You won't encounter those problems on toy projects or simple CRUD applications. For example, as soon as you deal with (u)int64 where values are greater than 2^53, a simple round-trip through javascript can wreak silent havoc.
See http://seriot.ch/projects/parsing_json.html
Protobuf support for google's first-class citizen languages is usually very good, i.e. C++, Java, Python and Go. For other languages, it depends on each implementation.
As always, each protocol/data format has its place. You need to maximize the amount of data you send in each packet? Then protobuf is better than JSON. Need to support a large number of clients without any fuss? Then JSON is better. Wanna pass around data you don't know the schema of? JSON again.
Context matters, there are no silver bullets, everything has trade-offs, and so on, and so on.
JSON messages in a compressed websocket stream are surprisingly tiny. Bigger than compressed protobuf packets but not by much, and much smaller than uncompressed protobuf packets.
Honestly, gzipped json is likely much smaller than uncompressed protobuf.
If you were going to use a binary protocol, why choose one that has no partial parsing or table of contents these days? There are much better alternatives IMO (flatbuffers being one of them).
> Wanna pass around data you don't know the schema of? JSON again.
This is a red herring. If you don't know the schema on the receiving (or sending, for that matter) side, then you can't do anything with the data other than pass it on. If you _do_ know what it looks like, then it has an implicit schema whether you call it a schema or not.
At the time, we needed interop with C. So that's why we chose protobufs. But it was a nightmare to work with in other languages. Including C++ for cross platform desktop apps where cross compiling became a problem too.
JSON in C is unfortunately way harder than in other modern languages (e.g. Go which makes it a breeze with struct tags and a great stdlib).
The problem I see with JSON is its limited set of “native” types. I really wish it had specified support for proper numeric types (int, uint, various widths) and not just doubles. A timestamp type would be great as well.
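To make that concrete: in JavaScript every number is an IEEE-754 double, so an int64 above 2^53 silently loses precision on a plain JSON round-trip (the id value here is illustrative):

```javascript
// JavaScript numbers are IEEE-754 doubles, so integers above
// Number.MAX_SAFE_INTEGER (2^53 - 1) cannot be represented exactly.
// A JSON round-trip of an int64 id silently corrupts the value.
const wire = '{"id": 9007199254740993}'; // 2^53 + 1, as sent by a backend
const parsed = JSON.parse(wire);

console.log(parsed.id); // 9007199254740992 — off by one, no error thrown
console.log(JSON.stringify(parsed)); // '{"id":9007199254740992}'
```

This is why many APIs encode 64-bit ids as strings in JSON, and why protobuf's JSON mapping does the same for int64 fields.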
What I really like about Protocol Buffers is that you must write a schema to get started. No more JSON.stringify anything. Everything else sucks though.
Hi there, I am the primary maintainer of the PHP library as of the last few years. I have heard that there used to be a lot of crashes; the code was almost completely rewritten in 2020 and is in a much better state now. If you find a segfault and you have a repro, file a bug and we will fix it.
I recommend Capnproto. Parsing time is zero, you can pretend you're a Microsoft programmer in the early 90s and just use the in-RAM struct as your wire format. Maybe it doesn't make sense for in-browser JS applications (though WASM is a different story) but for IPC and RPC in the general case, all parsing and unparsing does is generate waste heat.
ALWAYS favor a binary format unless you have a really good reason otherwise.
Capnproto is designed by Kenton, a former Google engineer who did a lot of work with protobufs at Google. I see Capnproto as the spiritual successor of protobuf, fixing many issues in protobufs.
Also, Capnproto is quite extensively used in some Cloudflare products.
I like protobufs but I was also disappointed at the JS protobuf options. I disliked both the JS object representation and RPC transport.
grpc-web in particular requires an Envoy proxy which seems absurdly heavyweight. I ended up using Twirp because Buf connect wasn't yet released or planned.
I rolled my own JS representation. The major differences from Connect:
- Avoid undefined if the message is not present on the wire and use an empty instance of the object instead. For recursive types, find the minimal set of fields to initialize as undefined instead of empty.
- Transparently promote some protobuf types, like google.protobuf.Timestamp to a proper Instant type (from js-joda or similar library). This makes a surprisingly large difference on reducing the number of jumps from the UI to the API.
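A sketch of that promotion, using the built-in Date instead of js-joda to stay dependency-free (`timestampToDate` is a made-up helper, not part of Connect's API):

```javascript
// google.protobuf.Timestamp decodes to a { seconds, nanos } pair.
// Promote it to a native Date so UI code never touches the raw pair.
function timestampToDate(ts) {
  // seconds may arrive as a bigint from the decoder; Number() handles both.
  return new Date(Number(ts.seconds) * 1000 + Math.floor(ts.nanos / 1e6));
}

const ts = { seconds: 1640995200n, nanos: 500000000 };
console.log(timestampToDate(ts).toISOString()); // 2022-01-01T00:00:00.500Z
```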
A subjective opinion, but reading some documentation and maybe checking an OpenAPI spec is much easier than having to deal with protobuf.
If using compression the size is in the same ballpark (protobuf can be between 20% and 50% smaller). For 99% of users it should not make a difference. https://nilsmagnus.github.io/post/proto-json-sizes/#gzipped-...
You also have solutions like GraphQL that define a schema, or you can publish some kind of schema (a good thing to do) but use JSON instead of a binary format.
Protobuf also does not declare its schema on the wire. Message parsers can be generated from a schema, but that's also true for REST over JSON. Even ad hoc REST APIs often have better self-declaration of resource types than protobuf.
(I still like protobuf, but the schemas are a terrible reason to like it.)
But you can do automatic validation fairly easily with JSON Schema. You don't need to choose a binary format to get validation.
The principal benefit is that you can use the schema to define the data format, which means you can pack the data in more tightly (you don't need a byte to say "this is an object" if you know that the input data must be an object at this point). That's a big benefit in certain situations, but if you're using this sort of stuff just to get validation then you're probably better off using JSON Schema and having a wire transfer format that you can read easily without additional tools.
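For example, a minimal JSON Schema (field names invented for illustration) that a validator such as Ajv could enforce at the API boundary, while the wire format stays plain, readable JSON:

```json
{
  "$schema": "https://json-schema.org/draft/2020-12/schema",
  "type": "object",
  "required": ["id", "name"],
  "properties": {
    "id": { "type": "integer", "minimum": 0 },
    "name": { "type": "string" },
    "tags": { "type": "array", "items": { "type": "string" } }
  },
  "additionalProperties": false
}
```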
The link you included shows that protobufs are at least 15% better for all users, and as much as 57% better for cases where the data is small. Doesn't that mean for 100% of users it will actually make a difference?
Your users might not care about the difference but it will be there.
Actually realizing that speed up for your users will take time away from delivering features.
Engineering is a trade off, always will be.
You don't optimize things for the cases when they are fast. (Unless the gain is a couple of orders of magnitude; certainly not for a 50% speedup.)
The 15% gain is the one that matters. In practice, it comes at the expense of a more complex (thus larger, negating some of it) and less reliable system. It is very rare that this trade-off is worth it.
protobuf is much more concise and readable than OAS. You can define API contracts in protobuf and still serve JSON APIs via the standard-ish gRPC/JSON transcoding enabled by google.api annotations.
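A sketch of what that transcoding setup looks like (service and paths invented for illustration) — the google.api.http annotation maps an RPC onto a JSON/HTTP endpoint, so one contract serves both gRPC and plain-JSON clients:

```protobuf
syntax = "proto3";

import "google/api/annotations.proto";

service UserService {
  // Reachable as gRPC UserService/GetUser and as GET /v1/users/{user_id}.
  rpc GetUser(GetUserRequest) returns (User) {
    option (google.api.http) = {
      get: "/v1/users/{user_id}"
    };
  }
}

message GetUserRequest {
  string user_id = 1;
}

message User {
  string user_id = 1;
  string display_name = 2;
}
```

The transcoding itself is done by a gateway (e.g. Envoy's gRPC-JSON transcoder or grpc-gateway), not by the proto compiler.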
This only makes sense if you have a server that someone else put together that for some reason only speaks protobuf. I'm not aware of any language ecosystem that has protocol buffers but no json support, so if you're building a server from scratch this isn't a good reason to use protobufs.
And if you are faced with a server that only speaks protobuf, the same question applies to the original devs: why did they make that decision?
For non-niche use cases that is a bad developer experience.
If you are designing your own solution that uses protobuf instead of JSON, say goodbye to a range of useful tools that the whole industry uses. From testing to automation it will be harder at every step, and you will have to find custom solutions instead of the usual no-customization solutions that work OOTB with JSON.
It is a good way to frustrate your developers and generate sometimes brittle solutions related to testing/automation/infrastructure.
I am using it for sending data between game server and client. Encoding the messages in JSON would be just silly, although I wonder what is the standard in the game industry.
Otherwise personally json wins.
we use it at https://woogles.io for pretty much all communication (server-to-server and client-to-server). I do loathe dealing with the JS aspect of it and am very excited to move over to Protobuf-ES after reading this article (and shaving off a ton of repeated code and generated code).
I keep trying to understand and use protobuf but every time I look at it and its API (this article included) I get more confused and have absolutely no idea how to implement it.
I can't tell whether I'm just dumb or a really terrible developer, or if the docs or the thing itself is really hard to use?
1. Your schema is the source of truth.
2. protoc should generate code as part of your build (try not to check in generated proto code if at all possible).
3. Use generated code to output bytes/parse bytes (this depends on your HTTP/RPC library).
The other trick is that you should use the exact same (!) schema file for your frontend and backend projects. This means that changing it should trigger regeneration of generated code for your clients and servers and then run CI on them.
So if you accidentally introduce a breaking API change, the CI for broken client will fail before you deploy it.
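As one example of step 2, Buf lets the generation step live in a config file next to the schema, so CI regenerates on every schema change (plugin name and output path follow Buf's docs but are assumptions — adjust to your setup):

```yaml
# buf.gen.yaml — run `buf generate` in CI and as part of the build
version: v1
plugins:
  - plugin: buf.build/bufbuild/es
    out: src/gen
    opt: target=ts
```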
> The other trick is that you should use the exact same (!) schema file for your frontend and backend projects. This means that changing it should trigger regeneration of generated code for your clients and servers and then run CI on them.
You do not need to have the exact same schema file, in fact protobuf is carefully designed to avoid needing this. You need to follow some rules about what to do when fields are added or removed:
* Generally, roll out the server side first then, once that is complete, start rolling out the client afterwards.
* If a field is added (on the server side), make sure that it can be ignored on the client side, so old clients are not impacted. For example, don't add a "units" field that changes the meaning of existing "temperature" field (previously had to be fahrenheit, now can be celsius or fahrenheit). Instead add a separate field "temperature_celsius" and send both. (You can always remove the old one later on the server if new clients don't need it and you have 100% finished roll out of clients.) Note that receiving unexpected field data is not an error in protobuf, so the extra field won't cause any problems so long as it's not a problem at application level.
* You can equally remove a field so long as the client isn't relying on it (in this case you may need to roll out client update first). More accurately (with proto3 syntax) it will appear as empty/zero so this needs to be OK.
* You can't change a field's type e.g. from integer to double (or from one message type to another, but just adding a field to a message according to the above is OK). If you want to do that, go through a controlled process of adding a new field with the new type you want then removing the old field.
* You are free to reorganise the order fields appear in the proto file but don't renumber the fields - the field number is what defines it in the binary encoding. In particular, if you remove field number 2 (for example) you should leave a gap (fields 1, 3, 4,... remaining) rather than renumbering the remaining ones to be contiguous.
Depending on the application, it is often actually a good idea to have a completely separate copy of the proto file in the client and server applications, with the client proto typically lagging behind the server one.
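The renumbering rule can even be enforced by the schema itself — proto3's reserved keyword (message and field names here are illustrative) makes accidental reuse of a removed field's number or name a compile-time error:

```protobuf
syntax = "proto3";

message Reading {
  // Field 2 used to be `double temperature` (fahrenheit); it was removed
  // rather than repurposed. Reserving it prevents accidental reuse.
  reserved 2;
  reserved "temperature";

  string sensor_id = 1;
  double temperature_celsius = 3; // added later; old clients simply ignore it
}
```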
I can empathize. I was the same way at first. What is it that you find confusing? Perhaps we can help clear it up or link you to helpful documentation (or improve our own docs).
Maybe what could be added is a debug header when using gRPC. If it is present, the proto schema is sent with each request/response. Then the tooling can be enhanced to look for this.
I suspect this would not be much heavier than JSON, so it could always be left on for those who are OK with the overhead.
Win win?
Protobufjs is good, but I can't use it because it's only a protobuf library, not a gRPC library. I end up having to use grpc-web, with all the problems it comes with.
I was hoping Buf could solve that problem... Maybe in the future! :)
The same reason, along with the fact that you had to generate code and usually needed to convert it to a class afterward, was why I wrote my own typescript-native binary serializer [0] (mostly based on C-FFI for compatibility) a few years ago.
[0]: https://github.com/i404788/honeybuf
Shameless plug to my project Phero [0]. It’s a bit like gRPC but specifically for full stack TypeScript projects.
It has a minimal API, literally one function, with which you can expose your server’s functions. It will generate a Typesafe SDK for your frontend(s), packed with all models you’re using. It will also generate a server which will automatically validate input & output to your server.
One thing I’ve seen no other similar solution do is the way we do error handling: throw an error on the server and catch it on the client as if it was a local error.
As I said, it’s only meant for teams who have full stack TypeScript. For teams with polyglot stacks an intermediate like protobuf or GraphQL might make more sense. We generate a TS declaration file instead.
[0] https://github.com/phero-hq/phero
https://trpc.io/docs/v10/quickstart