The smallest quantized version (2bit) needs 20GB of RAM (which can be offloaded onto the VRAM of a decent 4090 GPU). The 4bit quantized versions are the largest models that can just about fit onto a 32GB system (29GB-31GB). The 6bit (41GB) and 8bit (52GB) models need a 64GB system. You would need multiple GPUs with shared memory if you wanted to offload the higher precision models to VRAM.
I've experimented with the 7B and 13B models, but haven't tried these models yet, nor any other larger ones.
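As a rough sketch of the sizing above: the lookup below just encodes the RAM figures quoted in this comment and picks the highest bit-width that fits a given machine. The function name and the `reserved_gb` parameter are made up for illustration, and the 4bit entry uses the upper end of the quoted range to stay on the safe side.

```python
# RAM needed (GB) per quantization bit-width, as quoted in the comment above.
# The 4-bit entry uses the upper end of the quoted 29GB-31GB range.
RAM_NEEDED_GB = {2: 20, 4: 31, 6: 41, 8: 52}


def largest_quant_that_fits(system_ram_gb, reserved_gb=0):
    """Return the highest bit-width whose quoted RAM need fits, or None.

    reserved_gb is a hypothetical allowance for the OS and other processes.
    """
    available = system_ram_gb - reserved_gb
    fitting = [bits for bits, need in RAM_NEEDED_GB.items() if need <= available]
    return max(fitting) if fitting else None


if __name__ == "__main__":
    for ram in (16, 24, 32, 64):
        print(f"{ram}GB system -> {largest_quant_that_fits(ram)}")
```

Note that with anything reserved for the OS, a 32GB system drops back to the 2bit version, which is why "just about fit" is the right way to describe it.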
I have recently spent a fair bit of time experimenting with this on Fly for my application (https://www.ssrfproxy.com). It's hard to beat the straightforwardness of deploying in a single region, with the database in close proximity. This approach probably meets the needs of 99% of developers. Aka Heroku.
Papers and references (page maintained by a central academic in the world of CRDTs): https://crdt.tech
Group doing research into how they can be used to build interesting collaborative (and async) applications: https://www.inkandswitch.com
A few of the major open source implementations - mostly for rich text editing or JSON-like data structures:
- Yjs: https://github.com/yjs/yjs
- Automerge: https://github.com/automerge/automerge
- Peritext: https://www.inkandswitch.com/peritext/
- Diamond types: https://github.com/josephg/diamond-types
People building eventually consistent database syncing with them:
- https://electric-sql.com (Postgres <-> SQLite)
- https://vlcn.io (SQLite <-> SQLite)
Open source collaborative servers (coordination, persistence, presence):
- https://github.com/ueberdosis/hocuspocus
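For anyone new to the idea, here is a toy sketch of the property all of the above rely on: replicas can be updated independently and still converge once they exchange state. This is a grow-only counter, one of the simplest CRDTs, and not how Yjs, Automerge or the others work internally (they use much richer sequence and JSON-like structures). The class and method names are my own, for illustration only.

```python
class GCounter:
    """Grow-only counter CRDT: one slot per replica, merged by per-slot max."""

    def __init__(self, replica_id):
        self.replica_id = replica_id
        self.counts = {}  # replica id -> number of increments seen from it

    def increment(self, amount=1):
        if amount < 0:
            raise ValueError("a grow-only counter cannot be decremented")
        self.counts[self.replica_id] = self.counts.get(self.replica_id, 0) + amount

    def value(self):
        return sum(self.counts.values())

    def merge(self, other):
        # Taking the max per replica makes merge commutative, associative
        # and idempotent, so the order and repetition of syncs do not matter.
        for rid, n in other.counts.items():
            self.counts[rid] = max(self.counts.get(rid, 0), n)


if __name__ == "__main__":
    a, b = GCounter("a"), GCounter("b")
    a.increment(2)  # edits made while the replicas are disconnected
    b.increment(3)
    a.merge(b)
    b.merge(a)
    b.merge(a)      # merging the same state again changes nothing
    print(a.value(), b.value())
```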
If anyone has thoughts about this space, feature requests, would like a preview of what we are building, or anything else, please do reach out directly to me at henry@firesync.live. I'm talking to as many people as possible at the moment.
Remote: Yes
Willing to relocate: no
Technologies: node.js, react, typescript, terraform, redis, k8s, elixir, mongo, Postgres, docker, GCP/AWS, hashicorp stack, Linux, Nginx, haproxy, bash etc
Résumé/CV: Former founder of ShareLaTeX, successful exit after bootstrapping it. Now on my first job hunt in 10+ years. Lots of experience in ops/dev ops and fast real-time collaboration at big scale. Full stack developer at heart, with lots of ops experience in recent years managing large scale cloud migrations and infrastructure. 12+ years' experience.
Email: henry.oswald@gmail.com
Simple, readable task runner. It has replaced make and rake in a lot of use cases.