I just added support for websocket sources which enables sql over the Bluesky firehouse.
https://github.com/turbolytics/sql-flow?tab=readme-ov-file#c...
Duckdb does all the sql execution, and python is responsible for sourcing the data.
The project is still quite young and I’m very much still experimenting, but I’d love any feedback. Thank you.
When I have small data that fits on my laptop, Pandas is good enough.
Maybe 10% of the time I have stuff that's annoyingly slow to run with Pandas; then I might choose a different library, but needing this is rare. Even then, of that 10% you can solve 9% of that by dropping down to numpy and picking a better algorithm...