Readit News logoReadit News
rch commented on Apache DataFusion   datafusion.apache.org/... · Posted by u/thebuilderjr
krapht · a year ago
I feel like I'm not the target audience for this. When I have large data, then I directly write SQL queries and run them against the database. It's impossible to improve performance when you have to go out to the DB anyway; might as well have it run the query too. Certainly the server ops and db admins have loads more money to spend on making the DB fast compared with my anti-virus laden corporate laptop.

When I have small data that fits on my laptop, Pandas is good enough.

Maybe 10% of the time I have stuff that's annoyingly slow to run with Pandas; then I might choose a different library, but needing this is rare. Even then, of that 10% you can solve 9% of that by dropping down to numpy and picking a better algorithm...

rch · a year ago
Maybe your data is stored in a multi-PB pile of HDF5.
rch commented on Show HN: Execute SQL against Bluesky firehose   github.com/turbolytics/sq... · Posted by u/dm03514
dm03514 · a year ago
Hello, I’ve been working on a project that embeds duckdb for stream processing.

I just added support for websocket sources which enables sql over the Bluesky firehouse.

https://github.com/turbolytics/sql-flow?tab=readme-ov-file#c...

Duckdb does all the sql execution, and python is responsible for sourcing the data.

The project is still quite young and I’m very much still experimenting, but I’d love any feedback. Thank you.

rch · a year ago
How do you position this relative to Flink SQL?
rch commented on QwQ: Alibaba's O1-like reasoning LLM   qwenlm.github.io/blog/qwq... · Posted by u/amrrs
m3kw9 · a year ago
I’m tried it and it keeps refusing to answer coding questions. It just says I cannot answer that.
rch · a year ago
Ensemble with coder-instruct
rch commented on Model Context Protocol   anthropic.com/news/model-... · Posted by u/benocodes
rch · a year ago
Strange place for WS* to respawn.
rch commented on Llama-OCR: Document to Markdown   llamaocr.com/... · Posted by u/lapnect
nutlope · a year ago
Hi all, I'm the author of llama-ocr. Thank you for sharing & for the kind comments! I built this earlier this week since I wanted a simple API to do OCR – it uses llama 3.2 vision (hosted on together.ai, where i work) to parse images into structured markdown. I also have it available as an npm package.

Planning to add a bunch of other features like the ability to parse PDFs, output a response in JSON, ect... If anyone has any questions, feel free to send them and I'll try to respond!

rch · a year ago
I've had trouble with pulling scientific content out of poster PDFs, mostly because e.g. nougat falls apart with different layouts.

Have you considered that usage yet?

rch commented on Ask HN: What type of Auth are you using on your side projects?    · Posted by u/honksillet
gedy · a year ago
Auth0 and FusionAuth
rch · a year ago
+1 for FusionAuth
rch commented on Moshi: A speech-text foundation model for real time dialogue   github.com/kyutai-labs/mo... · Posted by u/gkucsko
rch · a year ago
Do app running in an a-shell terminal on the iPad have a convenient way provide a tts interface?
rch commented on Breaking down a record-setting day on the Texas grid   blog.gridstatus.io/a-reco... · Posted by u/kmax12
bdcravens · a year ago
After giving my electricity provider access to my EV for optimal charging pretty much killed the 12v battery (they were pinging it hundreds of times an hour, meaning it never went to sleep), I'm never going to give them access to anything.
rch · a year ago
Next day load shapes are predictable, so devices should optimize their charging accordingly.
rch commented on Google is a monopoly – the fix isn't obvious   theregister.com/2024/08/1... · Posted by u/rntn
bearjaws · a year ago
If what comes after is not better, then we waited too long to break this monopoly up.

We either start ripping this band-aids off or we will just continually have a worse and worse internet.

rch · a year ago
Break them all up simultaneously or find a better approach.

My perception is that there are too many politicians trying to pick winners for their own benefit.

u/rch

KarmaCake day6317July 14, 2010
About
[ my public key: https://keybase.io/rch; my proof: https://keybase.io/rch/sigs/N4CAr2P1I5D742LNn3UgGKtObAuYd4gbImkG5fjfsxo ]

Buena Vista, CO

http://zndx.org (sometimes)

@zndx

View Original