rch (u/rch) - Readit News

rch commented on Apache DataFusion datafusion.apache.org/... · Posted by u/thebuilderjr

krapht · a year ago

I feel like I'm not the target audience for this. When I have large data, then I directly write SQL queries and run them against the database. It's impossible to improve performance when you have to go out to the DB anyway; might as well have it run the query too. Certainly the server ops and db admins have loads more money to spend on making the DB fast compared with my anti-virus laden corporate laptop.

When I have small data that fits on my laptop, Pandas is good enough.

Maybe 10% of the time I have stuff that's annoyingly slow to run with Pandas; then I might choose a different library, but needing this is rare. Even then, of that 10% you can solve 9% of that by dropping down to numpy and picking a better algorithm...

rch · a year ago

Maybe your data is stored in a multi-PB pile of HDF5.

rch commented on Show HN: Execute SQL against Bluesky firehose github.com/turbolytics/sq... · Posted by u/dm03514

dm03514 · a year ago

Hello, I’ve been working on a project that embeds duckdb for stream processing.

I just added support for websocket sources which enables sql over the Bluesky firehouse.

https://github.com/turbolytics/sql-flow?tab=readme-ov-file#c...

Duckdb does all the sql execution, and python is responsible for sourcing the data.

The project is still quite young and I’m very much still experimenting, but I’d love any feedback. Thank you.

rch · a year ago

How do you position this relative to Flink SQL?

rch commented on QwQ: Alibaba's O1-like reasoning LLM qwenlm.github.io/blog/qwq... · Posted by u/amrrs

m3kw9 · a year ago

I’m tried it and it keeps refusing to answer coding questions. It just says I cannot answer that.

rch · a year ago

Ensemble with coder-instruct

rch commented on Model Context Protocol anthropic.com/news/model-... · Posted by u/benocodes

rch · a year ago

Strange place for WS* to respawn.

rch commented on Llama-OCR: Document to Markdown llamaocr.com/... · Posted by u/lapnect

nutlope · a year ago

Hi all, I'm the author of llama-ocr. Thank you for sharing & for the kind comments! I built this earlier this week since I wanted a simple API to do OCR – it uses llama 3.2 vision (hosted on together.ai, where i work) to parse images into structured markdown. I also have it available as an npm package.

Planning to add a bunch of other features like the ability to parse PDFs, output a response in JSON, ect... If anyone has any questions, feel free to send them and I'll try to respond!

rch · a year ago

I've had trouble with pulling scientific content out of poster PDFs, mostly because e.g. nougat falls apart with different layouts.

Have you considered that usage yet?

rch commented on Ask HN: What type of Auth are you using on your side projects? · Posted by u/honksillet

gedy · a year ago

Auth0 and FusionAuth

rch · a year ago

+1 for FusionAuth

rch commented on Moshi: A speech-text foundation model for real time dialogue github.com/kyutai-labs/mo... · Posted by u/gkucsko

rch · a year ago

Do app running in an a-shell terminal on the iPad have a convenient way provide a tts interface?

rch commented on Breaking down a record-setting day on the Texas grid blog.gridstatus.io/a-reco... · Posted by u/kmax12

bdcravens · a year ago

After giving my electricity provider access to my EV for optimal charging pretty much killed the 12v battery (they were pinging it hundreds of times an hour, meaning it never went to sleep), I'm never going to give them access to anything.

rch · a year ago

Next day load shapes are predictable, so devices should optimize their charging accordingly.

rch commented on Google is a monopoly – the fix isn't obvious theregister.com/2024/08/1... · Posted by u/rntn

bearjaws · a year ago

If what comes after is not better, then we waited too long to break this monopoly up.

We either start ripping this band-aids off or we will just continually have a worse and worse internet.

rch · a year ago

Break them all up simultaneously or find a better approach.

My perception is that there are too many politicians trying to pick winners for their own benefit.

Posted by u/rch a year ago

Mixture of Nested Experts: Adaptive Processing of Visual Tokens arxiv.org/abs/2407.19985...

u/rch

KarmaCake day6317July 14, 2010

About

[ my public key: https://keybase.io/rch; my proof: https://keybase.io/rch/sigs/N4CAr2P1I5D742LNn3UgGKtObAuYd4gbImkG5fjfsxo ]

Buena Vista, CO

http://zndx.org (sometimes)

@zndx

View Original