If the query uses the index, then the on-the-fly tsvector rechecks only run on the matches, and the benchmark queries have LIMIT 10, so there should be few rechecks, right?
Edit: yes, but the query predicates have conditions on two GIN indexes, so I guess the planner chooses to recheck all the matches for one index first, even though it could avoid a lot of work by rechecking row-wise.
Also, 10M input token context is insane!
EDIT: https://huggingface.co/meta-llama/Llama-3.1-405B is BF16, so yes, it seems training in FP8 is new.
At one time I thought I read that there was a project to embed Kuzu into DuckDB, but bringing a vector store natively into Kuzu sounds even better.
The reason to have a separate system that correctly calls getElementById() is to avoid this issue.
So it's actually a _less_ brittle approach: it doesn't rely on legacy behavior, and it avoids the surprises that come with it.
But I do think the legacy browser behavior of exposing the ID attribute as window properties is very brittle, for the reasons you suggest.
const $id = new Proxy({}, {
// get element from cache, or from the DOM (cached on first lookup);
// the otherwise-unused receiver slot `r` is reused as a scratch variable
get: (tgt, k, r) => (tgt[k] || ((r = document.getElementById(k)) && (tgt[k] = r))),
// prevent programming errors: the cache is read-only
set: () => { throw new Error(`Attempt to overwrite id cache key!`); }
});
It's nice to be able to refer to elements by property name, so <div id="thing"></div> is reachable with $id.thing.
And, since the underlying structure is just an object, you can still enumerate it with Object.keys, which can sometimes be a useful debugging aid and a general catalog of accessed elements. Anyway, Proxy is a wildly underappreciated and underused class in JavaScript.
Edit: My tone may have indicated that the parent's solution was brittle. It's not!
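For anyone who wants to try the lazy-cache pattern outside a browser, here is a minimal runnable sketch. It assumes a plain lookup function in place of document.getElementById, and the fakeDom object is a made-up stand-in for the real DOM:

```javascript
// Generic lazy cache: look a key up once, then serve it from the cache.
// `lookup` stands in for document.getElementById in a non-browser runtime.
function makeCache(lookup) {
  return new Proxy({}, {
    // get from cache, or run the lookup and cache a truthy result
    get: (tgt, k) => {
      if (!(k in tgt)) {
        const v = lookup(k);
        if (v !== undefined && v !== null) tgt[k] = v;
      }
      return tgt[k];
    },
    // writing to the cache is a programming error, so fail loudly
    set: () => { throw new Error("Attempt to overwrite cache key!"); }
  });
}

// Usage: a fake "DOM" keyed by id (hypothetical stand-in data).
const fakeDom = { thing: { tag: "div", id: "thing" } };
const $id = makeCache(k => fakeDom[k]);

console.log($id.thing.id);     // "thing"
console.log(Object.keys($id)); // [ 'thing' ] — only accessed ids appear
```

Because the ownKeys trap is left at its default, Object.keys falls through to the plain target object, which is what makes the "catalog of accessed elements" trick work.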
It doesn't surprise me that people aren't seeking out book reviews on the AP website or app. I don't think AP is particularly associated with reviews, maybe deliberately, because they've historically been read in local papers that don't emphasize the AP sourcing. So people wanting reviews from a national source probably go to the NYT, WaPo, WSJ, the New Yorker, etc. first.
This is a tangent, but I wonder if they feel that they are just creating LLM training data and that few readers (even of Sunday papers) will actually read their reviews.