yonl (u/yonl) - Readit News

yonl commented on Launch HN: Hyprnote (YC S25) – An open-source AI meeting notetaker · Posted by u/yujonglee

yonl · a month ago

Congrats on the launch. I never understood why an AI meeting notetaker needed sota LLMs and subscriptions (talking about literally all the other notetakers) - thanks for making it local first. I use a locally patched up whisperx + qwen3:1.7 + nomic embed (ofcourse with a swift script that picks up the audio buffer from microphone) and it works just fine. Rarely i create next steps / sop from the transcript - i use gemini 2.5 and export it as pdf. I’ll give Hyprnote a try soon.

I hope, since it’s opensource, you are thinking about exposing api / hooks for downstream tasks.

yonl commented on Fetch-MCP: Playwright-Based MCP Server with Batch URL Fetching Support github.com/jae-jae/fetch-... · Posted by u/Sulfide6416

tomjen3 · 6 months ago

Cool, but playwright doesn’t use your cookies.

Increasingly I want to stop spending time on twitter, but it’s also where the AI news drops first - and I can’t just scrape the data without being logged in.

If there was a way to have the ai go ahead and gather the data for me, that would be great.

yonl · 6 months ago

I would agree to this point as well.

Speaking of implementation, i don’t mind if a browser extension forward cookies from my browser to the automation (privacy and security is an issue of course, and i’d ideally want the cookies to not leave my device, but personally i’m okay with some trade off).

yonl commented on Building AI agents to query your databases blog.dust.tt/spreadsheets... · Posted by u/vortex_ape

mritchie712 · 6 months ago

We (https://www.definite.app/) solved this a bit differently.

We spin up a data lake and pipelines (we support 500+ integrations / connectors) to populate the data lake for you then put DuckDB on top as a single query engine to access all your data.

yonl · 6 months ago

This is really interesting. At my previous company, I built a data lakehouse for operational reporting with recency prioritization (query only recent data, archive the rest). While there was no LLM integration when I left, I've learned from former colleagues that they've since added a lightweight LLM layer on top (though I suspect Dustt's implementation is more comprehensive).

Our main requirement was querying recent operational data across daily/weekly/monthly/quarterly timeframes. The data sources included OLTP binlogs, OLAP views, SFDC, and about 15 other marketing platforms. We implemented a datalake with our own query and archival layers. This approach worked well for queries like "conversion rate per channel this quarter" where we needed broad data coverage (all 17 integrations) but manageable depth (reasonable row scanned).

This architecture also enabled quick solutions for additional use cases, like on-the-fly SFDC data enrichment that our analytics team could handle independently. Later, I learned the team integrated LLMs as they began dumping OLAP views inside the datalake for different query types, and eventually replaced our original query layer with DuckDB.

I believe approaches like these (what I had done as in house solution and what definite may be doing more extensively) are data and query-pattern focused first. While it might initially seem like overkill, this approach can withstand organizational complexity challenges - with LLMs serving primarily as an interpretation layer. From skimming the Dustt blog, their approach is refreshing, though it seems their product was built primarily for LLM integration rather than focusing first on data management and scale. They likely have internal mechanisms to handle various use cases that weren't detailed in the blog.

yonl commented on Helix: a post-modern modal text editor helix-editor.com/... · Posted by u/bpierre

msoad · 4 years ago

I never really felt faster eliminating mouse from my coding workflow. Point and click to navigate things is pretty powerful. Why some devs try to avoid the mouse?

yonl · 4 years ago

Point and click to navigate is pretty slow compared to keyboard driven approaches. Realized after switching to vim. Probably because, after certain point, it just the reflex which drives vim.

yonl commented on Google Docs will now use canvas based rendering workspaceupdates.googlebl... · Posted by u/lewisjoe

skyde · 4 years ago

Can someone explain why https://makepad.dev/ is extremely slow and "unusable" on Microsoft Edge browser but run smoothly on Chrome? Is it because of bad WebGL perf on JavasSript perf in general?

yonl · 4 years ago

For me, it's not very fast in chrome.

yonl commented on Ask HN: How viable is this idea? · Posted by u/lifeplusplus

yonl · 5 years ago

In our org we have video tutorial for env setup, how to use tools etc. Used to use native video recorder + notion. Now we use https://www.letsflyby.com/. It's a general purpose recording and annotation tool (which helps bookmarking a section of the video of the codebase).

yonl commented on Show HN: I built a request payload validator for JavaScript based server github.com/adotg/helson/b... · Posted by u/yonl

yonl · 5 years ago

The issue I faced writing JS based Server is the amount of validation that goes behind request payload. Validation includes

- if the key is present

- if the value of the key is in given range (any filter function you can imagine)

- conditional key presence (Ex: if key1 is present key2 should be less than 10)

- complex nested structure of the payload

- array validation (length / type / property)

I end up write a significant amount of if else block just to validate if the payload is acceptable. So I wrote a tool a long time back which I use in all my JS based server project (My gateway servers are always in Node for past 3-4 years because of the amount of instrumentation / ease of debugging)

I wrote helson 2 years back, where a dev would define schema of a payload

``` typedef Payload { str "url": pass, []str "tags": pass, bool "isCollection": pass, obj "content": { str "body": strShouldNotBeEmpty | shouldBeOfMinimumLength 15 | shouldHaveMinimumWordLength 5, }, } ```

with primitive type support of str, bool, number and compound type support array, enum, ref (https://github.com/adotg/helson/blob/develop/test/helson.tes...) for type checking and custom function (strShouldNotBeEmpty, shouldBeOfMinimumLength above example) as value checking

And it tests against incoming payload

``` { url: '#/e/hash-of-a-link', tags: ['a1', 'a2', 'a3'], content: { body: 'This is a body', } ```

Now that I use typescript, I was in the verge of deciding (I don't have enough time solving just for myself as the value addition is not justified for myself, but if enough people wants it I'll build it) should I also build a typescript supported workflow. Like from a schema like above generate interfaces (and vice versa). Or is this lib meaningful to you at all.

It also throws meaningful error automatically (which you can override) to return to client directly