lysecret (u/lysecret)

lysecret commented on LLM Year in Review karpathy.bearblog.dev/yea... · Posted by u/swyx

lysecret · 5 days ago

It’s funny how every podcaster/public ai figure is so certain text as a Ui will go away and it’s not going anywhere.

lysecret commented on Mistral OCR 3 mistral.ai/news/mistral-o... · Posted by u/pember

loaf_api · 6 days ago

I've just finished processing thousands of documents using the Gemini Pro 3 vision model and it outperformed every OCR and image model I've tested by a long shot, perfect markdown with latex for the math every time.

lysecret · 5 days ago

3 flash is also insanely good even slightly outperforms 3 pro for me.

lysecret commented on Announcing the Beta release of ty astral.sh/blog/ty... · Posted by u/gavide

lysecret · 8 days ago

Super excited about this generally ok satisfied with pyright but so I was with conda before uv or black before ruff.

lysecret commented on I tried Gleam for Advent of Code blog.tymscar.com/posts/gl... · Posted by u/tymscar

socalgal2 · 11 days ago

The `echo` part seemed interesting but it made me think, debuggers need this as a built in feature. If I have

    array
      .slice(0, 10)
      .filter(s => s[0].toLowerCase() < 'm') // a<->l
      .map(s => s.toUpperCase());

It seems like it should be a common feature to be able to view between each array operation in a debugger without having to augment the code with `echo`

The out of bounds handling didn't seem all that good to me. Sure you can filter out undefined. You could also just make a function that returns an empty array if out of bounds, or array of 1 element if not.

     // JS
     function getElemFromGrid(grid, x, y) {
       return (x < 0 || x >= grid.width ||
               y < 0 || y >= grid.height)
         ? []
         : [grid.elems[y][x]]
     }
     
     ...
     
     neighbors = [
       ...getElemFromGrid(grid, x + 1, y + 0),
       ...getElemFromGrid(grid, x + 1, y + 1),
       ...getElemFromGrid(grid, x + 0, y + 1),
       ...getElemFromGrid(grid, x - 1, y + 1),
       ...getElemFromGrid(grid, x - 1, y + 0),
       ...getElemFromGrid(grid, x - 1, y - 1),
       ...getElemFromGrid(grid, x + 0, y - 1),
       ...getElemFromGrid(grid, x + 1, y - 1),
     ]

I also find grids made of 2 dimensional array to be code small. An array of arrays is NOT A GRID as there is nothing enforcing the inner arrays to be the same length as each other. Also, in efficienct code it would be better to provide a 1 dimensional array and bounds. Now, out of bounds checks based on accessing outside the array won't work. You need to check actual bounds. Again giving preference using a helper

lysecret · 11 days ago

When I was debugging through f# code they definitely had that.

lysecret commented on Getting a Gemini API key is an exercise in frustration ankursethi.com/blog/gemin... · Posted by u/speckx

lysecret · 14 days ago

I see this, a hot take form my side as someone who is bought in to GCP i quite like being able to put everything on the same billing account / handle it easily through service accounts.

lysecret commented on Gemini 3 Pro: the frontier of vision AI blog.google/technology/de... · Posted by u/xnx

MostlyStable · 20 days ago

Going to compare this to our current solution of Amazon's Textract service for analyzing handwritten datasheets. Textract, when extracting tables (which is what we use it for) does not allow for providing any context or information about the tables and what we expect them to contain, but it is really good at correctly recognizing hand written characters. All of my attempts at less specialized, more general models allow me to provide that context, which is helpful in some ways, but fail at the basic part of almost always correctly getting the character.

Hopefully Google pro marries the two together.

lysecret · 19 days ago

2.5 pro is already excellent at this.

lysecret commented on OpenAI needs to raise at least $207B by 2030 ft.com/content/23e54a28-6... · Posted by u/akira_067

0xbadcafebee · a month ago

I just realized that other industries are way larger than AI. Assuming they capture the entire advertising market, only $390 Billion was spent in the US last year. Compare that to health care, where 4.3 Trillion was spent in the US last year, or commercial banking's revenue of 1.5 Trillion, commercial real estate's 1.5 Trillion, gasoline stations' 1.10 Trillion, etc. What's amazing is, despite the fact that AI isn't making much money, is taking on considerable debt, and isn't even assured to be all that useful, one third of the stock market is now just AI crap. The economy is going to collapse because of a small, brand-new industry. This... shouldn't be possible.

lysecret · a month ago

Now compare on free cash flow

lysecret commented on Building a Durable Execution Engine with SQLite morling.dev/blog/building... · Posted by u/ingve

roughly · a month ago

One thing that needs to be emphasized with “durable execution” engines is they don’t actually get you out of having to handle errors, rollbacks, etc. Even the canonical examples everyone uses - so you’re using a DE engine to restart a sales transaction, but the part of that transaction that failed was “charging the customer” - did it fail before or after the charge went through? You failed while updating the inventory system - did the product get marked out or not? All of these problems are tractable, but once you’ve solved them - once you’ve built sufficient atomicity into your system to handle the actual failure cases - the benefits of taking on the complexity of a DE system are substantially lower than the marketing pitch.

lysecret · a month ago

Yep fully agreed the main thing is to break apart the systems so any retries don’t lead to issues like you mentioned.

I do still think there is sufficient amount of boilerplate to potentially justify some engine like this.

lysecret commented on Building more with GPT-5.1-Codex-Max openai.com/index/gpt-5-1-... · Posted by u/hansonw

spmartin823 · a month ago

I still want something no one has, which is the ability to launch agents in different git worktrees simultaneously and check the results out on my main branch for testing when they are finished.

lysecret · a month ago

Cursor has this too

lysecret commented on WeatherNext 2: Our most advanced weather forecasting model blog.google/technology/go... · Posted by u/meetpateltech

counters · a month ago

To add to the existing answers - L2 losses induce a "blurring" effect when you autoregressively roll out these models. That means you not only lose import spatial features, you also truncate the extrema of the predictions - in other terms, you can't forecast high-impact extreme weather with these models at moderate lead times.

lysecret · a month ago

Yes very good point this to me is one of the most magical elements of this loss how it suddenly makes the model "collapse" on one output and the predictions become sharp.