Readit News logoReadit News
FailMore commented on LLMs aren't world models   yosefk.com/blog/llms-aren... · Posted by u/ingve
FailMore · 17 days ago
It seems clear to me that LLMs are a useful sort of dumb smart activity. They can take some pretty useful stabs in the dark, and therefore do much better in an environment which can give them feedback (coding) or where there is no objective correct answer (write a poem). It opens the door for some novel type of computational tasks, and the more feedback you can provide within the architecture of your application, the more useful the LLM will probably be. I think the hype of their genuine intelligence is overblown, but doesn’t mean they are not useful.
FailMore commented on Ask HN: Who is hiring? (August 2025)    · Posted by u/whoishiring
FailMore · a month ago
London | In person/hybrid or Abroad | Remote

Magic position for people who know HEAPs about the playwright testing framework.

We are an early stage company with a different, and we hope refreshing attitude, towards work.

We think it’s okay to not be obsessed with your job. We think if you’re a bit older (out of your 20s) you might be more of a guide for others (a Gandalf) than a power train (we hire young people for that energy who are eager to learn from you). We invest a lot in our team. We pay for you to have therapy (optional), but we want you to have it as we think you’ll be happier, calmer and “unblock” yourself. We spend half a day a week reading documentation and doing nothing else, and another full day coaching the team on cutting edge tech (how to design, build, train, deploy ai models).

We’re based in London, but open to applications from anywhere.

The main thing is that you have a well of knowledge about the open source playwright framework!

Personal email in bio

FailMore commented on Show HN: Vibe Kanban – Kanban board to manage your AI coding agents   github.com/BloopAI/vibe-k... · Posted by u/louiskw
FailMore · 2 months ago
Do you think you will keep it free or can you see a business model developing around it? If so, what do you think it would be? / How would you split paid tiers vs free users? Not a big deal to me...!! But I'm curious how one might commercialise these types of free/open source projects
FailMore commented on Ask HN: What Are You Working On? (June 2025)    · Posted by u/david927
diarmuid_glynn · 2 months ago
Working on two projects right now:

- LegalJoe: AI-powered contract reviews for startups, at the "tech demo" phase right now: https://www.legaljoe.ai/

- ClipMommy: A macOS tool to help (professionals who record a lot of videos | influencers) organize their raw video clips. Simply drag a folder of "disorganized" videos onto ClipMommy, and ClipMommy organizes the videos into folders / subfolders, adding tags, based upon some special statements that you can make at either the start or the end of your video (think audio-based "clapboard"). I'm expecting to release this within a week or two on the Mac App Store (Apple allowing...).

As an aside, I've been very impressed with Claude Code, it's (for me at least!) leading the way for how the next generation of business software might leverage AI. I plan to iterate on LegalJoe to make more "agentic" as a result of what I've seen is possible in Claude Code.

FailMore · 2 months ago
Legal Joe looks great. Nice video. Don't need it now, but it seems very useful
FailMore commented on Gemini CLI   blog.google/technology/de... · Posted by u/sync
simonw · 2 months ago
FailMore · 2 months ago
You must be a busy man! Always new tools to review. How did you get interested in doing this?
FailMore commented on AI Saved My Company from a 2-Year Litigation Nightmare   tylertringas.com/ai-legal... · Posted by u/anitil
FailMore · 3 months ago
Thank you, that was an interesting read and gave me a perspective on something I knew nothing about
FailMore commented on In defense of shallow technical knowledge   seangoedecke.com/shallow-... · Posted by u/swah
FailMore · 3 months ago
Here here!
FailMore commented on Mistral Agents API   mistral.ai/news/agents-ap... · Posted by u/pember
FailMore · 3 months ago
Is this basically a LLM that has tools automatically configured so I don’t have to handle that myself? Or am I not understanding it correctly? As in do I just make standard requests , but the LLM does more work than normal before sending me a response? Or I get the response to every step?

u/FailMore

KarmaCake day1590March 5, 2013
About
eichler [dottttyt] summers [attttty] gmail [dottttt] com
View Original