taw1285 (u/taw1285) - Readit News

taw1285 commented on We put a coding agent in a while loop github.com/repomirrorhq/r... · Posted by u/sfarshid

taw1285 · 7 days ago

This is so amazing. Are there any resources or blogs on how people do this for production services? In my case, I need to rewrite a big chunk of my commerce stack from Ruby to Typescript.

taw1285 commented on Gemini Embedding: Powering RAG and context engineering developers.googleblog.com... · Posted by u/simonpure

stillpointlab · a month ago

> Embeddings are crucial here, as they efficiently identify and integrate vital information—like documents, conversation history, and tool definitions—directly into a model's working memory.

I feel like I'm falling behind here, but can someone explain this to me?

My high-level view of embedding is that I send some text to the provider, they tokenize the text and then run it through some NN that spits out a vector of numbers of a particular size (looks to be variable in this case including 768, 1536 and 3072). I can then use those embeddings in places like a vector DB where I might want to do some kind of similarity search (e.g. cosine difference). I can also use them to do clustering on that similarity which can give me some classification capabilities.

But how does this translate to these things being "directly into a model's working memory'? My understanding is that with RAG I just throw a bunch of the embeddings into a vector DB as keys but the ultimate text I send in the context to the LLM is the source text that the keys represent. I don't actually send the embeddings themselves to the LLM.

So what is is marketing stuff about "directly into a model's working memory."? Is my mental view wrong?

taw1285 · a month ago

Your comment really helps me improve my mental model about LLM. Can someone smarter help me verify my understanding:

1) at the end of the day, we are still sending raw text over LLM as input to get output back as response.

2) RAG/Embedding is just a way to identify a "certain chunk" to be included in the LLM input so that you don't have to dump the entire ground truth document into LLM Let's take Everlaw for example: all of their legal docs are in embeddings format and RAG/tool call will retrieve relevant document to feed into LLM input.

So in that sense, what do these non-foundational models startups mean when they say they are training or fine tuning models? Where does the line end between inputting into LLM vs having them baked in model weights

taw1285 commented on Databricks in talks to acquire startup Neon for about $1B upstartsmedia.com/p/scoop... · Posted by u/ko_pivot

taw1285 · 4 months ago

I am fairly new to all this data pipeline services (Databricks, Snowflakes etc).

Say right now I have an e-commerce site with 20K MAU. All metrics are going to Amplitude and we can use that to see DAU, retention, and purchase volume. At what point in my startup lifecycle do we need to enlist the services?

taw1285 commented on Ask HN: Has anyone quit their startup (VC-backed) over cofounder disagreements? · Posted by u/stuck12345

nsypteras · 5 months ago

Cofounder splits are extremely common. Cofounder "couples" counselors are a thing you could look into to help resolve your differences. Your VC might have recommendations for one. If you ultimately decide to split, I'd recommend at least one of you (or "the company"?) getting a lawyer to draw up a formal separation agreement you both sign in order to split in the cleanest possible way.

taw1285 · 5 months ago

This is very interesting to me. From this thread: https://news.ycombinator.com/item?id=43472971, I am wondering if there are anecdotal stories of how equity is being handled after a split.

On one hand, if the leaving co-founder retains all equity, it creates a sandbagging situation on a cap table that's no longer useful to the business. On the other hand, it feels right for the leaving co-founder to enjoy some upside for the years they put in.

taw1285 commented on Write to Escape Your Default Setting kupajo.com/write-to-escap... · Posted by u/kolyder

agentultra · 6 months ago

I heard once that, "Writing is thinking," which has stuck with me throughout my life.

You really haven't thought about it hard enough if you haven't tried writing it down.

I have a whole system of journals that I use to collect my thoughts across various subjects I dabble in. Algorithms: there's a journal for that. Abstract algebra? There's a journal for that. Etc.

At work? I use bullet journal... I add sections in for projects I'm working on. When I'm working on refactoring an old area of the code or investigating a hard-to-diagnose error I start writing. I ask questions, get answers, and I update my project journal. It helps me clarify the issue and I find once I can explain the system or the error clearly the answers (or how to find them) becomes obvious.

It may seem quaint, eccentric, or out-dated but it's a practical, reliable tool. Ask questions and write down the answers. Eventually a coherent narrative and a full thought will form before you.

taw1285 · 6 months ago

I want to get better at taking project notes for work via Obsidian. I'm curious if you have a different page per project or do you just put everything in the same giant log? I like the idea of organizing it, but it takes me a bit of time to find out which notebook it should go under.

taw1285 commented on Show HN: PurePlates – A Recipe Scraping iOS App apps.apple.com/ca/app/pur... · Posted by u/CZubrecki

taw1285 · a year ago

Love it! It would be cool to be able to auto tag cuisine type. Did you use an LLM to scrape and parse receipt details?

taw1285 commented on Does your startup need complex cloud infrastructure? hadijaveed.me/2024/09/08/... · Posted by u/hjaveed

taw1285 · a year ago

> 20-30 Lambda functions for different services

My team of 6 engineers have a social app at around 1,000 DAU. The previous stack has several machines serving APIs and several machines handling different background tasks. Our tech lead is forcing everyone to move to separate Lambdas using CDK to handle each each of these tasks. The debugging, deployment, and architecting shared stacks for Lambdas is taking a toll on me -- all in the name of separation of concerns. How (or should) I push back on this?

taw1285 commented on Speeding Up Your Website Using Cloudflare Cache pillser.com/engineering/2... · Posted by u/lilouartz

taw1285 · a year ago

does this approach make sense for an app-router based nextjs app hosted on vercel?

taw1285 commented on Show HN: InstantDB – A Modern Firebase github.com/instantdb/inst... · Posted by u/nezaj

taw1285 · a year ago

This looks fantastic. I want to recommend this to my team. We are a small consulting team building apps for clients. I have a few questions to help me pitch my team and clients better: 1. the usual "vendor locked in". Is there a recommended escape hatch? 2. any big clients on this yet or at what scale do you expect people to start rolling their in house product