Readit News logoReadit News
langcss commented on Llama 3.1 Omni Model   github.com/ictnlp/LLaMA-O... · Posted by u/taikon
falcor84 · a year ago
Interesting, I've actually been using gpt4o extensively for OCR and didn't encounter any significant issues - could I ask you to please give an example of an image of (otherwise legible) text that it hallucinates on?
langcss · a year ago
I'll send you an email.
langcss commented on Show HN: I built your next webapp, monitoring, and CI/CD for you, on your stack    · Posted by u/abewheeler
abewheeler · a year ago
Thanks for flagging that! The intent is that you're welcome (and encouraged!) to use your launchpad as many times as you like, for as many projects as you like.

However, that code will slowly become out-of-date in the form of deprecations, dependency updates, new launchpad features, and a changing tech landscape. If you always want the most up-to-date starting point, that requires another (discounted) purchase, or the ongoing support plan!

Does that make sense?

langcss · a year ago
It does thanks. Sounds like "support plus access to future versions when released"

A pricing table might help here.

langcss commented on Show HN: I built your next webapp, monitoring, and CI/CD for you, on your stack    · Posted by u/abewheeler
langcss · a year ago
A point of confusion is where it says both:

》Want unlimited launches of the latest launchpads? Need ongoing support? Reach out to (email)

》Reuse your code as many times as you like.

Is the first one a cloud offering and second referring to self hosted?

langcss commented on Llama 3.1 Omni Model   github.com/ictnlp/LLaMA-O... · Posted by u/taikon
a2128 · a year ago
There's a lot of data loss and guessing with STT/TTS.

An STT model might misrecognize a word, but an audio LLM may understand the true word because of the broad context. A TTS model needs to guess the inflection and it can get it completely wrong, but an audio LLM could understand how to talk naturally and with what tone (e.g. use a higher tone if it's interjecting)

Speaking of interjection, an STT/TTS system will never interject because it relies on VAD and heuristics to guess when to start talking or when to stop, and generally the rule is to only talk after the user stopped talking. An audio LLM could learn how to conversate naturally, avoid taking up too much conversation time or even talk with a group of people.

An audio LLM could also produce music or sounds or tell you what the song is when you hum it. There's a lot of new possibility

I say "could learn" for most of this because it requires good training data, but from my understanding most of these are currently just trained with normal text datasets synthetically turned into voice with TTS, so they are effectively no better than a normal STT/TTS system; it's a good way to prove an architecture but it doesn't demonstrate the full capabilities

langcss · a year ago
You need a lot more power. I found gpt4o struggles doing basic OCR of printed text by hallucinating alot, while tesseract engine (old skool) gets it perfect. You need the model to be powerful enough to do everything.

You can work around this by the way by sending the output through a checking stage.

So picture -> gpt4o -> out1, picture -> tesseract -> out2, out1,out2 -> llm.

Might work for sound too.

langcss commented on Show HN: Ultra-minimalist web framework, a year after release   renomad.com/blogposts/fir... · Posted by u/7ep
langcss · a year ago
The built in DB makes this very interesting to me. Although I would prefer sqlite. The low LOC count is nice to know.

Minimilism is something that is hard to get right on the web. The cliche is people avoid django because it is too complex then hand build all its features anyway over time!

However some jobs are minimal esp. internal dashboards so there is a place for it. You may find fasthtml which is also minimal interesting.

Curious: why Java and not Go? Go seems more in line with being minimal. Especially for deployment.

langcss commented on How to Lead Your Team When the House Is on Fire   peterszasz.com/how-to-lea... · Posted by u/kiyanwang
drekipus · 2 years ago
That actually reminds me of a great story, for when I had to do exactly what the title is suggesting.. back when I worked at Amazon as a software engineer, the CRAZIEST thing happened to me. Here’s the story… I was working from home with my girlfriend (at the time), when suddenly I get an urgent ping from my coworker: “Our service is experiencing a SEV 2! We need all hands on deck!” Uh oh, our team’s application has gone down! However, as I scrambled to figure out how to fix the issue, I smelled something burning from another room and heard a fire alarm go off. “Will! There’s a fire! Help!” I heard my girlfriend shout. Now I was stuck in a conundrum — restore a critical Amazon service, or put out the fire in my apartment? It was at that time I remembered Amazon’s famous leadership principle “Customer Obsession”. There are customers who depend on my team’s application — I can’t let them down! So I ignored the fire and my girlfriend’s pleas, and started debugging the production issue. But all of a sudden, the smoke in my apartment cleared and the fire alarm fell silent. My girlfriend walked into the room, and to my astonishment, peeled off a wig and revealed herself to be Jeff Bezos himself! “I’m proud of you for being obsessed with our customers,” he said, and gave me a $5 Amazon gift card. He then leaped out of my window and hopped into a waiting Amazon Prime delivery van that quickly peeled away. Even though I no longer work at Amazon, I’m so grateful for these experiences that taught me lessons I’ll never forget. Agree?
langcss · 2 years ago
And that day, I was told I passed the Amazon interview and was offered a job as a L4 Software Engineer.

Deleted Comment

langcss commented on Terence Tao on O1   mathstodon.xyz/@tao/11313... · Posted by u/dselsam
JumpCrisscross · 2 years ago
The first profession AI seems on track to decimate is programming. In particular, the brilliant but remote and individual contributor. There is an obvious conflict of interest in this forum.
langcss · 2 years ago
I believe LLMs decimating the role of a software engineer requires AGI, which the second that happens decimates all jobs.

What it may do is change the job requrements. Web/JS has decimated (reduced by 90% or more) MFC C++ jobs after all.

The programmer doesnt just write Python. That is the how... not the what.

langcss commented on Ask HN: Is a Masters in CS/Engineering worth it for a mid-career PM?    · Posted by u/deeptechdreamer
langcss · 2 years ago
For clarity: do you want to be a software engineer, platform engineer, data scientist, ML expert or something else?
langcss commented on Show HN: A whiteboard that writes math equations   whiteboard.lasky.io/... · Posted by u/Ilasky
langcss · 2 years ago
I asked it to prove Fermats Last Theorem and it started writing (some nonsense) into the margin. Which was funnny.

Cool project tho

u/langcss

KarmaCake day233April 24, 2024
About
LangCSS makes it easier to get custom Tailwind designs.

Home: https://langcss.com

Demo: https://langcss.com/demo

Contact me on "info", at the domain above.

langcss.at.hn

View Original