Readit News logoReadit News
cr4zy commented on GPT-4.1 in the API   openai.com/index/gpt-4-1/... · Posted by u/maheshrijal
lxgr · 8 months ago
As a ChatGPT user, I'm weirdly happy that it's not available there yet. I already have to make a conscious choice between

- 4o (can search the web, use Canvas, evaluate Python server-side, generate images, but has no chain of thought)

- o3-mini (web search, CoT, canvas, but no image generation)

- o1 (CoT, maybe better than o3, but no canvas or web search and also no images)

- Deep Research (very powerful, but I have only 10 attempts per month, so I end up using roughly zero)

- 4.5 (better in creative writing, and probably warmer sound thanks to being vinyl based and using analog tube amplifiers, but slower and request limited, and I don't even know which of the other features it supports)

- 4o "with scheduled tasks" (why on earth is that a model and not a tool that the other models can use!?)

Why do I have to figure all of this out myself?

cr4zy · 8 months ago
For code it's actually quite good so far IME. Not quite as good as Gemini 2.5 Pro but much faster. I've integrated it into polychat.co if you want to try it out and compare with other models. I usually ask 2 to 5 models the same question there to reduce the model overload anxiety.
cr4zy commented on My Response to Superintelligence Strategy   nationalsecurityresponse.... · Posted by u/cr4zy
cr4zy · 8 months ago
I discuss how the automation wave is already starting with white-collar job openings at a 12 year low in the U.S. I also talk about how we cannot simply count on taxing AI to support automated workers, as countries that don't tax will outcompete those who do. We therefore need international cooperation, based in MAIM, from the original Superintelligence Strategy paper.

I also discuss how bioweapons cannot be avoided via restricting open weight models as originally suggested in Dan's paper. Rather we need to heavily invest in bioweapon defense, and in particular use AI for wastewater monitoring and accelerating metagenomics (detangling mixed DNA).

cr4zy commented on Show HN: Chat with multiple LLMs: o1-high-effort, Sonnet 3.5, GPT-4o, and more   polychat.co... · Posted by u/cr4zy
cr4zy · a year ago
So it looks like some folks are getting errors with the non-streaming models, i.e. the o1 models. I think their long running cxns with zero packets may cause some networks to drop the requests. Will look into a hearbeat/keepalive on those.
cr4zy · a year ago
I've added a "Thinking...." which sends server side events to keep the cxn open. Would love to hear if o1 models now work for anyone who they were broken for.
cr4zy commented on Show HN: Chat with multiple LLMs: o1-high-effort, Sonnet 3.5, GPT-4o, and more   polychat.co... · Posted by u/cr4zy
liberix · a year ago
I've noticed that you offer an API key under Settings / Account. How does the API work? Is there any documentation? I'd also like to see the pricing details for the different plans.
cr4zy · a year ago
Open WebUI does offer an API, but I have it disabled for PolyChat.
cr4zy commented on Show HN: Chat with multiple LLMs: o1-high-effort, Sonnet 3.5, GPT-4o, and more   polychat.co... · Posted by u/cr4zy
iandanforth · a year ago
You should also note that Chorus allows you to bring your own API keys and interact with local models. Both features I very much appreciate!
cr4zy · a year ago
One tradeoff of bringing your own API keys is that as you add more model providers, you get more billing accounts to deal with. Chorus also doesn't have an incentive to efficiently use your tokens. We save 67% on Anthropic token costs using Claude Caching. We also use cheaper "task models" for conversation titles, tagging, and parts of the RAG pipeline which all drastically cuts token costs.

For local models I highly recommend https://github.com/crizCraig/open-webui

They do the side by side thing that Chorus does and you can serve it to anywhere including your phone.

cr4zy commented on Show HN: Chat with multiple LLMs: o1-high-effort, Sonnet 3.5, GPT-4o, and more   polychat.co... · Posted by u/cr4zy
cr4zy · a year ago
So it looks like some folks are getting errors with the non-streaming models, i.e. the o1 models. I think their long running cxns with zero packets may cause some networks to drop the requests. Will look into a hearbeat/keepalive on those.
cr4zy commented on Show HN: Chat with multiple LLMs: o1-high-effort, Sonnet 3.5, GPT-4o, and more   polychat.co... · Posted by u/cr4zy
moralestapia · a year ago
Unlimited free OpenAI o1?

OP, isn't that really expensive to maintain?

cr4zy · a year ago
It's not unlimited free unfortunately. After some free use, we provide monthly usage-tier plans. But there are no rate limits like other providers as you can move up to the next usage-tier.
cr4zy commented on Show HN: Chat with multiple LLMs: o1-high-effort, Sonnet 3.5, GPT-4o, and more   polychat.co... · Posted by u/cr4zy
noahjk · a year ago
Sounds like you made most of these changes upstream? What about the background chats and the chat tree overview, are either of them in Open WebUI, or are they also custom to PolyChat? I run OWUI locally and am interested in those features for selfish reasons. If for some reason your multi-model idea doesn’t pan out, I’d love to see it merged upstream, too. Thanks for your contributions!

(Another annoying thing about OWUI is getting logged out every time the image upgrades… is that something else you’ve looked at?)

cr4zy · a year ago
Background chats are new in v5 of Open WebUI, so you can use it too. Overview has been there also, but it's kind hidden in the hamburger menu.

The upgrade/logout issue you're facing is likely due to not setting WEBUI_SECRET_KEY outside of your docker container. This causes all previous cookies to be unreadable as a new key will get generated by start.sh and won't decrypt the old cookies.

cr4zy commented on Show HN: Chat with multiple LLMs: o1-high-effort, Sonnet 3.5, GPT-4o, and more   polychat.co... · Posted by u/cr4zy
hoerzu · a year ago
Created an open source alternative just using the browser and not sharing your data with a third party: https://chromewebstore.google.com/detail/tabgpt-ask-chatgpt-...
cr4zy · a year ago
Cool! I should say most of PolyChat is open source at https://github.com/open-webui - just the combo models and payment are closed source right now. Open to arguments on making PolyChat fully open source as well!

u/cr4zy

KarmaCake day1612October 28, 2011
About
gmail cquiter @crizCraig : deepdrive.io
View Original