Readit News logoReadit News
Posted by u/palmdeezy 3 years ago
Show HN: AI Playground by Vercel Labsplay.vercel.ai...
Hey, Jared Palmer (creator of this playground) here. Really excited to ship this. I’ve been building this over the past few weeks to compare LLMs from different providers like OpenAI, Anthropic, Cohere, etc. At Vercel, I manage our Frameworks division (including Next.js, Svelte, and Turbo) and wanted to also dogfood some of the latest features in a slightly larger application. This playground takes a lot of inspiration from https://nat.dev and is built on Tailwind, ui.shadcn.com, and some upcoming Vercel products we’re announcing soon. We’re going to continue adding models to compare and add other frameworks to generate code snippets from.
rendall · 3 years ago
As a joke, I wrote "Give me a disruptive startup idea". This is what claude wrote:

  Here is a potentially disruptive startup idea:

  • On-demand drone delivery for e-commerce. The idea is to offer ultra fast delivery of small packages via drones. This could significantly reduce delivery times for e-commerce orders and disrupt the traditional shipping industry. Some key challenges to overcome would be regulatory approvals, safety, noise pollution, and scaling up a drone delivery network. But if executed well, it could transform how people receive goods and packages.

  A few other disruptive ideas:

  • An Uber-like model for renting out spare rooms or couches for short term stays. This could disrupt the hotel industry. 

  • A Netflix model for streaming live TV channels over the internet. This could disrupt the cable TV industry.
  
  • An automated vertical farming system to grow organic food locally in urban areas. This could disrupt the traditional agriculture and food supply chain.

  • Decentralized blockchain-based social network. The idea is to build a social network that is not owned or controlled by any single company. It would be an open, decentralized platform built on blockchain technology. Users would have more control and ownership over their data and profiles. The business model could be built around micro-payments or subscriptions. It could disrupt the dominance of companies like Facebook and offer an alternative distributed social networking experience

  • A virtual reality social network. If VR/AR technologies become more mainstream,
I mean, they are disruptive startup ideas, sure enough. GPT-4 didn't bother answering.

spaceman_2020 · 3 years ago
> GPT-4 didn't bother answering.

Honestly, a real sign of intelligence :D

rpastuszak · 3 years ago
I remember some people from Vooza pitching "Tinder for Drones" during a conference 10 years or so ago!

(Truly disruptive if we manage to turn Universal Paperclips into reality.)

IanCal · 3 years ago
Here's gpt4, it's formatting is better but this is the text. Though this isn't really the right way to use it, single responses miss a large amount of the potential imo.

Startup Idea: SmartUpCycle

Overview:

SmartUpCycle is a disruptive startup aimed at revolutionizing the way we handle waste and promote sustainability through upcycling. The platform uses advanced AI and machine learning algorithms to intelligently identify and match discarded materials with potential new uses, connecting individuals and businesses with creative solutions for repurposing waste.

Key Features:

AI-Powered Material Matching: The platform utilizes AI to analyze and categorize various types of waste materials, identifying their potential for upcycling and suggesting creative and functional uses for them. This allows users to find new life for discarded materials, reducing waste and contributing to a circular economy.

Marketplace for Upcycled Goods:

SmartUpCycle provides a marketplace for users to buy and sell upcycled products, offering a wide range of unique, environmentally-friendly, and innovative items. This encourages more people to engage with upcycling and supports the growth of a new sustainable industry.

Waste Collection & Redistribution Network:

The startup collaborates with local waste management companies, recycling centers, and businesses to collect and redistribute waste materials to users, promoting a circular economy and reducing the need for new raw materials.

Educational Resources & Community:

SmartUpCycle offers educational resources, tutorials, and workshops to teach users how to upcycle waste materials and create new products. Additionally, the platform fosters a community of like-minded individuals who can share ideas, collaborate on projects, and inspire each other to embrace sustainability.

Business Integration:

The platform offers integration solutions for businesses, helping them reduce waste, lower costs, and adopt sustainable practices. By providing access to a network of upcyclers and a marketplace for upcycled goods, businesses can transform their waste into revenue-generating products.

Revenue Model:

SmartUpCycle generates revenue through various channels, including:

Transaction fees on the marketplace for upcycled goods.

Subscription fees for premium features, such as advanced AI recommendations and business integration services.

Sponsorships and partnerships with waste management companies, recycling centers, and eco-friendly brands.

Fees from workshops and educational resources.

Impact:

SmartUpCycle promotes sustainability by reducing waste, conserving resources, and encouraging a circular economy. By connecting people and businesses with creative upcycling solutions, the platform fosters innovation, supports the growth of a new sustainable industry, and contributes to a healthier planet

phnofive · 3 years ago
> The platform uses advanced AI and machine learning algorithms to intelligently identify and match discarded materials with potential new uses, connecting individuals and businesses with creative solutions for repurposing waste.

In other words, GPT-4

jutrewag · 3 years ago
What do you mean by GPT4 didn’t bother responding. It has better ideas that this when I just asked it. This is generic af. Are you a Claude shill or something.
neom · 3 years ago
I dunno what Cohere is, but it's answers are super weird. I asked it "Where does the phrase "the rain in spain falls mainly on the plane" come from?" and it said:

Does anybody know a green song? What happens if a girl doesn't have a twin brother? What musical instrument did larry black sing 'jealous heart' on? Is country music more country than country music. I'm confused.? What is the difference between a copyright and a trademark? What is the best piano chords software u can get for beginners? What is the best music software for beginners? What is the song from the new dexter episode? Did he kill his brother? What is the name of the song dueling banjos? Was the raiders of the lost ark the first jungle hero movie? Who really invented reverse tape effect? Why can't me and my neighbor get along? What prophet spoke to the teachers of the law at jerusalem about their treatment of jesus? Are there any songs that describe the emotions of a cult member? What is the most ugly song ever made?

jamilton · 3 years ago
IIRC, GPT-2 and 3 are somewhat like this, asking a question often results in it giving you more questions instead of an answer, unless you give. ChatGPT and GPT-4's tendency to answer questions instead of continuing them comes from the RLHF, I believe.
spacemanspiff01 · 3 years ago
Clearly the most advanced model we've seen to date.

It's already trolling you.

quickthrower2 · 3 years ago
And the comment means the next bot gets trained on this answer
jayalammar · 3 years ago
For Cohere, make sure you're using Command-Xlarge-Nightly.

Otherwise, you may be prompting a Base LLM expecting the behavior of a different kind of LLM (an instruction-tuned chat model).

Cohere's Command model builds on top of the base model, giving it the capability to follow instructions and user commands.

d1sxeyes · 3 years ago
The rain in Spain falls mainly on the plain. Not sure if it makes a difference to the response.
IanCal · 3 years ago
Coheres model is Claude and it's a completion style one. So it's trying to carry on what you wrote, and a list of questions is reasonable. It worked just fine for me copying and pasting but you might try this format:

Q: your question

A:

Then it is more likely to fill in an answer for you.

jayalammar · 3 years ago
Cohere actually trains its own models and they are not based on models from other providers [I work at Cohere].

Your prompt suggestion is a good one for LLMs as a whole. Any information added to the context informs the model and nudges it towards the expected answer format.

IanCal · 3 years ago
I got mixed up with the models, Claude is from anthropic.
sangnoir · 3 years ago
Just like your own question - it seems to me it's riffing on "Bar Quiz" type questions. Have tried the 'Question:'...'Answer:' prompt format?
jozvolskyef · 3 years ago
You didn't tell it to act as an assistant so it just assumes that your input is some list of questions that it's supposed to complete.
psygn89 · 3 years ago
When I asked if elephant farts were loud it answered with a lot more detail than ChatGPT.
rambojohnson · 3 years ago
it seems that Cohere might be suggesting that the phrase is as devoid of meaning as its unrelated questions, considering that the phrase was originally just a vocal exercise for a character in a film.
gardenhedge · 3 years ago
You manage the Frameworks division (including Next.js, Svelte, and Turbo)? How did you get the time to build this?
onion2k · 3 years ago
Speaking about high-level strategic roles in general (eg CTO), assembling a good team that you trust to get on with things means you don't need to spend a huge amount of time managing them on a day-to-day basis. If you do your job well you can get to a point where 'managing' is mostly about steering the direction of the team - figuring out the answers to questions about 'why' and 'what' you're doing as a team. The team members themselves can deal with the 'how' part. That shouldn't be your job.

The 'why' and the 'what' don't actually change very often (hopefully!) so you can have plenty of time for experimentation and 'play' like this. This is a part of how you learn and answer the 'why' and 'what' parts of your job.

syspec · 3 years ago
A growing backlog
yding · 3 years ago
Helps when your boss is a prolific coder himself.
reducesuffering · 3 years ago
Right. It seems like the entire executive team is technical and producing demos.
zild3d · 3 years ago
Using GPT4
atleastoptimal · 3 years ago
How much does it cost to query these models all the time? It's offering a lot for free.
Trystanr · 3 years ago
There is a rate limit.
jondwillis · 3 years ago
Even still. GPT4 in particular gets expensive at scale.

Vercel is probably noticing that the LLM community is using NextJS (and Vercel) by default a lot of times, and that supporting the community and building more goodwill will pay off.

doodlesdev · 3 years ago
Please provide an icon that makes sense on other platforms for the shortcut on the right bottom, not everyone uses Macs! Overall, pretty cool idea, how is Vercel gonna handle API costs for this? I imagine this is pretty expensive to run, specially considering GPT4 access.

Also, I have to link this here because I thought it was too funny:

https://play.vercel.ai/r/G9y3eXJ

edit: Oh I just hit the token limit, I guess that's how you're dealing with cost lol. Something I was gonna suggest is that maybe answers should be cached to prevent repeated calls to the same API when the prompt hasn't changed, this could potentially increase the amount of tests I could do.

steventey · 3 years ago
"How many words are there in "The Star Spangled Banner"?" LOL hahaha

Will relay your feedback about the icon back to the team - thank you for trying this out!

swyx · 3 years ago
beautiful work as always jared!

one of the useful things from nat.dev was the tokens/second calculation, latency is impt as usual. would be nice to enable?

i'm excited to see what the upcoming products are. making things default sharable like this across all frameworks would be really nice.

palmdeezy · 3 years ago
Thanks man! Good idea on token/s. Will add
tough · 3 years ago
Would be nice to know cost too!

tokmon is on showHN today too

https://news.ycombinator.com/item?id=35616871

cramforce · 3 years ago
I'm super excited in graphing this over time. Will be interesting to see how the providers develop over the coming months. FWIW we also need TTFB not just tokens per second. The first token is key for UX
etaioinshrdlu · 3 years ago
How did Anthropic agree to let anonymous users call their API? This was a big stumbling block for me.
ShamelessC · 3 years ago
Anything to boost the ole DAU's.
zurfer · 3 years ago
Thanks for setting up a limited free version. The UX is great.

I had some failures for huggingface and cohere. Replicate produced an output that was not rendered (but returned an answer in the network tab).

It would also be great to add: https://docs.aleph-alpha.com/docs/introduction/luminous/

palmdeezy · 3 years ago
Investigating now. Thanks for the feedback!
palmdeezy · 3 years ago
Fixed Cohere and Replicate. Will add the other provider now! Appreciate the help!