In what way did they "release" this? I can't find it on Hugging Face or Ollama, and they only seem to have a "try online" link in the article. "Self-sovereign intelligence", indeed.
They released it in the same sense OpenAI released GPT-4. There is an online demo you can chat with, and a form to get in touch with sales for API access.
Facebook trained the model on an Internet's worth of copyrighted material without any regard for licenses whatsoever. Even if model weights are copyrightable, which is an open question, you'd be doing the exact same thing they did. Probably not a bulletproof legal defense, though.
Whether you bake the behaviour in or wrap it in an external loop, you need to train/tune for the expected behaviour. Generic models can do chain-of-thought if asked to, but will be worse than a specialised one.
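For what it's worth, the "external loop" variant is just a prompting harness around any generic model. A minimal sketch (the `ask_model` function below is a hypothetical stand-in for whatever LLM API you're calling, not this product's API):

```python
# Chain-of-thought via an external loop: repeatedly prompt a generic model
# for one reasoning step at a time, feeding prior steps back as context,
# then ask for a final answer. `ask_model` is a placeholder for a real
# LLM call (hypothetical, for illustration only).

def ask_model(prompt: str) -> str:
    """Stand-in for a real LLM API call."""
    return f"(model output for: {prompt!r})"

def reason_step_by_step(question: str, max_steps: int = 3) -> list[str]:
    """Collect max_steps reasoning steps, then one final answer."""
    steps: list[str] = []
    for i in range(max_steps):
        context = "\n".join(steps)
        steps.append(ask_model(
            f"Question: {question}\n"
            f"Steps so far:\n{context}\n"
            f"Give reasoning step {i + 1}:"
        ))
    steps.append(ask_model(
        f"Question: {question}\nSteps:\n" + "\n".join(steps) + "\nFinal answer:"
    ))
    return steps
```

A model fine-tuned on this behaviour skips the harness entirely, which is presumably what they did here.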
Given the name they gave it, someone with access should ask it for the “Answer to the Ultimate Question of Life, The Universe, and Everything”
If the answer is anything other than a simple “42”, I will be thoroughly disappointed. (The answer has to be just “42”, not a bunch of text about the Hitchhiker's Guide to the Galaxy and all that.)
DeepThought-8B: 200,000 (based on 2020 census data)
Claude: 300-350,000
Gemini: 2.7M during peak times (strange definition of population!)
I followed up with DeepThought-8B: "what is the population of all of manhattan, and how does that square with only having 200,000 below CP" and it cut off its answer, but in the reasoning box it updated its guess to 400,000 by estimating as a fraction of land area.
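The area-fraction estimate in the reasoning box boils down to simple proportional arithmetic. With rough illustrative figures (both numbers below are my assumptions, not the model's actual output):

```python
# Proportional (land-area-fraction) population estimate, as the reasoning
# box appeared to do. Figures are rough assumptions for illustration.
manhattan_population = 1_600_000    # approx. 2020 census total
fraction_below_central_park = 0.25  # assumed share of Manhattan's land area

estimate = manhattan_population * fraction_below_central_park
print(int(estimate))  # 400000
```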
I asked it "Describe how a device for transportation of living beings would be able to fly while looking like a sphere" and it just never returned an output
The reasoning steps look reasonable and the interface is simple and beautiful, though DeepThought-8B fails to disambiguate "the ruliad", the technical concept from Wolfram's physics project, from this company's name, Ruliad. Maybe that isn't in the training data: when asked "what is the simplest rule of the ruliad?" it misunderstood the question and went on to reason about the company's core principles. Cool release, waiting for the next update.
I found the following video from Sam Witteveen to be a useful introduction to a few of those models:
https://youtu.be/vN8jBxEKkVo
https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct/blo...
What does (USA) law say about scraping? Does "fair use" play a role?
Isn't it an LLM with an algo wrapper?
ChatGPT-o1-preview: 647,000 (based on 2023 data, breaking it down by community board area): https://chatgpt.com/share/674b3f5b-29c4-8007-b1b6-5e0a4aeaf0... (this appears to be the most correct, judging from census data)