Meta Llama 3 - Readit News

They've got a console for it as well, https://www.meta.ai/

And announcing a lot of integration across the Meta product suite, https://about.fb.com/news/2024/04/meta-ai-assistant-built-wi...

Neglected to include comparisons against GPT-4-Turbo or Claude Opus, so I guess it's far from being a frontier model. We'll see how it fares in the LLM Arena.

CuriouslyC · 2 years ago

They didn't compare against the best models because they were trying to do "in class" comparisons, and the 70B model is in the same class as Sonnet (which they do compare against) and GPT3.5 (which is much worse than sonnet). If they're beating sonnet that means they're going to be within stabbing distance of opus and gpt4 for most tasks, with the only major difference probably arising in extremely difficult reasoning benchmarks.

Since llama is open source, we're going to see fine tunes and LoRAs though, unlike opus.

blackeyeblitzar · 2 years ago

Llama is open weight, not open source. They don’t release all the things you need to reproduce their weights.

danielhanchen · 2 years ago

On the topic of LoRAs and finetuning, have a Colab for LoRA finetuning Llama-3 8B :) https://colab.research.google.com/drive/135ced7oHytdxu3N2DNe...

wiz21c · 2 years ago

"within stabbing distance"

dunno if english is your mother tongue, but this sounds really good (although a tad aggressive :-) )) !

htrp · 2 years ago

ML Twitter was saying that they're working on a 400B parameter version?

LrnByTeach · 2 years ago

Losers & Winners from Llama-3-400B Matching 'Claude 3 Opus' etc..

Losers:

- Nvidia Stock : lid on GPU growth in the coming year or two as "Nation states" use Llama-3/Llama-4 instead spending $$$ on GPU for own models, same goes with big corporations.

- OpenAI & Sam: hard to raise speculated $100 Billion, Given GPT-4/GPT-5 advances are visible now.

- Google : diminished AI superiority posture

Winners:

- AMD, intel: these companies can focus on Chips for AI Inference instead of falling behind Nvidia Training Superior GPUs

- Universities & rest of the world : can work on top of Llama-3

vineyardmike · 2 years ago

I also disagree on Google...

Google's business is largely not predicated on AI the way everyone else is. Sure they hope it's a driver of growth, but if the entire LLM industry disappeared, they'd be fine. Google doesn't need AI "Superiority", they need "good enough" to prevent the masses from product switching.

If the entire world is saturated in AI, then it no longer becomes a differentiator to drive switching. And maybe the arms race will die down, and they can save on costs trying to out-gun everyone else.

gliched_robot · 2 years ago

Disagree on Nvidia, most folks fine-tune model. Proof: there are about 20k models in huggingface derived from llama 2, all of them trained on Nvidia GPUs.

drcode · 2 years ago

The memory chip companies were done for, once Bill Gates figured out no one would ever need more than 64K of memory

Rastonbury · 2 years ago

If anything a capable open source model is good for Nvidia, not commenting on their share price but business of course.

Better open models lower the barrier to build products and drive the price down, more options at cheaper prices which means bigger demand for GPUs and Cloud. More of what the end customers pay for goes to inference and not IP/training of proprietary models

edward28 · 2 years ago

Pretty sure meta still uses NVIDIA for training.

whywhywhywhy · 2 years ago

>AMD, intel: these companies can focus on Chips for AI Inference

No real evidence either can pull that off in any meaningful timeline, look how badly they neglected this type of computing the past 15 years.

nickthegreek · 2 years ago

And they even allow you to use it without logging in. Didnt expect that from Meta.

mvkel · 2 years ago

1. Free rlhf 2. They cookie the hell out of you to breadcrumb your journey around the web.

They don't need you to login to get what they need, much like Google

sdesol · 2 years ago

I had the same reaction, but when I saw the thumbs up and down icon, I realized this was a smart way to crowd source validation data.

salil999 · 2 years ago

I do see on the bottom left:

lairv · 2 years ago

Not in the EU though

visarga · 2 years ago

Doesn't work for me, I'm in EU.

applecrazy · 2 years ago

I imagine that is to compete with ChatGPT, which began doing the same.

unshavedyak · 2 years ago

Which indicates that they get enough value out of logged ~in~ out users. Potentially they can identify you without logging in, no need to. But also ofc they get a lot of value by giving them data via interacting with the model.

MichaelCharles · 2 years ago

But not from Japan, and I assume most other non-English speaking countries.

HarHarVeryFunny · 2 years ago

Yeah, but not for image generation unfortunately

I've never had a FaceBook account, and really don't trust them regarding privacy

Dead Comment

josh-sematic · 2 years ago

They also stated that they are still training larger variants that will be more competitive:

> Our largest models are over 400B parameters and, while these models are still training, our team is excited about how they’re trending. Over the coming months, we’ll release multiple models with new capabilities including multimodality, the ability to converse in multiple languages, a much longer context window, and stronger overall capabilities.

glenstein · 2 years ago

Anyone have any informed guesstimations as to where we might expect a 400b parameter model for llama 3 to land benchmark wise and performance wise, relative to this current llama 3 and relative to GPT-4?

I understand that parameters mean different things for different models, and llama two had 70 b parameters, so I'm wondering if anyone can contribute some guesstimation as to what might be expected with the larger model that they are teasing?

matsemann · 2 years ago

> Meta AI isn't available yet in your country

Where is it available? I got this in Norway.

schleck8 · 2 years ago

Just use the Replicate demo instead, you can even alter the inference parameters

https://llama3.replicate.dev/

Or run a jupyter notebook from Unsloth on Colab

https://huggingface.co/unsloth/llama-3-8b-bnb-4bit

sunaookami · 2 years ago

>We’re rolling out Meta AI in English in more than a dozen countries outside of the US. Now, people will have access to Meta AI in Australia, Canada, Ghana, Jamaica, Malawi, New Zealand, Nigeria, Pakistan, Singapore, South Africa, Uganda, Zambia and Zimbabwe — and we’re just getting started.

https://about.fb.com/news/2024/04/meta-ai-assistant-built-wi...

miohtama · 2 years ago

The EU does not want you to have the AI.

niek_pas · 2 years ago

Got the same in the Netherlands.

sunny-beast · 2 years ago

Anakin AI has Llama 3 models available right now: https://app.anakin.ai/

ks2048 · 2 years ago

Everyone saying it's an EU problem. Same message in Guatemala.

dom96 · 2 years ago

This is so frustrating. Why don't they just make it available everywhere?

Dead Comment

geepytee · 2 years ago

Also added Llama 3 70B to our coding copilot https://www.double.bot if anyone wants to try it for coding within their IDE and not just chat in the console

8n4vidtmkvmk · 2 years ago

Can we stop referring to VS Code as "their IDE"?

Do you support any other editors? If the list is small, just name them. Not everyone uses or likes VS Code.

rdez6173 · 2 years ago

Double seems more like a feature than a product. I feel like Copilot could easily implement those value-adds and obsolete this product.

I also don't understand why I can't bring my own API tokens. I have API keys for OpenAI, Anthropic, and even local LLMs. I guess the "secret" is in the prompting that is being done on the user's behalf.

I appreciate the work that went into this, I just think it's not for me.

doakes · 2 years ago

That was fast! I've really been enjoying Double, thanks for your work.

ionwake · 2 years ago

Cool thanks! Will try

dawnerd · 2 years ago

Tried a few queries and was surprised how fast it responded vs how slow chatgpt can be. Responses seemed just as good too.

gliched_robot · 2 years ago

Inference speed is not a great metric given the horizontal scalability of LLMs.

jaimex2 · 2 years ago

Because no one is using it

schleck8 · 2 years ago

> Neglected to include comparisons against GPT-4-Turbo or Claude Opus, so I guess it's far from being a frontier model

Yeah, almost like comparing a 70b model with a 1.8 trillion parameter model doesn't make any sense when you have a 400b model pending release.

cjbprime · 2 years ago

(You can't compare parameter count with a mixture of experts model, which is what the 1.8T rumor says that GPT-4 is.)

niutech · 2 years ago

Why does Meta embed a 3.5MB animated GIF (https://about.fb.com/wp-content/uploads/2024/04/Meta-AI-Expa...) on their announcement post instead of much smaller animated WebP/APNG/MP4 file? They should care about users with low bandwidth and limited data plan.

dazuaz · 2 years ago

I'm based on LLaMA 2, which is a type of transformer language model developed by Meta AI. LLaMA 2 is a more advanced version of the original LLaMA model, with improved performance and capabilities. I'm a specific instance of LLaMA 2, trained on a massive dataset of text from the internet, books, and other sources, and fine-tuned for conversational AI applications. My knowledge cutoff is December 2022, and I'm constantly learning and improving with new updates and fine-tuning.

salesynerd · 2 years ago

Strange. The Llama 3 model card mentions that the knowledge cutoff dates are March 2023 for the 8B version and December 2023 for the 70B version (https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md)

davidmurdoch · 2 years ago

Are you trying to say you are a bot?

jamesgpearce · 2 years ago

That realtime `/imagine` prompt seems pretty great.

throwup238 · 2 years ago

> And announcing a lot of integration across the Meta product suite, ...

That's ominous...

iosjunkie · 2 years ago

Spending millions/billions to train these models is for a reason and it's not just for funsies.

krackers · 2 years ago

Are there an stats on if llama 3 beats out chatgpt 3.5 (the free one you can use)?

Deleted Comment

Dead Comment

I just want to express how grateful I am that Zuck and Yann and the rest of the Meta team have adopted an open approach and are sharing the model weights, the tokenizer, information about the training data, etc. They, more than anyone else, are responsible for the explosion of open research and improvement that has happened with things like llama.cpp that now allow you to run quite decent models locally on consumer hardware in a way that you can avoid any censorship or controls.

Not that I even want to make inference requests that would run afoul of the controls put in place by OpenAI and Anthropic (I mostly use it for coding stuff), but I hate the idea of this powerful technology being behind walls and having gate-keepers controlling how you can use it.

Obviously, there are plenty of people and companies out there that also believe in the open approach. But they don't have hundreds of billions of dollars of capital and billions in sustainable annual cash flow and literally ten(s) of billions of dollars worth of GPUs! So it's a lot more impactful when they do it. And it basically sets the ground rules for everyone else, so that Mistral now also feels compelled to release model weights for most of their models.

Anyway, Zuck didn't have to go this way. If Facebook were run by "professional" outside managers of the HBS/McKinsey ilk, I think it's quite unlikely that they would be this open with everything, especially after investing so much capital and energy into it. But I am very grateful that they are, and think we all benefit hugely from not only their willingness to be open and share, but also to not use pessimistic AI "doomerism" as an excuse to hide the crown jewels and put it behind a centralized API with a gatekeeper because of "AI safety risks." Thanks Zuck!

paxys · 2 years ago

You can see from Zuck's interviews that he is still an engineer at heart. Every other big tech company has lost that kind of leadership.

eigenvalue · 2 years ago

For sure. I just started watching the new Dwarkesh interview with Zuck that was just released ( https://t.co/f4h7ko0M7q ) and you can just tell from the first few minutes that he simply has a different level of enthusiasm and passion and level of engagement than 99% of big tech CEOs.

a_wild_dandan · 2 years ago

Also, being open source adds phenomenal value for Meta:

1. It attracts the world's best academic talent, who deeply want their work shared. AI experts can join any company, so ones which commit to open AI have a huge advantage.

2. Having armies of SWEs contributing millions of free labor hours to test/fix/improve/expand your stuff is incredible.

3. The industry standardizes around their tech, driving down costs and dramatically improving compatibility/extensibility.

4. It creates immense goodwill with basically everyone.

5. Having open AI doesn't hurt their core business. If you're an AI company, giving away your only product isn't tenable (so far).

If Meta's 405B model surpasses GPT-4 and Claude Opus as they expect, they release it for free, and (predictably) nothing awful happens -- just incredible unlocks for regular people like Llama 2 -- it'll make much of the industry look like complete clowns. Hiding their models with some pretext about safety, the alarmist alignment rhetoric, will crumble. Like...no, you zealously guard your models because you want to make money, and that's fine. But using some holier-than-thou "it's for your own good" public gaslighting is wildly inappropriate, paternalistic, and condescending.

The 405B model will be an enormous middle finger to companies who literally won't even tell you how big their models are (because "safety", I guess). Here's a model better than all of yours, it's open for everyone to benefit from, and it didn't end the world. So go &%$# yourselves.

ramesh31 · 2 years ago

>Every other big tech company has lost that kind of leadership.

He really is the last man standing from the web 2.0 days. I would have never believed I'd say this 10 years ago, but we're really fortunate for it. The launch of Quest 3 last fall was such a breath of fresh air. To see a CEO actually legitimately excited about something, standing on stage and physically showing it off was like something out of a bygone era.

redbell · 2 years ago

Someone, somewhere on YT [1], coined the term Vanilla CEOs to describe non-tech-savvy CEOs, typically MBA graduates, who may struggle to innovate consistently. Unlike their tech-savvy counterparts, these CEOs tend to maintain the status quo rather than pursue bold visions for their companies..

1. https://youtu.be/gD3RV8nMzh8

m12k · 2 years ago

But also: Facebook/Meta got burned when they missed the train on owning a mobile platform, instead having to live in their competitors' houses and being vulnerable to de-platforming on mobile. So they've invested massively in trying to make VR the next big thing to get out from that precarious position, or maybe even to get to own the next big platform after mobile (so far with little to actually show for it at a strategic level).

Anyways, what we're now seeing is this mindset reflected in a new way with LLMs - Meta would rather that the next big thing belongs to everybody, than to a competitor.

I'm really glad they've taken that approach, but I wouldn't delude myself that it's all hacker-mentality altruism, and not a fair bit of strategic cynicism at work here too.

If Zuck thought he could "own" LLMs and make them a walled garden, I'm sure he would, but the ship already sailed on developing a moat like that for anybody that's not OpenAI - now it's in Zuck's interest to get his competitor's moat bridged as fast as possible.

schleck8 · 2 years ago

Depends on your size threshhold. For anything beyond 100 bn in market cap certainly. There is some relatively large companies with a similar flair though, like Cohere and obviously Mistral.

firecall · 2 years ago

Apple being the most egregious example IMHO.

Purely my opinion as a long time Apple fan, but I cant help but think that Tim Cook's polices are harming the Apple brand in ways that we wont see for a few years.

Much like Balmer did at Microsoft.

But who knows - I'm just making conversation :-)

axus · 2 years ago

I'm happy that he's pouring money into the metaverse, and glad that it's not my money.

bionhoward · 2 years ago

Are you joking? “ v. You will not use the Llama Materials or any output or results of the Llama Materials to improve any other large language model (excluding Llama 2 or derivative works thereof). “ is no sign of a strong engineering culture, it’s a sign of greed.

nmklnlknklnlk · 2 years ago

NVidia, AMD, Microsoft?

Deleted Comment

projectileboy · 2 years ago

Anyone who made it through CS 121 is an engineer for life.

cush · 2 years ago

This is both their biggest strength and weakness

stuckkeys · 2 years ago

Yeah. He did good.

Dead Comment

noiseinvacuum · 2 years ago

Good thing that he's only 39 years old and seems more energetic than ever to run his company. Having a passionate founder is, imo, a big advantage for Meta compared to other big tech companies.

tmalsburg2 · 2 years ago

Love how everyone is romanticizing his engineering mindset. But have we already forgotten that he was even more passionate about the metaverse which, as far as I can tell, was a 50B failure?

emrah · 2 years ago

Let's be honest that he's probably not doing it due to goodness of his heart. He's most likely trying to commoditize the models so he can sell their complement. It's a strategy Joel Spolsky had talked about in the past (for those of you who remember who that is). I'm not sure what the complement of AI models is that Meta can sell exactly, so maybe it's not a good strategy but I'm certain it's a strategy of some sort

nh23423fefe · 2 years ago

You lead with a command to be honest and then immediately speculate on private unknowable motivations and then attribute, without evidence, his decision to a strategy you can't describe.

What is this? Someone said something nice, and you need to "restore balance"

Salgat · 2 years ago

Also keep in mind that it's still a proprietary model. Meta gets all the benefits of open source contributions and testing while retaining exclusive business use.

elevatedastalt · 2 years ago

That's coz he is a founder CEO. Those guys are built different. It's rare for the careerist MBA types to match their passion or sincerity.

There are many things I can criticize Zuck for but lack of sincerity for the mission is not one of them.

szundi · 2 years ago

It is just the reverse: he is successful because he is like that and lots of founder ceos are jellies in comparison

deelowe · 2 years ago

Meta also spearheaded the open compute project. I originally joined Google because of their commitment to open source and was extremely disappointed when I didn't see that culture continue as we worked on exascale solutions. Glad to see Meta carrying the torch here. Hope it continues.

eru · 2 years ago

When did you join Google?

gliched_robot · 2 years ago

I see what you did here <q> carrying the "torch" <q>. LOL

mandeepj · 2 years ago

> I just want to express how grateful I am that Zuck

Praise for him at HN? It should be enough of a reason for him to pop a champagne today

shepherdjerred · 2 years ago

Yeah, I'm also surprised at how many positive comments are in this thread.

I do hate Facebook, but I also love engineers, so I'm not sure how to feel about this one.

jwoq9118 · 2 years ago

The world at large seems to hate Zuck but it’s good to hear from people familiar with software engineering and who understand just how significant his contributions to open source and raising salaries have been through Facebook and now Meta.

swatcoder · 2 years ago

> his contributions to ... raising salaries

It's fun to be able to retire early or whatever, but driving software engineer salaries out of reach of otherwise profitable, sustainable businesses is not a good thing. That just concentrates the industry in fewer hands and makes it more dependent on fickle cash sources (investors, market expansion) often disconnected from the actual software being produced by their teams.

Nor is it great for the yet-to-mature craft that high salaries invited a very large pool of primarly-compensation-motivated people who end up diluting the ability for primarily-craft-motivated people to find and coordinate with each other in pursuit of higher quality work and more robust practices.

mschuster91 · 2 years ago

A person (or a company) can be two very different things at the same time. It's undeniable as you say that there have been a lot of high-profile open source innovations coming from Facebook (ReactJS, LLaMA, HHVM, ...), but the price that society at large paid for all of this is not insignificant either, and Meta hasn't meaningfully apologized for the worst of it.

insanebrain · 2 years ago

They're sharing it for a reason. That reason is to disarm their opponents.

insane_dreamer · 2 years ago

Call me cynical, but it was the only way not to be outplayed by OpenAI and to compete with Google, etc.

danielmarkbruce · 2 years ago

100%. It was the only real play they had.

swalsh · 2 years ago

Why is Meta doing it though? This is an astronomical investment. What do they gain from it?

evnc · 2 years ago

They're commoditizing their complement [0][1], inasmuch as LLMs are a complement of social media and advertising (which I think they are).

They've made it harder for competitors like Google or TikTok to compete with Meta on the basis of "we have a super secret proprietary AI that no one else has that's leagues better than anything else". If everyone has access to a high quality AI (perhaps not the world's best, but competitive), then no one -- including their competitors -- has a competitive advantage from having exclusive access to high quality AI.

[0]: https://www.joelonsoftware.com/2002/06/12/strategy-letter-v/

[1]: https://gwern.net/complement

noiseinvacuum · 2 years ago

He went into the details of how he thinks about open sourcing weights for Llama responding to a question from an analyst in one of the earnings call last year after Llama release. I had made a post on Reddit with some details.

https://www.reddit.com/r/MachineLearning/s/GK57eB2qiz

Some noteworthy quotes that signal the thought process at Meta FAIR and more broadly

* We’re just playing a different game on the infrastructure than companies like Google or Microsoft or Amazon

* We would aspire to and hope to make even more open than that. So, we’ll need to figure out a way to do that.

* ...lead us to do more work in terms of open sourcing, some of the lower level models and tools

* Open sourcing low level tools make the way we run all this infrastructure more efficient over time.

* On PyTorch: It’s generally been very valuable for us to provide that because now all of the best developers across the industry are using tools that we’re also using internally.

* I would expect us to be pushing and helping to build out an open ecosystem.

zmmmmm · 2 years ago

I think you really have to understand Zuckerberg's "origin story" to understand why he is doing this. He created a thing called Facebook that was wildly successful. Built it with his own two hands. We all know this.

But what is less understood is that from his point of view, Facebook went through a near death experience when mobile happened. Apple and Google nearly "stole" it from him by putting strict controls around the next platform that happened, mobile. He lives every day even still knowing Apple or Google could simply turn off his apps and the whole dream would come to an end.

So what do you do in that situation? You swear - never again. When the next revolution happens, I'm going to be there, owning it from the ground up myself. But more than that, he wants to fundamentally shift the world back to the premise that made him successful in the first place - open platforms. He thinks that when everyone is competing on a level playing field he'll win. He thinks he is at least as smart and as good as everyone else. The biggest threat to him is not that someone else is better, it's that the playing field is made arbitrarily uneven.

Of course, this is all either conjecture or pieced together from scraps of observations over time. But it is very consistent over many decisions and interactions he has made over many years and many different domains.

tinyspacewizard · 2 years ago

I think what Meta is doing is really smart.

We don't really know where AI will be useful in a business sense yet (the apps with users are losing money) but a good bet is that incumbent platforms stand to benefit the most once these uses are discovered. What Meta is doing is making it easier for other orgs to find those use-cases (and take on the risk) whilst keeping the ability to jump in and capitalize on it when it materializes.

As for X-Risk? I don't think any of the big tech leadsership actually beleive in that. I also think that deep down a lot of the AI safety crowd love solving hard problems and collecting stock options.

On cost, the AI hype raises Met's valuation by more than the cost of engineers and server farms.

schleck8 · 2 years ago

Zuck equated the current point in AI to iOS vs Android and MacOS vs Windows. He thinks there will be an open ecosystem and a closed one coexisting if I got that correctly, and thinks he can make the former.

ativzzz · 2 years ago

Meta is an advertising company that is primarily driven by user generated content. If they can empower more people to create more content more quickly, they make more money. Particularly the metaverse, if they ever get there, because making content for 3d VR is very resource intensive.

Making AI as open as possible so more people can use it accelerates the rate of content creation

farco12 · 2 years ago

Mark probably figured Meta would gain knowledge and experience more rapidly if they threw Llama out in the wild while they caught up to the performance of the bigger & better closed source models. It helps that unlike their competition, these models aren't a threat to Meta's revenue streams and they don't have an existing enterprise software business that would seek to immediately monetize this work.

woile · 2 years ago

If they start selling ai in their platform, it's a really good option, as people know they can run it somewhere else if they had to (for any reason, e.g: you could make a poc with their platform but then because of regulations you need to self host, can you do that with other offers?)

jug · 2 years ago

Zuck is pretty open about this in a recent earnings call:

https://twitter.com/soumithchintala/status/17531811200683049...

bg24 · 2 years ago

Besides everything said here in comments, Zuck would be actively looking to own the next platform (after desktop/laptop and mobile), and everyone's trying to figure what that would be.

He knows well that if competitors have a cash cow, they have $$ to throw at hundreds of things. By releasing open-source, he is winning credibility, establishing Meta as the most used LLM, and finally weakening the competition from throwing money on the future initiatives.

HarHarVeryFunny · 2 years ago

They heavily use AI internally for their core FaceBook business - analyzing and policing user content, and this is also great PR to rehabilitate their damaged image.

There is also an arms race now of AI vs AI in terms of generating and detecting AI content (incl deepfakes, election interference, etc, etc). In order not to deter advertizers and users, FaceBook need to keep up.

eigenvalue · 2 years ago

They will be able to integrate intelligence into all their product offerings without having to share the data with any outside organization. Tools that can help you create posts for social media (like an AI social media manager), or something that can help you create your listing to sell an item on Facebook Marketplace, tools that can help edit or translate your messages on Messenger/Whatsapp, etc. Also, it can allow them to create whole new product categories. There's a lot you can do with multimodal intelligent agents! Even if they share the models themselves, they will have insights into how to best use and serve those models efficiently and at scale. And it makes AI researchers more excited to work at Meta because then they can get credit for their discoveries instead of hoarding them in secret for the company.

neverokay · 2 years ago

The same thing he did with VR. Probably got tipped off Apple is on top of Vision Pro, and so just ruthlessly started competing in that market ahead of time

/tinfoil

Releasing Llama puts a bottleneck on developers becoming reliant on OpenAI/google/microsoft.

Strategically, it’s … meta.

HDThoreaun · 2 years ago

Generative AI is a necessity for the metaverse to take off. Creating metaverse content is too time consuming otherwise. Mark really wants to control a platform so the companies whole strategy seems to be around getting the quest to take off.

Zizizizz · 2 years ago

I would assume it's related to fair use and how OpenAI and Google have closed models that are built on copyrighted material. Easier to make the case that it's for the public good if it's open and free than not...

_factor · 2 years ago

It’s a shame it can’t just be giving back to the community and not questioned.

Why is selfishness from companies who’ve benefited from social resources not a surprising event vs the norm.

mywacaday · 2 years ago

Looks like it can't be accessed outside the states? I get a "Meta AI isn't available yet in your country"

altilunium · 2 years ago

Llama3 is available on Poe.

FrustratedMonky · 2 years ago

It does seem uncharacteristic. Wonder how much of the hate Zuck gets is people that just don't like Facebook, but as person/engineer, his heart is in the right place? It is hard to accept this at face value and not think there is some giant corporate hidden agenda.

xpe · 2 years ago

> but also to not use pessimistic AI "doomerism" as an excuse to hide the crown jewels and put it behind a centralized API with a gatekeeper because of "AI safety risks."

AI safety risk is substantial. It is also testable. (There are prediction markets on it, for example.) Of course, some companies may latch onto various valid arguments for insincere reasons.

I'd challenge everyone to closely compare ideas such as "open source software is better" versus "state of the art trained AI models are better developed in the open". The exact same arguments do NOT work for both.

It is one thing to publish papers about e.g. transformers. It is another thing to publish the weights of something like GPT 3.5+; it might theoretically be a matter of degree, but that matter of degree makes a real difference, if only in terms of time. Time matters because it gives people and society some time to respond.

Software security reports are often made privately or embargoed. Why? We want to give people and companies time to defend their systems.

Now consider this thought-experiment: assume LLMs (and their hybrid derivatives) enable perhaps 1,000,000 new kinds of cyberattacks, 1,000 new bioweapon attacks, and so on. Are there are a correspondingly large number of defensive benefits? This is the crux of the question I think. First, I don't expect we're going to get a good assessment of the overall "balance". Second, any claims of "balance" are beside the point, because these attacks and defenses don't simply cancel each other out. The distribution of the AI-fueled capability advance will probably ratchet up risk and instability.

Open source software's benefits stem from the assumption that bugs get shallower with more eyes. More eyes means that the open source product gets stronger defensively.

With LLMs that publish their weights, both the research and the implementations is out; you can't get guardrails. The closest analogue to an "OSS security report" would take the form of "I just got your LLM to design a novel biological weapon. Do you think you can use it to design an antidote?"

A systematic risk-averse person might want to ask: what happens if we enumerate all offensive vs defensive technological shifts? Should we reasonably believe that the benefits outweigh the risks?

Unfortunately, the companies making these decisions aren't bearing the risks. This huge externality both pisses me off and scares the shit out of me.

lannisterstark · 2 years ago

I too like making up hypothetical insane scenarios in my head. The difference is that they stay with me in the shower.

courseofaction · 2 years ago

This is the organization that wouldn't moderate facebook during Myanmarr yeah? The one with all the mental health research they ignore?

Zuckerberg states during the interview that once the ai reaches a certain level of capability they will stop releasing weights - i.e. they are going the "OpenAI" route: this is just trying to get ahead of the competition, it's a sound strategy when you're behind to leverage open source.

I see no reason to be optimistic about this organization, the open source community should use this an abandon them ASAP.

firecall · 2 years ago

I actually think Mr Zuckerburg is maturing and has a chance of developing a public persona of being decent person!

I say public persona, as I've never met him, and have no idea what he is like as a person on an individual level.

Maturing in general and studying martial arts is likely to be a contributing factor.

atleastoptimal · 2 years ago

It's crazy how the managerial executive class seems to resent the vital essence of their own companies. Based on the behavior, nature, stated beliefs and interviews I've seen of most tech CEOs and CEOs in general, there seems to be almost a natural aversion to talking about things in non hyper-abstracted terms.

I get the feeling that the nature of the corporate world is often better understood as a series of rituals to create the illusion of the necessity of the capitalist hierarchy itself. (not that this is exclusive to capitalism, this exists in politics and any system that becomes somewhat self-sustaining) More important than a company doing well is the capacity to use the company as an image/lifestyle enhancement tool for those at the top. So many companies run almost mindlessly as somewhat autonomous machines, allowing pretense and personal egoic myth-making to win over the purpose of the company in the first place.

I think this is why Elon, Mark, Jensen, etc. have done so well. They don't perceive their position as founder/CEOs as a class position: a level above the normal lot that requires a lack of caring for tangible matters. They see their companies as ways of making things happen, for better or for worse.

charlie0 · 2 years ago

It's because Elon, Mark, and Jensen are true founders. They aren't MBAs who got voted in because shareholders thought they would make them the most money in the shortest amount of time.

m463 · 2 years ago

I kind of wonder. Does what they do counter the growth of Google?

I remember reading years ago that page/brin wanted to build an AI.

This was long before the AI boom, when saying something like that was just weird (like musk saying he wanted to die on mars weird)

Havoc · 2 years ago

The more likely version is that this course of action is in line with strategy recommended by consultants. Takes the wind out of their competitors sail

pankajdoharey · 2 years ago

Always bet on Zuck!

hwbunny · 2 years ago

It's like Elon saying: we have open sourced our patents, use them. Well, use the old patents and stay behind forever....

TechDebtDevin · 2 years ago

Exactly.

jug · 2 years ago

Yes - for sure this AI is trained on their vast information base from their social networks and beyond but at least it feels like they're giving back something. I know it's not pure altruism and Zuck has been open about exactly why they do it (tldr - more advantages in advancing AI through the community that ultimately benefits Meta), but they could have opted for completely different paths here.

nmfisher · 2 years ago

The quickest way to disabuse yourself of this notion is to login to Facebook. You’ll remember that Zuck makes money from the scummiest pool of trash and misinformation the world has ever seen. He’s basically the Web 2.0 tabloid newspaper king.

I don’t really care how much the AI team open sources, the world would be a better place if the entire company ceased to exist.

TechDebtDevin · 2 years ago

Yeah lmao, people are giving meta way too much credit here tbh.

Dead Comment