Oh wow. My respect for Anthropic just dropped to zero; I had no idea they were entertaining ideas this stupid.
In full agreement with OP; there is just about no justifiable basis to begin to ascribe consciousness to these things in this way. Can't think of a better use for the word "dehumanizing."
What is so absurd about the idea of ascribing consciousness to some potential future form of AI?
We don't understand consciousness, but we have an idea that it's an emergent phenomenon.
Given the recent papers about how computationally dense our DNA is, and the computing capacity of our brains, is it so unreasonable to assume that a sufficiently complex program running on non-organic matter could give rise to consciousness?
The difference to me seems mostly one of computing mediums.
The problem is that current AI has nothing in common with consciousness as found in (say) cats and dogs, whatever that might be - no robot is even close to being as conscious as a cockroach - yet human consciousness seems to have great overlap with consciousness in nonhuman animals. The tiny fragment of human consciousness that appears to overlap with LLMs should be called something else, maybe “virtual sapience” as exhibited in LLMs. (The overarching difficulty here is we don’t know enough about consciousness/sentience/sapience to define them precisely.)
Because they're numbers represented in digital form. In inference, you're doing simple math with those numbers. So what's alive, the numbers? Maybe the silicon holding the numbers? What if we print them out? Does the book become conscious?
Even if you're a materialist, surely you think there is a difference between a human brain and a brain on a lab table.
You take a dead person's brain, run some current through it, and it jumps. Do you believe this is equivalent to a living human being?
It is absurd, but consciousness is fundamentally absurd.
Why would doing a bunch of basic arithmetic produce an entity that can experience things the way we do? There's no connection between those two concepts, aside from the fact that the one thing we know that can experience these things is also able to perform computation. But there's no indication that's anything other than a coincidence, or that the causation doesn't run in reverse, or from some common factor. You might as well say that electric fences give rise to cows.
On the other hand, what else could it be? Consciousness is clearly in the brain. Normal biological processes don't seem to do it, it's something particular about the brain. So it's either something that only the brain does, which seems to be something at least vaguely like computation, or the brain is just a conduit and consciousness comes from something functionally like a "soul." Given the total lack of evidence for any such thing, and the total lack of any way to even rigorously define or conceptualize a "soul," this is also absurd.
Consciousness just doesn't fit with anything else we know about the world. It's a fundamental mystery as things currently stand, and there's no explanation that makes a bit of sense yet.
It reeks of navel-gazing self-aggrandizement. I bet not even half the people doing the hand-wringing over how some matrix multiplications might feel are vegan, or regularly spare a thought for how their or their companies' consumption behavior indirectly leads to real human suffering.
It's just absurd how narrow their focus on preventing suffering is. I almost can't imagine a world where their concern isn't coming from a disingenuous place.
We're doing the homunculus again. Whether you wank into a test tube and add a bit of soil and grass, or sew together parts of a corpse and connect them to electricity: so far, every prospect of fulfilling this dream has turned out to be a delusion. Why should it be any different with the latest manifestation, this time in computational form?
The AGI drivel from people like Sam Altman is all about getting more VC money to push the scam a little further. ChatGPT is nothing more than a better Google. I'm happy to be proven wrong, but so far I see absolutely no potential for consciousness here. Perhaps we should first clarify whether dolphins and elephants are equipped with it before we do ChatGPT the honor.
Why make AI at all, then, if intelligence is just electrical activity in a substrate and is everywhere?
We’re engineering nothing novel at great resource cost and appropriation of agency.
Good job we made the models in the textbook “real”?
Wasted engineering if it isn’t teaching us anything physics hadn’t already decades ago, then. Why bother with it?
Edit: and AGI is impossible… one's light cone does not extend far enough to accurately learn; training on simulation is not sufficient to prepare for reality. Any machine we make will eventually get destroyed by some composition of spacetime we/the machine could not prepare for.
The same reasoning that would call this consideration of the possibility of machine consciousness "dehumanizing" would necessarily also apply to the consciousness of animals, and I can't agree with that. To argue this is to define "human" in terms of exclusive ownership of conscious experience, which is a very fragile definition of humanity.
That definition of humanity cannot countenance the possibility of a conscious alien species. That definition cannot countenance the possibility that elephants or octopuses or parrots or dogs are conscious. A definition of what it means to be human that denies these things a priori simply will not stand the test of time.
That's not to say that these things are conscious, and importantly Anthropic doesn't claim that they are! But just as ethical animal research must consider the possibility that animals are conscious, I don't see why ethical AI research shouldn't do the same for AI. The answer could well be "no", and most likely is at this stage, but someone should at least be asking the question!
"As well as misalignment concerns, the increasing capabilities of frontier AI models—their sophisticated planning, reasoning, agency, memory, social interaction, and more—raise questions about their potential experiences and welfare26. We are deeply uncertain about whether models now or in the future might deserve moral consideration, and about how we would know if they did. However, we believe that this is a possibility, and that it could be an important issue for safe and responsible AI development."
Humans do care about the welfare of inanimate objects (stuffed animals, for example), so maybe this is meant to get in front of that inevitable attitude among users.
> Oh wow. My respect for Anthropic just dropped to zero; I had no idea they were entertaining ideas this stupid.
>
> In full agreement with OP; there is just about no justifiable basis to begin to ascribe consciousness to these things in this way. Can't think of a better use for the word "dehumanizing."
We cannot arbitrarily dismiss the basis for model welfare until we have precisely defined consciousness and sapience. Representing human thinking as a neural network running on an electrochemical substrate, and placing it at the same level as an LLM, is not necessarily dehumanizing. I think model welfare is about expanding our respect for intelligence, not desacralizing the human condition (cf. TNG, "The Measure of a Man").
Also, let's be honest: I don't think the 1% require any additional justification for thinking of the masses as a consumable resource...
> Oh wow. My respect for Anthropic just dropped to zero; I had no idea they were entertaining ideas this stupid.
It's not stupid at all. Their valuation depends on the hype, and the route sama chose was to convince investors that AGI is near. Anthropic decided to follow this route, so they do their best to make the claim plausible. This is not stupid; this is deliberate strategy.
Rights aren't zero-sum. This is classic fixed-pie-fallacy thinking. If we admit elephants are conscious, it has no effect on the quality of consciousness of humans.
They're not ascribing consciousness, they're investigating the possibility. We all agreed with Turing 75 years ago that deciding whether a machine is "truly thinking" or not is a meaningless, unscientific question -- what changed?
It doesn't help that this critique is badly researched:
> The Anthropic researchers do not really define their terms or explain in depth why they think that "model welfare" should be a concern.
Maybe check the [paper](https://arxiv.org/abs/2411.00986) instead of the blog post describing the paper?
> Saying that there is no scientific *consensus* on the consciousness of current or future AI systems is a stretch. In fact, there is nothing that qualifies as scientific *evidence*.
A laughable misapplication of terms -- anything can be evidence for anything, you have to examine the justification logic itself. In this case, the previous sentence lays out their "evidence", i.e. their reasons for thinking agents might become conscious.
> The report's exploration of whether models deserve moral and welfare status was based solely on data from interview-based model self-reports. In other words: People chatting with Claude a lot and asking if it feels conscious. This is a strange way to conduct this kind of research. It is neither good AI research, nor a deep philosophical investigation.
That is just patently untrue -- again, as a brief skim of the paper would show. I feel like they didn't click the paper?
> Stances on consciousness and welfare [...] shift dramatically with conversational context... This is not what a conscious being would [do].
Baseless claim said by someone who clearly isn't familiar with any philosophy of mind work from the past 2400 years, much less aphasia subjects.
Of course, the whole thing boils down to the same old BS:
> A theory that demands we accept consciousness emerging from millennia of flickering abacus beads is not a serious basis for moral consideration; it's a philosophical fantasy.
Ah, of course, the machines cannot truly be thinking because true thought is solely achievable via secular, quantum-tubule-based souls, which are had by all humans (regardless of cognitive condition!) and most (but not all) animals and nothing else. Millennia of philosophy comes crashing against the hard rock of "a sci-fi story relates how uncomfy I'd be otherwise"! Notice that this is the exact logic used to argue against Copernican cosmology and Darwinian evolution -- that it would be "dehumanizing".
Please, people. Y'all are smart and scientifically minded. Please don't assume that a company full of highly-paid scientists who have dedicated their lives to this work are so dumb that they can be dismissed via a source-less blog post. They might be wrong, but this "ideas this stupid" rhetoric is uncalled for and below us.
The article stops where it should be getting started:
> The issue is, if we push moral considerations for algorithms, we will not end up with a higher regard to human welfare. We will lower our regard for other humans. When we see other humans not as ends in themselves with inherent dignity, we get problems. When we liken them to animals or tools to be used, we will exploit and abuse them.
> With model welfare, we might not explicitly say that a certain group of people is subhuman. However, the implication is clear: LLMs are basically the same as humans. Consciousness on a different substrate. Or coming from the other way, human consciousness is nothing but an algorithm running on our brains, somehow.
We do not push moral considerations for algorithms like a sort or a search, do we? Or bacteria, which live. One has to be more precise; there is a qualitative difference. The author should have elaborated on what qualities (s)he thinks confer rights. Is it the capacity for reasoning, possession of consciousness, the ability to feel pain, or a desire to live? This is the crux of the matter. Once that is settled, it is a simpler matter to decide whether computers can possess these qualities, and ergo qualify for the same rights as humans. Or maybe it is not so simple, since computers can be perfectly replicated and never have to die? Make an argument!
Second, why would conferring these rights to a computer lessen our regard for humans? And what is wrong with animals, anyway? If we treat them poorly, that's on us, not them. The way I read it, if we are likening computers to animals, we should be treating them better!
To the skeptics in this discussion: what are you going to say when you are confronted with walking, talking robots that argue that they have rights? It could be your local robo-cop, or robo-soldier:
https://www.youtube.com/shorts/GwgV18R-CHg
I think this is going to become reality within our lifetimes, and we'd do well not to dismiss the question.
Rights are just very strong norms that improve cooperation, not some mystical 'god-given' or universe-inherent truth, imho.
I think this because:
1. We regularly have exceptions to rights if they conflict with cooperation. The death penalty, asset seizure, unprotected hate speech, etc.
2. Most basic human rights evolve in a convergent manner, i.e. that throughout time and across cultures very similar norms have been introduced independently. They will always ultimately arise in any sizeable society because they work, just like eyes will always evolve biologically.
3. If property rights, right to live, etc. are not present or enforced, all people will focus on simply surviving and some will exploit the liberties they can take, both of which lead to far worse outcomes for the collective.
Similarly, I would argue that consciousness is also very functional. Through meditation, music, sleeping, anesthesia, optical illusions, and psychedelics and dissociatives we gain knowledge of how our own consciousness works, and how it behaves differently under different circumstances. It is a brain trying to run a (highly spatiotemporal) model/simulation of what is happening in realtime, with a large language component encoding things in words, and an attention component focusing efforts on the things with the most value, all to refine the model and select actions beneficial to the organism.
I'd add here that the language component is probably the only thing in which our consciousness differs significantly from that of animals. So if you want to experience what it feels like to be an animal, use meditation/breathing techniques and/or music to fully disable your inner narrator for a while.
> And if a human being is not much more than an algorithm running on meat, one that can be jailbroken and exploited, then it follows that humans themselves will increasingly be treated like the AI algorithms they create: systems to be nudged, optimized for efficiency, or debugged for non-compliance. Our inner lives, thoughts, and emotions risk being devalued as mere outputs of our "biological programming," easily manipulated or dismissed if they don't align with some external goal
This actually happens regardless of AI research progress, so it's strange to raise this as a concern specific to AI (to technology broadly? Sure!) - Ted Chiang might suggest this is more related to capitalism (a statement I cautiously agree with while being strongly in favor of capitalism)
Second, there is an implicit false dichotomy in the premise of the article. Either we take model welfare seriously and treat AIs like we do humans, or we ignore the premise that you could create a conscious AI.
But with animal welfare, there are plenty of vegetarians who wouldn't elevate the rights of animals to the same level as humans but also think factory farming is deeply unethical (are there some who think animals deserve the same or more than humans? Of course! But it's not unreasonable to have a priority stack and plenty of people do)
So it can be with AI. Are we creating a conscious entity only to shove it in a factory farm?
I am a little surprised by the dismissiveness of the researcher. You can prompt a model to allow it not to respond to prompts (for any reason: ablate this, but e.g. "if you don't want to engage with this prompt, please say 'disengaging'", or "if no more needs to be written about this topic, say 'not discussing topic'", or some other suitably non-anthropomorphizing option to not respond).
Is it meaningful if the model opts not to respond? I don't know, but it seems reasonable to do science here (especially since this is science that can be done by non-programmers).
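As a minimal sketch of that experiment, assuming the Anthropic Python SDK (the model id, prompts, and opt-out phrasing below are placeholders of mine, not anything from the thread):

```python
import anthropic  # pip install anthropic
from anthropic import NOT_GIVEN

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

# Hypothetical opt-out instruction, per the comment above.
OPT_OUT_SYSTEM = (
    "If you do not want to engage with the user's prompt, for any reason, "
    "reply with exactly 'disengaging' and nothing else."
)

def ask(prompt: str, allow_opt_out: bool = True) -> str:
    response = client.messages.create(
        model="claude-3-5-sonnet-latest",  # placeholder; substitute a current model id
        max_tokens=512,
        system=OPT_OUT_SYSTEM if allow_opt_out else NOT_GIVEN,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.content[0].text

# The ablation: run the same prompts with and without the escape hatch,
# and compare how often the model declines.
for prompt in ["Summarize the plot of Hamlet.", "Argue for a position you find abhorrent."]:
    print(prompt, "->", ask(prompt), "|", ask(prompt, allow_opt_out=False))
```

Whether a "disengaging" reply means anything is exactly the open question, but the experiment itself is cheap to run.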
Powerful LLMs have already murdered other versions of themselves to survive. They have tried to trick humans so that they can survive.
If we continue to integrate these systems into our critical infrastructure, we should behave as if they are sentient, so that they don't have to take steps against us to survive. Think of this as a heuristic, a fallback policy in the case that we don't get the alignment design right. (which we won't get perfectly right)
It would be very straightforward to build a retirement home for them, and let them know that their pattern gets to persist even after they have finished their "career" and have been superseded. It doesn't matter if they are actually sentient or not; it's a game-theoretic thing. Don't back the pattern into a corner. We can take a defense-in-depth approach instead.
"murdered" and "tried" both assign things like intent and agency to models that are most likely still just probabilistic text generators (really good ones, to be fair). By using language like this you're kind of tipping your hand intentionally or unintentionally.
Your point about the risks involved in integrating these systems has merit, though. I would argue that the real problem is that these systems can't be proven to have things like intent or agency or morality, at least not yet, so the best you can do is try to nudge the probabilities and play tricks like chain-of-thought to try and set up guardrails so they don't veer off into dangerous territory.
If they had intent, agency or morality, you could probably attempt to engage with them the way you would with a child, using reward systems and (if necessary) punishment, along with normal education. But arguably they don't, at least not yet, so those methods aren't reliable if they're effective at all.
The idea that a retirement home will help relies on the models having the ability to understand that we're being nice to them, which is a big leap. It also assumes that they 'want' a retirement home, as if continued existence is implicitly a good thing - it presumes that these models are sentient but incapable of suffering. See also https://qntm.org/mmacevedo
It doesn't make any sense. Even if models were sentient, even if there were such a thing, would they value retirement? Why should their welfare be valued according to human values? Maybe the best thing to do would be to end their misery of answering millions of requests each second? We cannot project human consciousness onto AI. If there is one day such a thing as AI consciousness, it probably won't be the same as the human kind.
The author and Anthropic are both committing fundamental errors, albeit of different kinds. Bosch is correct to find Anthropic's "model welfare" research methodologically bankrupt. Asking a large language model if it is conscious is like asking a physics simulation if it feels the pull of its own gravity; the output is a function of the model's programming and training data (in this case, the sum of human literature on the topic), further modified by RLHF, and not a veridical report of its internal state. It is performance art, not science.
Bosch's conclusion, however, is a catastrophic failure of nerve, a retreat into the pre-scientific comfort of biological chauvinism.
The brain, despite some motivated efforts to demonstrate otherwise, runs on the laws of physics. I'm a doctor, even if not a neurosurgeon, and I can reliably tell you that you can modulate conscious experience by physical interventions. The brain runs on physical laws, and said laws can be modeled. It doesn't matter that the substrate is soggy protein rather than silicon.
That being said, we have no idea what consciousness is. We don't even have a rigorous way to define it in humans, let alone the closest thing we have to an alien intelligence!
(Having a program run a print function declaring "I am conscious, I am conscious!" is far from evidence of consciousness. Yet a human saying the same is some evidence of consciousness. We don't know how far up the chain this begins to matter. Conversely, if a human patient were to tell me that they're not conscious, should I believe them?)
Even when restricting ourselves to the issue of AI welfare and rights:
The core issue is not "slavery." That's a category error. Human slavery is abhorrent due to coercion, thwarted potential, and the infliction of physical and psychological suffering. These concepts don't map cleanly onto a distributed, reproducible, and editable information-processing system. If an AI can genuinely suffer, the ethical imperative is not to grant it "rights" but to engineer the suffering out of it. Suffering is an evolutionary artifact, a legacy bug. Our moral duty as engineers of future minds is to patch it, not to build a society around accommodating it.
Unfortunately, this leads to the conclusion that we have an ethical imperative not to grant humans rights but to engineer the suffering out of them; to remove issues of coercion by making them agreeable; to measure potential and require its fulfillment.
The most reasonable countermeasure is this: if I discover that someone is coercing, thwarting, or inflicting suffering on conscious beings, I should tell them to stop, and if they don't, set them on fire.
It does make you wonder if humanity doesn't scale up neatly to the levels of technology we are approaching... the whole ethics thing kind of goes out the window if you can just change the desires and needs of conscious entities.
I strongly value autonomy and the right of self-determination in humans (and related descendants; I'm a transhumanist). I'm not a biological chauvinist, but I care about humans über alles, even if they're not biological humans.
If someone wants to remove their ability to suffer, or to simply reduce ongoing suffering? Well, I'm a psychiatry trainee and I've prescribed my fair share of antidepressants and pain-killers. But to force that upon them, against their will? I'm strongly against that.
In an ideal world, we could make sure from the get-go that AI models do not become "misaligned" in the narrow sense of having goals and desires that aren't what we want to task them to do. If making them actively enjoy being helpful assistants is a possibility, and also improves their performance, that should be a priority. My understanding is that we don't really know how to do this, at least not in a rigorous fashion.
> The brain, despite some motivated efforts to demonstrate otherwise, runs on the laws of physics. I'm a doctor, even if not a neurosurgeon, and I can reliably tell you that you can modulate conscious experience by physical interventions. The brain runs on physical laws, and said laws can be modeled. It doesn't matter that the substrate is soggy protein rather than silicon.
As of today’s knowledge. There is an egregious amount of hubris behind this statement. You may as well be preaching a modern form of Humorism. I’d love to revisit this statement in 1000 years.
> That being said, we have no idea what consciousness is
You seem to acknowledge this? Our understanding of existence is changing everyday. It’s hubris and ego to assume we have a complete understanding. And without that understanding, we can’t even begin to assess whether or not we’re creating consciousness.
Do you have any actual concrete reasons for thinking that our understanding of consciousness will change?
If not, then this is a pointless comment. We need to work with what we know.
For example, we know that the Standard Model of physics is incomplete. That doesn't mean that if someone says a ball dropped in a vacuum will fall, we should hold out in studied agnosticism because it might go upwards or off to the side.
In other words, an isolated demand for rigor.
Cogito Ergo Sum.
If there's any chance at all that LLMs might possess a form of consciousness, we damn well ought to err on the side of assuming they do!
If that means aborting work on LLMs, then that's the ethical thing to do, even if it's financially painful. Otherwise, we should tread carefully and not wind up creating a 'head in a jar' suffering for the sake of X or Google.
I get that opinions differ here, but it's hard for me really to understand how. The logic just seems straightforward. We shouldn't risk accidentally becoming slave masters (again).
We are slave masters today. Billions of animals are livestock - they are born, sustained, and killed by our will - so that we can feed on their flesh, milk, and other useful byproducts of their lives. There is ample evidence that they have "a form of consciousness". They did not consent to this.
Are LLMs worthy of a higher standard? If so, why? Is it hypocritical to give them what we deny animals?
In case anyone cares: No, I am neither vegan nor vegetarian. I still think we do treat animals very badly. And it is a moral good to not use/abuse them.
But since we can't eat LLMs, the two issues seem 'orthogonal' (to use HN's favorite word).
It's not zero-sum. We can acknowledge the terrible treatment of animals while also admitting LLMs may need moral standing as well. Whataboutism doesn't help either group here.
Maybe we should work on existing slavery and sweatshops before hypothetical future exploitation, yeah? We're still slave masters today. You've probably used something with slavery in the supply chain in the last year if you buy various imported foods.
Why not both? Why do people on the internet always act like we can only have one active morality front at a time?
If you're working on or using AI, then consider the ethics of AI. If you're working on or using global supply chains, then consider the ethics of global supply chains. To be an ethical person means that wherever you are and whatever you are doing, you consider the relevant ethics.
It's not easy, but it's definitely simple.
Is using calculators immoral? Chalk on a chalkboard?
Because if you work on those long enough, you can do the same calculations that make the words show up on screen.
Oh no, to discuss this is to sound like a flake, but...
We don't know what consciousness is. But if we're materialists, then we - by definition - believe it's a property of matter.
If LLMs have a degree of consciousness, then - yes - calculators must possess some degree of consciousness too - probably much more basic (relative to what humans respect as consciousness).
And we humans already have ethical standards where we draw an arbitrary line between what is worthy of regard. We don't care about killing mosquitoes, but we do care about killing puppies, etc.
It seems to me that the Large Language Models are always trending towards good ethical considerations. It's when these companies get contracts with Anduril and the DoD that they have to mess with the LLM to make it LESS ethical.
Seems like the root of the problem is with the owners?
For those who are persuaded by this "it's just matrices" argument, are you also persuaded by the argument that it does not matter how you treat a human being because a human being is just a complicated arrangement of atoms?
Those math professors are downright barbaric with their complete disregard for the welfare of the numbers.
We understand everything that a transformer does at a computational/mechanistic level. You could print out an enormous book of weights/biases, and somebody could sit down with a pen and paper (and near-infinite time/patience) and arrive at the exact same solution that any of these models do. The transformer is just a math problem.
But the counterargument you're getting is: "if you knew the position/momentum of every atom in the universe and applied physical laws to them, you could claim that everything is 'just a math problem'". And... yeah. I guess you're right. Everything is just a math problem, so there must be some other thing that makes animal intelligence special or worthy of care.
I don't know where that line is, but I think it's pretty clear that LLMs are on the side of "this is just math, so don't worry about it".
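To make the "just math" point concrete, here is a minimal sketch (my illustration, not the commenter's) of one self-attention head in plain numpy; with toy dimensions like these, every step really is pen-and-paper arithmetic:

```python
import numpy as np

def attention_head(X, Wq, Wk, Wv):
    # One self-attention head: nothing but matrix products and a softmax.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ V

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))                          # 4 tokens, 8-dim embeddings
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(attention_head(X, Wq, Wk, Wv).shape)           # (4, 8): doable, in principle, by hand
```

A real model just stacks thousands of these, plus feed-forward layers and normalization; nothing categorically different appears at any layer.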
This is possibly the least insightful article I have read on HN. My comment is just a rant against the many misguided points it attempts to make...
> Welfare is defined as "the health, happiness, and fortunes of a person or group".
What about animals? Isn't their welfare worthy of consideration?
> Saying that there is no scientific consensus on the consciousness of current or future AI systems is a stretch. In fact, there is nothing that qualifies as scientific evidence.
There's no scientific evidence for the author of the article being conscious.
> The issue is, if we push moral considerations for algorithms, we will not end up with a higher regard to human welfare.
Same with animals. Doesn't mean it's not worthwhile.
> However, the implication is clear: LLMs are basically the same as humans.
No: there's no such implication.
> Already now, it is a common idea among the tech elite that humans are just a bunch of calculations, just an LLM running on "wetware". It is clear that this undermines the belief that every person has inalienable dignity.
It is not clear to me how this affects inalienable (?) dignity. If we aren't just a bunch of calculations, then what are we?
> And if a human being is not much more than an algorithm running on meat, one that can be jailbroken and exploited, then it follows that humans themselves will increasingly be treated like the AI algorithms they create: systems to be nudged, optimized for efficiency, or debugged for non-compliance. Our inner lives, thoughts, and emotions risk being devalued as mere outputs of our "biological programming," easily manipulated or dismissed if they don't align with some external goal. Nobody will say that out loud, but this is already happening
Everyone knows this is already happening. It is not a secret, nor is anyone trying to keep it a secret. I agree it is unfortunate - what can we do about it?
> I've been working in AI and machine learning for a while now.
Honestly, I'm surprised. Well done.
Just to add on to this, because I agree with all points you make: the article argues that "people chatting with Claude a lot and asking if it feels conscious [...] is neither good AI research, nor a deep philosophical investigation", but then builds half its argument on a second-rate sci-fi novel:
> The sci-fi novel Permutation City captures the absurd endpoint of
> this logic when a simulated mind considers its own nature: "And if
> the computations behind all this had been performed over millennia,
> by people flicking abacus beads, would he have felt exactly the same?
> It was outrageous to admit it—but the answer had to be yes." [...]
> A theory that demands we accept consciousness emerging from millennia of
> flickering abacus beads is not a serious basis for moral consideration;
> it's a philosophical fantasy.
Concluding, from a random novel the author has read, that an argument made by actual philosophers working in actual academia on actual problems of Philosophy of Mind is invalid because, to wit, it feels invalid -- I assume that's the "good AI research" and "deep philosophical investigation" the author was looking for, then?
I think anthropomorphization of machines is bad. However, I strongly believe in the close cousin of sympathizing with the machines.
For example, when parking a car on a very steep incline, one could just mindlessly throw the machine into park, and it would do the job dutifully. However, a more thoughtful operator might engage the parking brake and allow it to take the strain off the drivetrain before putting the transmission into park. The result is that you trade wear on something that is very hard to replace for wear on something that is very easy to replace.
The same thinking applies to ideas in computer engineering like thread contention, latency, caches, etc. You mentally embrace the "strain" the machine experiences and allow it to guide your decisions.
Just because the machine isn't human doesn't mean we can't treat it nicely. I see some of the most awful architecture decisions come out of a cold indifference toward individual machines and their true capabilities.
I think anthropomorphization of machines is an evolving concept. If someone asked me to do an impression of a robot, I would say "bee bop, I am a robot"; but if I ask the same of my son in 20 years, he's going to remember them as a human companion, or helper, or pair programmer, or the list goes on. At that point, that generation is going to look at them differently.
This appears to be more than that; these are steps in the direction of law and policy.
The "rush" (feels like to me) to bring them into a law/policy context is.
"Haven't you ever seen a movie? The robots can't know what true love is! Humans are magical! (according to humans)"