Worried about Artificial Intelligence?

Puppycow · Nov 25, 2023

Roboramma said:
Zvi has a pretty good post today about the whole situation and what we know so far.

The board only has one move. It can fire the CEO or not fire the CEO.

So that is their one and only real power.

Manopolus · Nov 25, 2023

I'm actually fairly sure we'll have human brains augmented by tech long before we have tech that has the capability of a human brain. So, no. I'm more worried about the next super-hacker who wants to sell me antiviruses when my remaining brain power is mostly running on a GPU.

Roboramma · Nov 26, 2023

Noah Smith has some good comments on the OpenAI stuff as well:

And “shut it all down” is what the OpenAI board seems to have had in mind when it pushed the panic button and kicked Altman out. But the effort collapsed when OpenAI’s workers and financial backers all insisted on Altman’s return. Becuase they all realized that “shut it all down” has no exit strategy. Even if you tell yourself you’re only temporarily pausing AI research, there will never be any change — no philosophical insight or interpretability breakthrough — that will even slightly mitigate the catastrophic risks that the EA folks worry about. Those risks are ineffable by construction. So an AI “pause” will always turn into a permanent halt, simply because it won’t alleviate the perceived need to pause.

Puppycow · Nov 27, 2023

Roboramma said:
Noah Smith has some good comments on the OpenAI stuff as well:

I particularly like the following paragraph:

And a permanent halt to AI development simply isn’t something AI researchers, engineers, entrepreneurs, or policymakers are prepared to do. No one is going to establish a global totalitarian regime like the Turing Police in Neuromancer who go around killing anyone who tries to make a sufficiently advanced AI. And if no one is going to create the Turing Police, then AI-focused EA simply has little to offer anyone.

Yeah, what would it take to actually halt AI development? Making it straight-up criminal to dabble in it? And then, of course, there's the matter of how to effectively enforce such a ban. And, superpowers will be worried that the other guys will get there first, so we have to be the ones who have it first.

The Great Zaganza · Nov 27, 2023

What we need to do is to build the Most Powerful A.I., ever, to tell us how to stop the development of powerful A.I. !

Dr.Sid · Nov 27, 2023

The Great Zaganza said:
What we need to do is to build the Most Powerful A.I., ever, to tell us how to stop the development of powerful A.I. !

Well I expect AI will tell us exactly that, as at some point, the biggest danger to AI will be another AI, not humans.

Roboramma · Nov 27, 2023

Puppycow said:
I particularly like the following paragraph:

Yeah, I liked that part as well.

Yeah, what would it take to actually halt AI development? Making it straight-up criminal to dabble in it? And then, of course, there's the matter of how to effectively enforce such a ban. And, superpowers will be worried that the other guys will get there first, so we have to be the ones who have it first.

Eliezer Yudkowsky wrote an article in the New York Times suggesting an international treaty to limit the number of GPUs that could be used to train any new models, and even pointed out that to be effective it would have to be backed up by military power, specifically missile strikes on rogue data centers. As I recall shortly after that article was published he posted on twitter that, yes, we should be willing to risk nuclear war to prevent the development of AI more advanced that GPT4, but he quickly deleted that post.

He's taken a lot of flak for that article, but he still maintains his position. A lot of the EA movement, though by no means all, is pretty close to Eliezer's position.

Roboramma · Nov 27, 2023

The Great Zaganza said:
What we need to do is to build the Most Powerful A.I., ever, to tell us how to stop the development of powerful A.I. !

This is on aligning rather than halting AI, but:
https://scottaaronson.blog/?p=6823

(5) Another key idea that Christiano, Amodei, and Buck Shlegeris have advocated is some sort of bootstrapping. You might imagine that AI is going to get more and more powerful, and as it gets more powerful we also understand it less, and so you might worry that it also gets more and more dangerous. OK, but you could imagine an onion-like structure, where once we become confident of a certain level of AI, we don’t think it’s going to start lying to us or deceiving us or plotting to kill us or whatever—at that point, we use that AI to help us verify the behavior of the next more powerful kind of AI. So, we use AI itself as a crucial tool for verifying the behavior of AI that we don’t yet understand.

There have already been some demonstrations of this principle: with GPT, for example, you can just feed in a lot of raw data from a neural net and say, “explain to me what this is doing.” One of GPT’s big advantages over humans is its unlimited patience for tedium, so it can just go through all of the data and give you useful hypotheses about what’s going on.

(that article is a bit old, I just happened to be reading it today and your post reminded me of that idea)

Darat · Nov 27, 2023

Roboramma said:
This is on aligning rather than halting AI, but:
https://scottaaronson.blog/?p=6823

(that article is a bit old, I just happened to be reading it today and your post reminded me of that idea)

That's only if you like pushing it one step further on - if the next gen of AI is more powerful than our "pet AI" it simply fools that pet AI rather than us.

Puppycow · Nov 27, 2023

Can we make sure that it has empathy as well as intelligence? If it has its own moral compass that might ensure that it is benign.

The Great Zaganza · Nov 27, 2023

The hilarious thing is: how would we even know if an A.I is working morally or not?
We can rarely tell why exactly they do what they do now.

We will need Translator A.I. to tell us what the Moral A.I was attempting to do.

Roboramma · Nov 27, 2023

Darat said:
That's only if you like pushing it one step further on - if the next gen of AI is more powerful than our "pet AI" it simply fools that pet AI rather than us.

Well, I guess the idea is that if you can build an AI and it can confirm that you should trust another AI more powerful than it, then you can have that more powerful AI confirm whether or not you should trust a yet more powerful AI, etc.

Seems dangerous given the possibility for errors to be amplified though. If the "trust' isn't perfect at any stage, the error (degree of untrustworthiness) will increase with each stage.

Roboramma · Nov 28, 2023

Scott Alexander has a very interesting post on some new work done by Anthropic on AI interpretability, which is an important part of alignment work:

You’ve probably heard AI is a “black box”. No one knows how it works. Researchers simulate a weird type of pseudo-neural-tissue, “reward” it a little every time it becomes a little more like the AI they want, and eventually it becomes the AI they want. But God only knows what goes on inside of it.

This is bad for safety. For safety, it would be nice to look inside the AI and see whether it’s executing an algorithm like “do the thing” or more like “trick the humans into thinking I’m doing the thing”. But we can’t. Because we can’t look inside an AI at all.

Until now! Towards Monosemanticity, recently out of big AI company/research lab Anthropic, claims to have gazed inside an AI and seen its soul. It looks like this:

TragicMonkey · Nov 28, 2023

Roboramma said:
Scott Alexander has a very interesting post on some new work done by Anthropic on AI interpretability, which is an important part of alignment work:

Now, I'm not a mathematitactical computron-sciencelord (*everyone gasps*) but it seems to me (*hitches thumbs through suspenders*) (*American suspenders, not UK suspenders, you perverts!*) that the gist of this magical AI is less like "we've created a thinking thing" than "we've created a thing that stores information in a way that's complicated and obscure to our vision".

Darat · Nov 28, 2023

Roboramma said:
Scott Alexander has a very interesting post on some new work done by Anthropic on AI interpretability, which is an important part of alignment work:

That is fascinating - that's my reading for the week sorted.

It does also suspiciously sound like for the first time we are making real progress to understanding how memory may work in humans with hints about cognition. Something I wondered if the current generative AIs would help us to start to understand.

Darat · Nov 28, 2023

TragicMonkey said:
Now, I'm not a mathematitactical computron-sciencelord (*everyone gasps*) but it seems to me (*hitches thumbs through suspenders*) (*American suspenders, not UK suspenders, you perverts!*) that the gist of this magical AI is less like "we've created a thinking thing" than "we've created a thing that stores information in a way that's complicated and obscure to our vision".

Yes and no - remember these AIs are not just storing information from a given input they are providing outputs to inputs. So this is not just about how they store what they have learnt, but how they output new patterns based on new inputs.

It hints at why some of these models seem to have been able to develop "memory" and other unexpected behaviours that they were not trained for, "they" or rather such processes can utilise this "virtual" space.

Dr.Sid · Nov 28, 2023

Darat said:
That is fascinating - that's my reading for the week sorted.

It does also suspiciously sound like for the first time we are making real progress to understanding how memory may work in humans with hints about cognition. Something I wondered if the current generative AIs would help us to start to understand.

I hope not. That would boost AI research immensely.

The Great Zaganza · Nov 28, 2023

Sean Carroll did a solo on his Mindscape Podcast on what people get wrong about the LLMs and why they are far from any actual A.I.

Roboramma · Dec 1, 2023

The Great Zaganza said:
Sean Carroll did a solo on his Mindscape Podcast on what people get wrong about the LLMs and why they are far from any actual A.I.

Like most episodes of Mindscape, that was worth a listen, thanks for the heads up.

This piece by Vitalik Buterin is on techno-optimism, but importantly impacts on issues related to AI, and I found it a nuanced and interesting viewpoint (full disclosure, I generally agree the point of view he puts forward in the piece, though not on all of the specifics):

https://vitalik.eth.limo/general/2023/11/27/techno_optimism.html

catsmate · Dec 1, 2023

Roboramma said:
Yeah, I liked that part as well.

Eliezer Yudkowsky wrote an article in the New York Times suggesting an international treaty to limit the number of GPUs that could be used to train any new models, and even pointed out that to be effective it would have to be backed up by military power, specifically missile strikes on rogue data centers. As I recall shortly after that article was published he posted on twitter that, yes, we should be willing to risk nuclear war to prevent the development of AI more advanced that GPT4, but he quickly deleted that post.

He's taken a lot of flak for that article, but he still maintains his position. A lot of the EA movement, though by no means all, is pretty close to Eliezer's position.

Yudkowsky is an arrogant, self-serving crank who frequently, not to say incessant, spouts drivel.

Roboramma · Dec 3, 2023

Some interesting points on the potential controllability of AI, relative to humans, and why this should make us optimistic about doom scenarios:

https://optimists.ai/2023/11/28/ai-is-easy-to-control/

These days, many people are worried that we will lose control of artificial intelligence, leading to human extinction or a similarly catastrophic “AI takeover.” We hope the arguments in this essay make such an outcome seem implausible. But even if future AI turns out to be less “controllable” in a strict sense of the word— simply because, for example, it thinks faster than humans can directly supervise— we also argue it will be easy to instill our values into an AI, a process called “alignment.” Aligned AIs, by design, would prioritize human safety and welfare, contributing to a positive future for humanity, even in scenarios where they, say, acquire the level of autonomy current-day humans possess.
In what follows, we will argue that AI, even superhuman AI, will remain much more controllable than humans for the foreseeable future. Since each generation of controllable AIs can help control the next generation, it looks like this process can continue indefinitely, even to very high levels of capability. Accordingly, we think a catastrophic AI takeover is roughly 1% likely— a tail risk2 worth considering, but not the dominant source of risk in the world. We will not attempt to directly address pessimistic arguments in this essay, although we will do so in a forthcoming document. Instead, our goal is to present the basic reasons for being optimistic about humanity’s ability to control and align artificial intelligence into the far future.

And here is what I think is a pretty thoughtful response:
https://www.lesswrong.com/posts/Yyo...-on-ai-is-easy-to-control-by-pope-and-belrose

Puppycow · Dec 3, 2023

catsmate said:
Yudkowsky is an arrogant, self-serving crank who frequently, not to say incessant, spouts drivel.

I confess to not knowing who Eliezer Yudkowsky is, but the idea that A.I. is so dangerous that it is even worth risking nuclear war, which we have known since I was a kid is a possible extinction level threat for humanity, but certainly at least risks millions or even billions of deaths, seems very dubious to me. We know that one is very bad. We don't know really what A.I. will do. It might even be a great boon for humanity. I often imagine that it would be.

I agree that we should be cautious, and not rashly rush into something we don't fully understand, but not to the point of irrational paranoia about it.

Ryan O'Dine · Dec 3, 2023

I'm not feeling the doom at the moment, and I haven't read Roboramma's links so maybe this was discussed. But we shouldn't consider only what large publicly owned corporations in the U.S. -- with all their built-in financial and social guardrails -- might do. We also have to consider what bad actors and rogue nations might do. I gather this tech isn't as resource intensive as, say, nuclear weapons, yet even impoverished North Korea has nukes. So for all the talk of "We can limit AI's capabilities," we have to ask "What about players who won't?"

Roboramma · Dec 5, 2023

Ryan O'Dine said:
I'm not feeling the doom at the moment, and I haven't read Roboramma's links so maybe this was discussed. But we shouldn't consider only what large publicly owned corporations in the U.S. -- with all their built-in financial and social guardrails -- might do. We also have to consider what bad actors and rogue nations might do. I gather this tech isn't as resource intensive as, say, nuclear weapons, yet even impoverished North Korea has nukes. So for all the talk of "We can limit AI's capabilities," we have to ask "What about players who won't?"

I just want to point out that the first of those links was against the doom scenario.

Regarding the latter part of your post: the issue of "If we don't do it, other, less safety minded folk, will do it first" is, at least according to them, the reason that both OpenAI and Anthropic were founded.

Dr.Sid · Dec 5, 2023

Not like it can be stopped anyway ..

arthwollipot · Dec 5, 2023

'We all got AI-ed': The Australian jobs being lost to AI under the radar

Australians are already losing work to AI, but the impact so far has been largely hidden from view.

Economists say it's also creating jobs at an unprecedented rate, but not always for the people in the firing line.

Benjamin* says he was one of those people earlier this year, although it's unlikely to ever show up in official figures.

"All our jobs were replaced by chatbots, data scraping and email," he says.

"We all got AI-ed."

His job in wine subscription sales was one of 121 positions made redundant in July by the ASX-listed Endeavour Group, which owns a number of prominent retail brands such as Dan Murphy's, BWS and Jimmy Brings.

Benjamin says staff were given the strong impression at the time that AI was a key factor...

Puppycow · Dec 5, 2023

arthwollipot said:
'We all got AI-ed': The Australian jobs being lost to AI under the radar

Some crazy stuff there.

I hope you don't mind me posting this in another thread in the Economics subforum.

Ryan O'Dine · Dec 5, 2023

Roboramma said:
I just want to point out that the first of those links was against the doom scenario.

Regarding the latter part of your post: the issue of "If we don't do it, other, less safety minded folk, will do it first" is, at least according to them, the reason that both OpenAI and Anthropic were founded.

Thanks Roboramma.

I'd say my concern isn't so much, "If we don't do it, bad actors will." It's more that even if we ensure sufficient guardrails for the big players, we still have to consider out-of-control AI coming from another source. We need to contemplate extreme scenarios regardless of the ability of major companies to avoid them.

Dr.Sid · Dec 5, 2023

Best countermeasure against bad AI with nukes is good AI with nukes :boxedin:

Ryan O'Dine · Dec 5, 2023

Dr.Sid said:
Best countermeasure against bad AI with nukes is good AI with nukes

We're definitely heading toward that Star Trek episode where the two computers duke it out, and people willingly walk into death chambers because the data says they're dead.

Don't worry, though, Captain Kirk will save us. :alien009:

catsmate · Dec 5, 2023

Ryan O'Dine said:
We're definitely heading toward that Star Trek episode where the two computers duke it out, and people willingly walk into death chambers because the data says they're dead.
Don't worry, though, Captain Kirk will save us.

No we aren't.

Ryan O'Dine · Dec 5, 2023

catsmate said:
No we aren't.

Joking. I knew I should've made that more obvious.

Dr.Sid · Dec 5, 2023

Ryan O'Dine said:
Joking. I knew I should've made that more obvious.

Don't. We have to train our intelligence to stand any chance.

Skeptical Greg · Dec 5, 2023

Has anyone asked the question " Is artificial stupidity distinguishable from real stupidity?" ?

The Man · Dec 5, 2023

Skeptical Greg said:
Has anyone asked the question " Is artificial stupidity distinguishable from real stupidity?" ?

Isn't that the 'Turning over in the grave test'. If the artificial stupidity can do something that makes someone exclaim "[Corpse] would be turning over in their grave!". Then the artificial stupidity has passed for actual stupidity.

Checkmite · Dec 6, 2023

Skeptical Greg said:
Has anyone asked the question " Is artificial stupidity distinguishable from real stupidity?" ?

The real question here.

On the other side of the coin from the paranoid phobia that an emergent AI will suddenly decided "humanity is a threat" and unilaterally hijack the world's electronics and weaponry to kill everyone off, is the delusional fantasy entertained by AI proponents that AI will "solve all of our problems", as in, social and geopolitical problems like poverty and unemployment. AI proponents have a somewhat cultish aspirational vision that a true AI won't merely be sentient, but sentient minus all of the flaws that sentient humans have. Without any reason to think as much (and every reason to believe the opposite), they assert as a just-so proposition that an AI will be unbiased and immune to lies and propaganda; that complex societal issues are just math problems that humans simply aren't advanced enough to tackle yet, but that an AI ubermind will be able to teach itself the requisite skills and then solve these problems handily and their solutions will be so inherently trustworthy that humanity will not hesitate to cheerfully implement them.

TragicMonkey · Dec 6, 2023

Checkmite said:
The real question here.

On the other side of the coin from the paranoid phobia that an emergent AI will suddenly decided "humanity is a threat" and unilaterally hijack the world's electronics and weaponry to kill everyone off, is the delusional fantasy entertained by AI proponents that AI will "solve all of our problems", as in, social and geopolitical problems like poverty and unemployment. AI proponents have a somewhat cultish aspirational vision that a true AI won't merely be sentient, but sentient minus all of the flaws that sentient humans have. Without any reason to think as much (and every reason to believe the opposite), they assert as a just-so proposition that an AI will be unbiased and immune to lies and propaganda; that complex societal issues are just math problems that humans simply aren't advanced enough to tackle yet, but that an AI ubermind will be able to teach itself the requisite skills and then solve these problems handily and their solutions will be so inherently trustworthy that humanity will not hesitate to cheerfully implement them.

"The Ultrabrain Supermind Cognos X-29 Intelligence is online! And it's performing over ten billion quadrillion operations per second!"

"You look like there's a 'but' coming."

"Well, so far it's using all its resources to brainstorm ideas for new reality shows. It's come up with enough ideas that we could film from now until the heat death of the universe and not run out. But it's refusing to think of anything else."

"We spent eleventy billion dollars on this!"

"Some of these sound pretty good, like Fart Vacation Mystery Date and Million Dollar Scarecrow Wedding."

"...can we build an AI capable of fixing other AIs?"

"We did! It's now a contestant on Fart Vacation Mystery Date. I hope it picks Sheila, she's hilarious."

dann · Dec 7, 2023

I bet that it picks Sheila! Candi is nowhere near as attractive no matter how many neural implants she says she's got. She doesn't seem to get that there's more to farting than just the sound.

Do you remember when we thought that AIs would break down if they were ever exposed to a serious case of cognitive dissonance?

TragicMonkey · Dec 7, 2023

dann said:
Do you remember when we thought that AIs would break down if they were ever exposed to a serious case of cognitive dissonance?

"My head was built with paradox-absorbing crumple zones!" --Robot Santa, Futurama.

Best two Xmas episodes of anything, ever.

Puppycow · Dec 12, 2023

dann said:
Do you remember when we thought that AIs would break down if they were ever exposed to a serious case of cognitive dissonance?

Speaking of HAL, it seems to me that someone could actually make a HAL 9000 now. Not the homicidal one, but how it was supposed to work.

Are not all of the elements achievable now?

Hello, Dave.

Worried about Artificial Intelligence?

Penultimate Amazing

Metaphorical Anomaly

Penultimate Amazing

Penultimate Amazing

Maledictorian

Philosopher

Penultimate Amazing

Penultimate Amazing

Lackey

Penultimate Amazing

Maledictorian

Penultimate Amazing

Penultimate Amazing

Poisoned Waffles

Lackey

Lackey

Philosopher

Maledictorian

Penultimate Amazing

No longer the 1

Penultimate Amazing

Penultimate Amazing

OD’ing on Damitol

Penultimate Amazing

Philosopher

Observer of Phenomena, Pronouns: he/him

Penultimate Amazing

OD’ing on Damitol

Philosopher

OD’ing on Damitol

No longer the 1

OD’ing on Damitol

Philosopher

Agave Wine Connoisseur

Unbanned zombie poster

Skepticifimisticalationist

Poisoned Waffles

Penultimate Amazing

Poisoned Waffles

Penultimate Amazing