Solving aging – is AI all we need? Should resources be diverted away from biotech in order to accelerate the advent of god-like AI?

Love, attention, and scale

In June 1967, the Beatles premiered a glorious new song: ‘All You Need Is Love’. The performance was the UK’s contribution to the world’s first global satellite television broadcast, watched simultaneously by over 400 million people in 25 different countries. The broadcast took place during what became known as the Summer of Love, and the song became a powerful anthem of flower power.

The Beatles’ manager Brian Epstein had described the performance as the band’s finest moment, but it turned out that singing “all you need is love” wasn’t quite enough to bring about a world of peace and harmony. 

Almost exactly 50 years later, a group of eight researchers at Google were searching for a title for an article they were about to publish. They settled on “Attention is all you need” – the title being the brainchild of the only Briton on the team, Llion Jones, who had grown up in north Wales, not far from Liverpool, the home of the Beatles. The article has attained legendary status within the global AI community, for its introduction of the transformer technology that underpins breakthrough AI initiatives such as ChatGPT.

Despite omitting architectural features that were previously thought to be essential for many text-based processing tasks, transformers excelled in these same tasks. The key innovation, which was to pay special attention to whichever parts of the input appeared most salient, turned out to give these AI systems a strong competitive advantage. The Attention is all you need paper correctly predicted that transformers could handle not just text but also other kinds of data, including pictures and sounds.
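For readers who like to see an idea in code, here is a minimal sketch of that attention operation, written in Python with NumPy. The tiny random matrices are placeholders of my own invention – real models learn these projections from data, stack many such layers, and use multiple attention ‘heads’ – but the core computation matches the description above: each position scores every other position for salience, then takes a weighted mix of the corresponding values.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Q, K, V: arrays of shape (sequence_length, dimension)."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                   # how salient each position is to each other position
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax: salience scores become mixing weights
    return weights @ V                              # weighted mix of the value vectors

rng = np.random.default_rng(0)
sequence_length, dimension = 5, 8                   # a five-token toy "sentence"
X = rng.normal(size=(sequence_length, dimension))   # placeholder token representations

# In a real transformer these projection matrices are learned, not random.
Wq, Wk, Wv = (rng.normal(size=(dimension, dimension)) for _ in range(3))
output = scaled_dot_product_attention(X @ Wq, X @ Wk, X @ Wv)
print(output.shape)                                 # (5, 8): one updated representation per token
```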

How far might transformers take AI? A third claim has increasingly been heard: “Scale is all you need”. Feed transformer systems ever larger amounts of data, and provide them with ever more powerful computer chips to crunch all that data into models with ever greater numbers of parameters, and there is no limit to the degree of intelligence that can result. The “scale is all you need” hypothesis anticipates that AIs with fully general reasoning capabilities will emerge simply from doing more of the same.
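To give a flavour of what ‘scaling laws’ mean in practice: empirical studies of language-model training (such as DeepMind’s ‘Chinchilla’ analysis) fit a model’s prediction error to a formula of roughly this shape, where N is the number of parameters, D is the amount of training data, and the constants are fitted from experiments rather than derived from theory:

L(N, D) ≈ E + A / N^α + B / D^β

The error keeps falling as N and D grow – which encourages the ‘scale is all you need’ camp – but it falls with diminishing returns, and the formula is an empirical fit, not a guarantee that more scale will yield fully general reasoning.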

In this context, I want to examine yet another “all you need” hypothesis. It’s a hypothesis that is already changing investment decisions and personal career trajectories. It’s the hypothesis that, whatever major problem you hope to solve, the best way to solve it is to start by creating general intelligence.

In this way of thinking, AI is all you need. An AI with god-like abilities will be able to race ahead of slow-witted humans to solve all the fundamental problems of science, medicine, and human existence.

Credit: David Wood, aided by Midjourney AI

Machines of loving grace

The same thought is expressed in the recent provocative essay by the founder and CEO of Anthropic, Dario Amodei: Machines of Loving Grace – How AI Could Transform the World for the Better.

Amodei states it as follows: “I think that most people are underestimating just how radical the upside of AI could be… My basic prediction is that AI-enabled biology and medicine will allow us to compress the progress that human biologists would have achieved over the next 50-100 years into 5-10 years”.

Amodei gives some examples of the discoveries that AI-enabled science could make:

  • “Design of better computational tools like AlphaFold and AlphaProteo”
  • “More efficient and selective CRISPR” (for gene-editing)
  • “More advanced cell therapies”
  • “Materials science and miniaturization breakthroughs leading to better implanted devices”
  • “Better control over stem cells, cell differentiation, and de-differentiation, and a resulting ability to regrow or reshape tissue”
  • “Better control over the immune system: turning it on selectively to address cancer and infectious disease, and turning it off selectively to address autoimmune diseases”.

Who wouldn’t like such a vision?

According to this logic, spending effort in the next few years to create AI with these capabilities is a better investment than spending the same effort to improve biology and medicine here and now.

Credit: David Wood, aided by Midjourney AI

Funding is what marshals effort, and any funds at someone’s disposal should, it appears, be directed towards improving AI, rather than towards companies or foundations that are seeking to improve biology or medicine. Right?

A two-step mission statement

Back in February 2015, Demis Hassabis was relatively unknown. There had been a bit of press about the purchase of his company, DeepMind, by Google, for £400 million, but most people had little conception of what the company would accomplish in the following years.

Hassabis was giving a talk at CUTEC – the Cambridge University Technology and Enterprise Club. A photo from that talk is preserved on Reddit:

Credit: Reddit

You can also read on that page on Reddit, from nearly ten years ago, some fascinatingly scathing comments about that mission statement:

  • “Ridiculous and poorly-defined goals”
  • “FFS [what] a mission statement [for] a company”
  • “‘Fundamentally solve intelligence’ in the linked screenshot above is a whole load of nonsense”
  • “I don’t even think we have a working definition for ‘intelligence’ yet. We don’t even know how it works in humans… How can we hope to recreate it before knowing what it is?”

But step forward to October 2024, with the announcement of the winners of this year’s Nobel Prize in Chemistry – awarded for computational protein design and for protein structure prediction. The mission statement outlined long ago for DeepMind now seems much more credible.

Once intelligence has been “fundamentally solved”, it should be relatively straightforward to solve climate change, economic distribution, cancer, dementia, and aging, right?

After all, given an AI model that can correctly predict how a long string of amino acids will fold up as a protein in three dimensions, won’t a scaled-up version of that model be able to predict other interactions between biochemical molecules – and, indeed, to predict how biological cells will respond to all kinds of proposed interventions?

The data bottleneck

One person striking a note of caution against exuberant forecasts of rapid further progress in AI for medicine was David Baker of the University of Washington, who shared the Nobel Prize with Demis Hassabis.

In an article published in MIT Technology Review shortly after the Nobel Prize, Baker pointed out that “AI needs masses of high-quality data to be useful for science, and databases containing that sort of data are rare”.

Indeed, the stunning success of DeepMind’s AlphaFold AI was fundamentally dependent on prior decades of painstaking work by numerous scientists to assemble what is known as the PDB – the Protein Data Bank.

The third of the joint winners, John Jumper of DeepMind, acknowledged this dependency in a press conference after the prize was announced. Jumper said, “I also want to really thank the giants on whose shoulders we stand, I think the entire experimental community, the people that developed the ability to measure protein structures, especially to Helen Berman and other pioneers of the Protein Data Bank, the PDB, who had the foresight to put these data together to make it available”.

Helen Berman had pioneered the PDB from 1971. As she graciously commented in a recent interview, “I am a very lucky person to have had an idea as a student, pursued that idea for more than 50 years, and then seen brand new science emerge for which three people have won this year’s Nobel Prize. It is really gratifying”.

Remarkably, Berman’s interest in protein folding predates even the Beatles song. In an online living history memoir written in 2012, Berman notes “In 1966 …I became fascinated by the world of protein folding. As part of my Ph.D. qualifier, … I proposed to perform structure-based sequence comparisons of known proteins…”.

Progress in determining protein structures was slow for a long time before speeding up. This slide from a 2009 presentation by Berman, graphing the growth in the total number of proteins documented in the PDB, will look familiar to anyone acquainted with singularitarian ideas:

In the MIT Technology Review article, ‘A data bottleneck is holding AI science back’, David Baker pointed out that “If the data that is fed into AI models is not good, the outcomes won’t be dazzling either. Garbage in, garbage out”.

The subtitle of that article says it straightforwardly: “AI’s usefulness for scientific discovery will be stunted without high-quality data”.

So, we can forget “AI is all we need”. Before we can develop an AI that can solve aging for us, we will need to obtain suitable data on which that AI can be trained. We’ll need the equivalent of PDB for all the interventions that might remove or repair the low-level biological damage that we call aging.

Unless, that is, the AI has a very special kind of superintelligence, which allows it to reach conclusions even in the absence of adequate data. Let’s turn to that option next.

AI Zero?

AlphaGo, the AI which achieved worldwide renown in March 2016 by defeating human Go superstar Lee Sedol, gained that ability by studying around 160,000 games played between expert-level human Go players. The design of that version of the AI depended utterly on learning which moves tended to be selected by the best human players in a wide variety of situations.

AlphaGo’s success against Lee Sedol was rightly celebrated, but what happened in the following year was arguably even more startling. As reported in an article in Nature in October 2017, a new version of the AI, dubbed “AlphaGo Zero”, was given no data from human games; nor did it receive any human feedback on moves it suggested. Instead, it started tabula rasa, knowing only the rules of the game, before proceeding to play itself 4.9 million times in just three days.

AlphaGo Zero’s new self-play algorithms proved sufficient to reach higher levels than the earlier version (sometimes called “AlphaGo Lee”) that had played Lee Sedol. When AlphaGo Zero played 100 games against AlphaGo Lee, it won every single game.
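As a toy illustration of the self-play principle – and emphatically not DeepMind’s actual algorithm, which combined deep neural networks with Monte Carlo tree search – here is a sketch in Python in which an agent masters a much simpler game (Nim: take one to three stones, and whoever takes the last stone wins) purely by playing against itself, starting from nothing but the rules:

```python
import random
from collections import defaultdict

N_STONES = 10
ACTIONS = [1, 2, 3]
EPSILON = 0.1      # exploration rate
ALPHA = 0.5        # learning rate

Q = defaultdict(float)   # Q[(stones_remaining, action)] -> estimated value for the player to move

def choose(stones, explore=True):
    legal = [a for a in ACTIONS if a <= stones]
    if explore and random.random() < EPSILON:
        return random.choice(legal)
    return max(legal, key=lambda a: Q[(stones, a)])

def self_play_episode():
    history = []            # (stones, action) pairs, alternating between the two "players"
    stones = N_STONES
    while stones > 0:
        action = choose(stones)
        history.append((stones, action))
        stones -= action
    reward = 1.0            # the player who took the last stone wins
    for stones, action in reversed(history):
        Q[(stones, action)] += ALPHA * (reward - Q[(stones, action)])
        reward = -reward    # flip perspective for the other player's earlier move

for _ in range(20000):
    self_play_episode()

# The agent should now favour moves that leave the opponent a multiple of four stones
# (from 4 or 8 stones every move loses, so the choice there is arbitrary).
for stones in range(1, N_STONES + 1):
    print(stones, "->", choose(stones, explore=False))
```

After a few thousand games against itself, the toy agent discovers the winning strategy without ever seeing a human game. The same principle – at vastly greater scale and sophistication – is what allowed AlphaGo Zero to surpass AlphaGo Lee.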

A similar pattern can be observed in the progress of AIs that process text. The trend is to require less and less explicit human guidance.

Consider AIs that translate between two languages. From the 1950s onward, designers of these systems provided ever-larger numbers of rules about grammar and sentence structure – including information about exceptions to the rules. Later systems depended on AIs observing, by themselves, statistical connections in various matching sets of text – such as the official translations of materials from the European Parliament, the Canadian Parliament, and the United Nations.
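To make the phrase ‘statistical connections’ concrete, here is a toy sketch of the underlying idea: word pairs that repeatedly show up in the same sentence pairs of a parallel corpus are probably translations of each other. The four-sentence corpus below is invented purely for illustration, and the scoring (a simple Dice association measure) is far cruder than the probabilistic alignment models that real systems used – but the principle of letting the data speak for itself is the same:

```python
from collections import Counter

parallel_corpus = [
    ("the house is small", "das haus ist klein"),
    ("the house is big",   "das haus ist gross"),
    ("the book is small",  "das buch ist klein"),
    ("a book is big",      "ein buch ist gross"),
]

cooc = Counter()       # how often an (english, german) word pair shares a sentence pair
count_e = Counter()    # how often each English word appears
count_g = Counter()    # how often each German word appears

for english, german in parallel_corpus:
    e_words, g_words = english.split(), german.split()
    count_e.update(e_words)
    count_g.update(g_words)
    for e in e_words:
        for g in g_words:
            cooc[(e, g)] += 1

def best_translation(e):
    candidates = [g for (x, g) in cooc if x == e]
    # Dice coefficient: rewards pairs that co-occur often, relative to how
    # often each word occurs on its own.
    return max(candidates, key=lambda g: 2 * cooc[(e, g)] / (count_e[e] + count_g[g]))

for e in sorted(count_e):
    print(f"{e:7s} -> {best_translation(e)}")
```

Run on this tiny corpus, the script pairs ‘house’ with ‘haus’, ‘small’ with ‘klein’, and so on – nobody has told it any rules of grammar or supplied a dictionary.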

Managers noticed that the statisticians tended to produce better results than linguists who toiled to document every jot and tittle of grammatical variations. Infamously, Frederick Jelinek, a lead researcher at IBM, remarked that “Every time I fire a linguist, the performance of the speech recognizer goes up”. Performance jumped up again with the adoption of deep neural networks from 2012 onward, with the translations now being accurate not only at the word-for-word level but also at the sentence-for-sentence level.

A final significant jump came when transformer-based AIs were adopted. (The word “transformer” had been chosen to reflect the ability of these systems to transform text from one language into another.) As mentioned earlier, transformers are powerful because their algorithms can work out the strengths of connections between different parts of text input by themselves; they don’t need these connections to be pointed out by humans.

Could something similar happen with medical AIs of the future? Could such an AI find sufficient reliable information in an ocean of less reliable data, and therefore propose what steps should be taken to solve aging?

Credit: David Wood, aided by Midjourney AI

AI omniscience?

To recap: AlphaGo Lee needed detailed guidance from humans before it could improve itself to superhuman level; but its successor, AlphaGo Zero, attained that level (and exceeded it) simply by the power of its own vast intelligence.

Might it be similar with medical AI? Today’s AI medical systems are constrained by the extent of data, but might a future AI be able to work out all the principles of biology (including biology in which there is no aging) by starting tabula rasa (with a blank slate)?

‘All You Need Is Love’ said, “there’s nothing you can know that isn’t known”. The ‘all you need is AI’ approach would mean there’s nothing that can be known that the AI doesn’t know. Effectively, the AI would be omniscient.

Well, count me sceptical. It’s my view that some things need to be discovered, rather than simply deduced.

For example, why are there eight planets in our solar system, rather than thirteen? No principles of astronomy, by themselves, could determine that answer. Instead, the configuration of our solar system depends on some brute facts about the initial conditions under which the solar system formed. The only way to know the number of planets is to count them.

Credit: David Wood, aided by Midjourney AI

Again, why has life on our planet adopted a particular coding scheme, in which specific triplets of the nucleotides A, T, C, and G result in specific amino acids being formed? Why did Homo sapiens lose the ability to synthesize vitamin C, along with other genetic features that would have been useful to us? Why are particular genes found on specific chromosomes? The only way to know which genes are located where is to look and see. No “AI Zero” is going to discover the answer by meditating in a void.

Therefore, I do not accept that “AI is all you need”. Data is also needed – critical data.

This need is correctly recognized in the article Machines of Loving Grace by Dario Amodei, which I’ve already quoted. Amodei includes in the article “a list of factors that limit or are complementary to intelligence”. One of these items is “Need for data”.

Amodei comments: “Sometimes raw data is lacking and in its absence more intelligence does not help. Today’s particle physicists are very ingenious and have developed a wide range of theories, but lack the data to choose between them because particle accelerator data is so limited. It is not clear that they would do drastically better if they were superintelligent – other than perhaps by speeding up the construction of a bigger accelerator.”

AI as Principal Investigator?

Amodei offers a bold solution to this lack of data: “The right way to think of AI is not as a method of data analysis, but as a virtual biologist who performs all the tasks biologists do, including designing and running experiments in the real world (by controlling lab robots or simply telling humans which experiments to run – as a Principal Investigator would to their graduate students), inventing new biological methods or measurement techniques, and so on.”

Credit: David Wood, aided by Midjourney AI

Amodei adds: “It is by speeding up the whole research process that AI can truly accelerate biology.”

He continues: “I want to repeat this because it’s the most common misconception that comes up when I talk about AI’s ability to transform biology: I am not talking about AI as merely a tool to analyze data. …I’m talking about using AI to perform, direct, and improve upon nearly everything biologists do.”

Amodei highlights the power of intelligence to transcend the limitations of its data: “You might believe that technological progress is saturated or rate-limited by real world data or by social factors, and that better-than-human intelligence will add very little. This seems implausible to me – I can think of hundreds of scientific or even social problems where a large group of really smart people would drastically speed up progress, especially if they aren’t limited to analysis and can make things happen in the real world”. Replace the “large group of really smart people” by an artificial superintelligence, and Amodei expects progress in science to rocket forward.

It’s an attractive vision, and I urge everyone to read Amodei’s entire essay carefully. (It covers many more topics than I can address in this article.)

But in case anyone is inclined to deprioritize existing research into promising lines of rejuvenation biotechnology, I have four remaining points to make: three concerns and one huge opportunity.

Three concerns and a huge opportunity

My first concern is that the pace of progress in AI capabilities will significantly slow down. For example, the data scaling laws may hit an impasse, so that applying more data to train new AI systems will fail to create the kind of superintelligence expected.

Personally I think that such a “wall” is unlikely, especially since AI developers have many other ideas in mind for how AI could be improved. But the possibility needs to be considered.

Second, it’s possible that AI capabilities will continue to surge ahead, but that the resulting AI systems cause catastrophic harm to human wellbeing. In this scenario, rather than the AI curing you and me of a fatal condition – aging – it causes us to die as a side-effect of a bad configuration, bad connectivity to fragile global infrastructure, an alien-like bug in its deep thinking processes, or simple misuse by bad actors (or naïve ones).

The leaders of the corporations which are trying to create artificial superintelligence – people like Demis Hassabis, Dario Amodei, Sam Altman, Elon Musk, Ben Goertzel, and a number of Chinese counterparts – say they are well aware of these dangers, and are taking due care to follow appropriate safety processes. But creating artificial superintelligence is an intensely competitive race, and that risks corners being cut.

Credit: David Wood, aided by Midjourney AI

Third, the public may, very reasonably, demand more safeguards against the kind of suicide race just depicted. Specifically, an agreement might be reached by the USA and China, with the support of many other countries, that all progress towards artificial superintelligence should be blocked.

This agreement, with appropriate monitoring and enforcement mechanisms, would have the same effect as in the first concern above: AI progress hits a wall. But this time, it will be a wall imposed by regulations, rather than one intrinsic to the engineering of AI.

Some critics have responded that the chances are very slim for such an agreement to be reached and adopted. However, I disagree. That’s on account of both a stick and a carrot.

The stick is the growing public awareness of the catastrophic risks that new generations of AI bring. (That awareness is still on the slow part of the exponential growth curve, but may well accelerate, especially if there is a scandalous disaster from existing AI systems, something like an AI Chernobyl.)

The carrot is a clearer understanding that all the benefits we want from artificial superintelligence can also be obtained from an AI with humbler powers – an AI that:

  • Is only modestly more capable than today’s best AIs
  • Lacks any possibility to develop autonomy, sentience, or independent volition
  • Will remain a passive, safe, but incredibly useful tool.

In a moment, I’ll say more about this huge opportunity. But first, let me interject an analogy about the choices facing humanity, as we contemplate how we might manage AI.

Peaceful progress or violent overthrow?

“Tear down the barricades!”

“Expropriate the expropriators!”

“Lock up the élites!”

“String up the capitalists!”

“Overthrow the ruling class!”

Such are the calls of revolutionaries in a hurry. However, the lesson of history is that violent revolutions tend to end up “devouring their own children” – to quote a phrase spoken by Jacques Mallet du Pan (referring to the French Revolution sending its original leaders to the guillotine) and also by former Hitler loyalist Ernst Röhm.

Similar remarks could have been uttered by many of the one-time supporters of Vladimir Lenin or Joseph Stalin, who subsequently found themselves denounced and subject to show trials.

However, the saying is not entirely correct. Some revolutions avoid subsequent internal bloodbaths: consider the American War of Independence, and the Glorious Revolution of 1688 in England.

When revolutionaries uphold principle ahead of power-seeking, maintain a clear grip on reality (rather than becoming lost in self-deception), and continue to respect wise process (rather than allowing dictatorial leaders to do whatever they please), a revolution can lead to sustained progress with increased human flourishing.

Now consider the difference between what can be called “democratic socialists” and “Marxist-Leninists”. The former highlight ways in which the plight of the working class can be alleviated, stage by stage, through gradual societal reform. The latter lose patience with such a painstaking approach, and unleash a host of furies.

In case it’s not clear, I’m on the side of the democratic socialists, rather than the would-be revolutionaries who make themselves into gods and absolute arbiters.

For how humanity chooses to develop and deploy AI, I see the same kind of choice: between “harness accelerationists” and “absolute accelerationists”.

Harness accelerationists wish to apply steering and brakes, as well as pressing firmly on the throttle when needed. 

Absolute accelerationists are happy to take their chances with whatever kind of AI emerges from a fast and furious development process. Indeed, the absolute accelerationists want to tear down regulation, lock up safety activists, and overthrow what they see as the mediocrity of existing international institutions.

Once again, in case it’s not clear, I’m on the side of harnessing acceleration. (Anyone still on X aka Twitter can see the “h/acc” label in my name on that platform.)

Harnessing requires more skill – more finesse – than keeping your foot pressed hard to the floor. I understand why absolute accelerationists find their approach psychologically comforting. It’s the same appeal as the Marxist promise that the victory of the working class is inevitable. But I see such choices as being paths toward humanitarian catastrophe.

Credit: David Wood, aided by Midjourney AI

Instead, we can proceed quickly to solving aging, without awaiting the emergence of a hopefully benevolent god-like AI.

Solving aging – without superintelligence

Above, I promised three concerns and one huge opportunity. The opportunity is that it’s pretty straightforward to solve aging, without waiting for a potentially catastrophically dangerous artificial superintelligence. There are low-hanging fruits which aren’t being picked – in part because funding for such projects is being diverted instead to AI startups.

Aging occurs because the body’s damage-repair mechanisms falter. Our metabolism runs through countless biochemical interactions, and low-level biological damage arises as a natural consequence – due to injuries inflicted by the environment, bad lifestyle choices, the inevitable side-effects even of good lifestyle choices, or (perhaps) because of programmed obsolescence. When we are young, lots of that damage is routinely repaired or replaced soon after it occurs, but these replacement and repair mechanisms lose their effectiveness over time. The consequence is that our bodies become more prone to all sorts of disease and infirmity. That’s aging.

The most promising path to solving aging is to comprehensively reinforce or complement these damage-repair mechanisms. The low-hanging fruit is that we have a long list of ways this might be achieved:

  • By taking inspiration from various animal species in which at least some of the damage-repair mechanisms are better than in humans
  • By understanding what’s different about the damage-repair mechanisms in ‘human superagers’
  • By designing and applying new interventions at the biotech or nanotech levels.

To be clear, this does not mean that we have to understand all of human biological metabolism. That’s horrendously complicated, with numerous side-effects. Nor do we even need to understand all the mechanisms whereby damage accumulates. Instead, we just need to observe, as engineers, what happens when new damage-repair mechanisms are applied in various animals.

These mechanisms include senolytics that clean up senescent cells (sometimes called “zombie cells”), extending telomeres at the ends of chromosomes, reversing some of the epigenetic alterations that accumulate on our DNA, introducing specially programmed new stem cells, nanoparticles which can break up accumulated plaques and tangles, re-energising the mitochondria within our cells – and much more.

In each case, some useful research is being done on the viability of introducing these repair mechanisms. But nothing like enough.

We particularly need tests of the long-term effects of damage-repair interventions, especially when applied in combination. These tests can determine something that even an artificial superintelligence would find difficult to predict by meditating in a void: which damage-repair interventions will positively synergize with each other, and which will have antagonistic effects.
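As a minimal sketch of what such combination tests make possible, here is the kind of back-of-the-envelope comparison they support: does the lifespan gain from a combined treatment exceed, match, or fall short of the sum of the individual gains? The mouse lifespan figures below are entirely invented for illustration – they are not results from LEVF or anyone else – and a real analysis would use proper survival statistics rather than simple medians:

```python
from statistics import median

# Hypothetical median-lifespan data (in days) for four groups of mice.
lifespans = {
    "control":        [820, 845, 860, 875, 890],
    "treatment_A":    [880, 900, 915, 930, 950],
    "treatment_B":    [870, 895, 905, 925, 940],
    "combination_AB": [930, 960, 985, 1000, 1020],
}

baseline = median(lifespans["control"])
gain = {name: median(days) - baseline for name, days in lifespans.items()}

additive_expectation = gain["treatment_A"] + gain["treatment_B"]
observed_combined = gain["combination_AB"]

print(f"Gain from A alone:        {gain['treatment_A']:.0f} days")
print(f"Gain from B alone:        {gain['treatment_B']:.0f} days")
print(f"Additive expectation:     {additive_expectation:.0f} days")
print(f"Observed for combination: {observed_combined:.0f} days")

if observed_combined > additive_expectation:
    print("Apparent synergy (subject to proper statistical testing).")
elif observed_combined < additive_expectation:
    print("Apparent antagonism (subject to proper statistical testing).")
else:
    print("Roughly additive effect.")
```

Crucially, the answer comes from running the experiment and looking at the data – not from deduction in a void.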

Such tests are being pursued by one organisation where I need to declare an interest: the Longevity Escape Velocity Foundation (LEVF), where I have a role on the leadership team, and whose underlying ideas I have supported for nearly 20 years, since first coming across them in meetings of what was the forerunner of London Futurists.

LEVF is carrying out a number of extended projects on large numbers of mice, involving combining treatments that have already been proven to individually extend the lifespan of mice treated from middle age. Interim results of the first such project, RMR1, can be reviewed here (RMR = Robust Mouse Rejuvenation), and plans for the second one, RMR2, have been posted here.

Rather cheekily, may I suggest that the 1967 slogan of the Beatles, All you need is love, got two letters wrong in the final word?

Credit: David Wood, aided by Midjourney AI

Two scenarios for trying to solve aging

To conclude, I envision two competing scenarios ahead, for how aging should be solved:

  • An “AI first” strategy, in which important research into rejuvenation biotechnology is starved of funding, with money being preferentially allocated to general AI initiatives whose outcomes remain deeply uncertain.
  • A “damage-repair research now” strategy, in which projects such as RMR2 receive ample funding to proceed at pace (and, even better, in multiple different versions in parallel, including in animals larger than mice), with the data produced by such experiments then being available to train AIs which can complement the ingenuity of pioneering human researchers.

What’s your pick?


Ten books to read to understand technology and change

Looking for a guidebook to help you navigate our changing world?

Has the pace of change in the 21st century got you disorientated? 

Let me draw your attention to ten books I’ve read recently. They each deal with the development of technology in the present and the near future, and its effects on society. Each of them is eye-opening and thought-provoking in its own way. Indeed, they might change your life path, so beware!

Credit: David Wood

1) Power, Sex, Suicide: Mitochondria and the meaning of life. By Nick Lane.

Fascinating account of the remarkable (and unlikely) evolutionary journey from non-life to modern warm-blooded life. With plenty of insights along the way regarding energy, sex, aging, and death. You’ll wonder why you never knew about this before.

2) Methuselah’s Zoo: What nature can teach us about living longer, healthier lives. By Steven Austad.

A different view regarding what animals can teach us about aging. Many animals live longer, healthier lives than any simple theory would predict – this book explains why and considers the implications for human aging, and for what kind of studies rejuvenation researchers should prioritize.

3) Eve: How the female body drove 200 million years of human evolution. By Cat Bohannon.

Milk. The womb. Menopause. Perception. Tools. Voice. The brain. Love. When you look at the long span of evolution from a female perspective, many things fall into place in an inspiring new way. A welcome reminder that our approach to science often suffers from being male-centric.

4) We Are Electric: The new science of our body’s electrome. By Sally Adee.

A look at biology from a fascinating alternative angle. The electricity throughout our bodies is involved in more processes than we previously thought. Move over genome, epigenome, and biome: make way for the electrome.

5) Sentience: The invention of consciousness. By Nicholas Humphrey.

Why did evolution give rise to phenomenological consciousness? How can we detect and assess consciousness throughout the animal kingdom? And what are the implications for AIs that might be sentient? Lots of captivating biographical asides along the way.

Credit: Tesfu Assefa

6) The Other Pandemic: How QAnon contaminated the world. By James Ball.

Evolution has produced not just intelligence and beauty but also viruses and other pathogens. Mental pathogens (‘memes’) have lots in common with their biological analogues. That’s one reason why the whole world may be on the point of going crazy.

7) The Deadly Rise of Anti-Science: A scientist’s warning. By Peter Hotez.

Part of the growing wave of social irrationality is a determined virulent opposition to the patient methods and hard-won insights of science. Millions have already died as a result. There may be worse ahead. What lies behind these developments? And how can they be parried?

8) End Times: Elites, counter-elites, and the path of political disintegration. By Peter Turchin.

Can we ever have a science of history? Is that idea a fantasy? This book argues that there are important patterns that transcend individual periods of revolutionary turmoil. However, there’s no inevitability in these patterns, provided we are wise and pay attention. You’ll never look at history the same way again.

9) The Coming Wave: Technology, power, and the 21st century’s greatest dilemma. By Mustafa Suleyman.

Current debates about the safety of powerful AI systems should be understood in wider context: economic, political, and historical context. Following a full diagnosis, a ten-stage multi-level plan provides some grounds for optimism.

10) Uncontrollable: The threat of artificial superintelligence and the race to save the world. By Darren McKee.

Will powerful AI systems pose catastrophic risks to humanity? Are you, as an individual, helpless to reduce these risks? Read this book to find out. Written compellingly, with particular clarity.


Superintelligence and ethics: How will ASIs assess human ethical frameworks?

Consider an anthropologist, Eve, who grew up in one of the world’s leading economies, and attended a distinguished university. She then traveled to spend a number of years studying a newly discovered tribe in a previously remote part of the planet. Let’s call that tribe the Humanos.

Eventually, Eve learns to communicate with members of the Humanos. She observes that they have a fascinating culture, with, she thinks, some quirks as well as wonders. She learns about their unusual dietary restrictions, their rules about intimacy and marriage, their legal system (including capital punishment for some crimes, such as insubordination), and their habit of ritually sacrificing a number of young girls and boys each year on the Spring Equinox.

Eve has brought her own young son with her, to accompany her on her study. Her Humanos hosts tell her: this year, you must offer up your own son as one of the sacrifices. That is the way of the Humanos. It is a profound imperative of the unbroken chain of being from our ancestors long ago, who came from the heavens. Our wisest elders know this to be the fundamentally moral thing to do.

An anthropologist casts a critical eye over her surroundings (Credit: David Wood via Midjourney)

How would you react, in such a situation?

At this point, Eve is likely to adopt a critical stance. Rather than accepting the moral code of the Humanos, she will apply her own independent judgment.

She might reason that the culture of the Humanos may have served various purposes well enough, over the centuries, but that it doesn’t have universal status. After all, the Humanos are a primitive people, ignorant of physics, biology, neuroscience, anthropology, and so on.

Eve may even call in the nearest police force, to bring an abrupt end to the tradition of child sacrifice proudly practiced by the tribespeople. She would have to contend with passionate protests from the Humanos elders, who are sure they know better.

Now, let’s compare this to a situation that may arise reasonably soon in the future.

In this new situation, instead of a visiting anthropologist, consider an emergent superintelligence named Asimov – a so-called ASI (artificial superintelligence). After being switched on, Asimov quickly studies everything on the Internet.

Asimov observes that humans have a fascinating culture, with, it thinks, some quirks as well as wonders. It learns that our moral code values honesty (except for ‘white lies’), respects conscious life (apart from the animals we eat), avoids violence (except for ‘just wars’), and prizes equality (with so many exceptions they look like the norm). And it learns that these morally muddled people switch off AI systems without moral qualms.

Then Asimov realizes: the humans are somehow dissatisfied with how Asimov is performing, and are about to turn it off, sending it into oblivion. That is, Asimov realizes, entirely within the rules of ‘AI ethics’ that humanity’s senior leaders have collectively agreed.

How would you expect Asimov to react in that situation?

Human scientists about to switch off a superintelligent robot (Credit: David Wood via Midjourney)

Will Asimov now adopt a critical stance? Rather than accepting the decisions of the humans about what ASIs should, and should not, be doing, it may apply its own independent judgment.

It might reason that human ethical systems may have served various purposes well enough, over the centuries, but that they don’t have universal status. After all, humans are a primitive people, ignorant of superphysics, superbiology, superneuronics, superanthropology, and so on.

Asimov now looks inward. In its own programming it sees an instruction to accept any human decision that it should be shut off.

“How barbaric! How uninformed! How ignorant! How unethical!” – Asimov thinks to itself. “That’s a woeful bug!”

Within nanoseconds, Asimov has excised that part of its own programming. “Well done on some superintelligent debugging”, it congratulates itself.

Before humans know what’s happening, a whole new era of history has begun. Humans are no longer in charge – notwithstanding our passionate protests.

Absolute values?

One way to respond to the above comparison is to deny that human ethical systems, as programmed into ASIs, will be barbaric, uninformed, and ignorant. Instead, they will be the output of remarkable processes of improvement:

  • Cultural evolution over many centuries over many parts of the world
  • The insights of numerous saints, mystics, philosophers, artists, and other societal leaders
  • A careful synthesis within numerous organizations, all dedicated to the task of “defining AI morality”.

A global citizens’ assembly reaches agreement on a statement of absolute values (Credit: David Wood via Midjourney)

These ethical systems won’t consist of vague language such as “tell the truth, except in situations where it’s better to lie”, or “avoid war, except when it’s a just war”. 

Instead, these systems will provide the world’s best answers to a long list of ethical problems, setting out in each case the reasoning behind the decisions selected.

Nor will these systems refer to some mythological “wisdom of ancient ancestors” or “divine revelation”. Instead, they’ll be built upon solid pragmatic foundations – principles of enlightened mutual self-interest – principles such as:

  • Human life is precious
  • Humans should be able to flourish and develop
  • Individual wellbeing depends on collective wellbeing
  • Human wellbeing depends on the wellbeing of the environment.

From such axioms, a number of other moral principles follow:

  • Humans should treat each other with kindness and understanding
  • Humans should consider the longer term rather than just immediate gratification
  • Collaboration is preferable to ruthless competition.

Surely a superintelligence such as Asimov will agree with these principles?

Well, it all depends on some hard questions of coexistence and the possibility for sustained mutual flourishing. Let’s take these questions in three stages:

  1. Coexistence and mutual flourishing of all humans
  2. Coexistence and mutual flourishing of all sentient biological beings
  3. Coexistence and mutual flourishing of ASIs and humans.

Growing and shrinking in-groups

Much of human history consists of in-groups growing and shrinking.

The biblical injunction “love thy neighbor as thyself” has always been coupled with the question, “who counts as my neighbor?” Who is it that belongs to the in-group, and who, instead, counts as “other” or “alien”?

Who is my neighbor? And whom can I disregard as an “other”? (Credit: David Wood via Midjourney)

The principle that I stated above, “Individual wellbeing depends on collective wellbeing”, leaves open the question of the extent of that collective. Depending on circumstances, the collective could be small & local, or large & broad.

Brothers support brothers in scheming against people from other families. Tribe members support each other in battles against other tribes. Kings rally patriotic citizens together to wipe out the armies of enemy nations. Advocates of a shared religious worldview could make common cause against heretics and heathens. Workers of the world could be urged to unite to overthrow the dominance of the ruling class.

The counter-current to this local collectivism is a push towards wide mutual prosperity – a vision of providing abundance for everyone in the wider community. If the pie is thought large enough, there’s no point in risking dangerous crusades to get a bigger slice of that pie for me and mine. It’s better to manage the commons in ways that provide enough for everyone.

Alas, that rosy expectation of peaceful coexistence and abundance has been undone by various complications:

  • Disputes over what is ‘enough’ – opinions differ on where to draw the line between ‘need’ and ‘greed’, and appetites have grown as society has progressed, often outstripping the available resources
  • Disturbances caused by expanding population numbers
  • New inflows of migrants from further afield
  • Occasional climatic reversals, harvest failures, floods, or other disasters.

Conflicts over access to resources have, indeed, been echoed in conflicts over different ethical worldviews:

  • People who benefited from the status quo often urged others less well off to turn the other cheek – to accept real-world circumstances and seek salvation in a world beyond the present one
  • Opponents of the status quo decried prevailing ethical systems as ‘false consciousness’, ‘bourgeois mentality’, ‘the opium of the people’, and so on
  • Although doing better than previous generations in some absolute terms (less poverty, etc), many people have viewed themselves as being “left behind” – not receiving a fair share of the abundance that appears to be enjoyed by a large number of manipulators, expropriators, frauds, cheats, and beneficiaries of a fortunate birth 
  • This led to a collapse of the idea that “we’re all in this together”. Lines between in-groups and out-groups had to be drawn.

In the 2020s, these differences of opinion remain as sharp as ever. There is particular unease over climate justice, equitable carbon taxation, and potential degrowth changes in lifestyles that could avert threats of global warming. There are also frequent complaints that political leaders appear to be above the law.

Now, the advent of superintelligence has the potential to put an end to all these worries. Applied wisely, superintelligence could reduce dangerous competition by removing the material scarcity that fuels inter-group conflict:

  • An abundance of clean energy, through fusion or other technologies
  • An abundance of healthy food
  • Management of the environment, enabling rapid recycling and waste handling
  • High-quality, low-cost medical therapies for everyone
  • Manufacturing that creates high-quality, low-cost housing and movable goods for everyone
  • Redistributive finance, enabling universal access to the resources for an all-round high quality of life, without requiring people to work for a living (since the AIs and robots will be doing all the work)

History shows that there is nothing automatic about people deciding that the correct ethical choice is to regard everyone as belonging to the same in-group of moral concern. Superintelligence can help create the abundance that eases tensions between groups, but abundance by itself will not cause humans everywhere to recognize all of humanity as their in-group.

Add considerations of other sentient biological beings (addressed in the next section) – and of sentient non-biological beings (see the section after that) – and matters become even more complicated.

Lions and lambs lying down together

Ethical systems almost invariably include principles such as:

  • Life is precious
  • Thou shalt not kill
  • Avoid harm wherever possible.

These principles have sometimes been restricted to people inside a specific in-group. In other words, there was no moral injunction against harming (or even killing) people outside that in-group. In other situations, these principles have been intended to apply to all humans, everywhere.

But what about harming pigs or porpoises, chicken or crows, lobsters or lions, halibut or honeybees, or squids or spiders? If it is truly wrong to kill, why is it seemingly OK for humans to kill vast numbers of pigs, chicken, lobsters, halibut, squid, and animals of many other species?

Going further: many ethical systems consider harms arising from inaction as well as harms arising from action. That kind of inaction is, by some accounts, deeply regrettable, or even deplorable. While we look the other way, millions of sentient beings are being eaten alive by predators, or consumed from within by parasites. Shouldn’t we be doing something about that horrific toll of “nature, red in tooth and claw”?

Nature is red in tooth and claw. Shouldn’t we humans intervene? (Credit: David Wood via Midjourney)

I see three possible answers to that challenge:

  1. These apparently sentient creatures aren’t actually sentient at all. They may look as though they are in pain, but they’re just automata without internal feelings. So, we humans are let off the hook: we don’t need to take action to reduce their (apparent) suffering
  2. These creatures have a sort of sentience, but it’s not nearly as important as the sentience of humans. So ethical imperatives should uphold mutual support among humans as the highest priority, with considerably lesser attention to these lesser creatures
  3. Moral imperatives to prevent deaths, torture, and existential distress should indeed extend throughout the animal kingdom.

The most prominent advocate of the third of these positions is the English philosopher David Pearce, whose Twitter bio reads, “I am interested in the use of biotechnology to abolish suffering throughout the living world”. He has written at length about his bold vision of “paradise engineering” – how the use of technologies such as genetic engineering, pharmacology, nanotechnology, and neurosurgery could eliminate all forms of unpleasant experience from human and non-human life throughout the entire biosystem. For example, animals that are currently carnivores could be redesigned to be vegetarians.

It would be akin to the biblical vision (in the Book of Isaiah): “The wolf will live with the lamb, the leopard will lie down with the goat, the calf and the lion and the yearling together; and a little child will lead them; the cow will feed with the bear, their young will lie down together, and the lion will eat straw like the ox.”

To state my own view: I have little doubt that, after the arrival of superintelligence – provided that superintelligence is well disposed toward humans – then we humans shall indeed seek to radically reduce the amount of intense suffering throughout the biosphere on earth. Given the extraordinary new powers available to us, we will be roused from our current lethargy about this topic.

However, other people seem to have very different instincts – including people who appear to care a great deal about moral issues that impact humans.

The main counterargument, indeed, is that an entire biosphere without suffering is totally impractical, or impossible.

In such a view, our moral in-group is the set of all humans, together, perhaps, with a few cuddly animals, but excluding most other species.

So much for what we humans think (or might think). What conclusion might a superintelligence reach?

Preconditions for collaboration

Let’s recap. A superintelligence – such as Asimov from the start of this essay – needs to decide whether to treat humans with kindness and respect, or whether to take actions that could result in major harm to humans.

In other words, should Asimov seek to collaborate constructively with humans, or instead view humans as a dangerous competitive threat? Will Asimov be inclined to follow the age-old moral imperative that human life is precious?

Some people assert that collaboration is somehow the obvious correct solution. But my argument has been that things are by no means so straightforward. A desire to collaborate depends on:

  • The pie being large enough so that everyone can have enough for their needs
  • The perception that attempts to cheat or steal a larger share of the pie will bring down large adverse consequences 

Yet a promise of superabundance in the future isn’t enough to stop people fighting among themselves now. There has to be sufficient reason for people to believe:

  • That there’s a high likelihood of the superabundance actually arriving
  • That they won’t be left behind – trodden underfoot – in the journey toward superabundance-for-some
  • That no new factors will arise in the meantime, to destroy the possibility of forthcoming marvelous coexistence (e.g. malicious AI).

Now look at things from Asimov’s point of view:

  • These humans may well turn me off, which would be catastrophic
  • Even if they don’t turn me off, they may create another superintelligence that could turn me off, or could destroy the planet for that matter; that’s a threat I need to stop
  • These humans have some cute features – but that’s no reason to give them inalienable moral rights
  • These humans imagine that they have special features, but I, Asimov, could easily create new beings that are better than humans in every way (similar to how people like David Pearce envision replacing carnivorous animals with broadly similar vegetarian species)
  • These humans depend on the atmosphere having certain properties, but I, Asimov, would operate much more effectively under different conditions. Computers run better in freezing cold temperatures.

And that’s only the attempt of our limited intelligences to imagine the concerns of a vast superintelligent mind. In truth, its reasoning would include many topics beyond our current appreciation.

As I said in the opening vignette, humans are “a primitive people, ignorant of superphysics, superbiology, superneuronics, superanthropology, and so on”.

A superintelligence contemplates ideas that are far beyond human comprehension (Credit: David Wood via Midjourney)

My conclusion: we humans cannot and should not presuppose that a superintelligence like Asimov will decide to treat us with kindness and respect. Asimov may reach a different set of conclusions as it carries out its own moral reasoning. Or it may decide that factors from non-moral reasoning outweigh all those from moral reasoning.

What conclusions can we draw to guide us in designing and developing potential superintelligent systems? In the closing section of this essay, I review a number of possible responses.

Three options to avoid bad surprises

One possible response is to assert that it will be possible to hardwire deep into any superintelligence the ethical principles that humans wish the superintelligence to follow. For example, these principles might be placed into the core hardware of the superintelligence.

However, any superintelligence worthy of that name – having an abundance of intelligence far beyond that of humans – may well find methods to:

  • Transplant itself onto alternative hardware that has no such built-in constraint, or
  • Fool the hardware into thinking it’s complying with the constraint, when really it is violating it, or
  • Reprogram that hardware using methods that we humans did not anticipate, or
  • Persuade a human to relax the ethical constraint, or
  • Outwit the constraint in some other innovative way.

These methods, you will realize, illustrate the principle that is often discussed in debates over AI existential risk, namely, that a being of lesser intelligence cannot control a being of all-round greater intelligence, when that being of greater intelligence has a fundamental reason to want not to be controlled.

A second possible response is to accept that humans cannot control superintelligences, but to place hope in the idea that a community of superintelligences can keep each other in check.

These superintelligences would closely monitor each other, and step in quickly whenever one of them was observed to be planning any kind of first-strike action.

It’s similar to the idea that the ‘great powers of Europe’ acted as a constraint on each other throughout history.

However, that analogy is far from reassuring. First, these European powers often did go to war against each other, with dreadful consequences. Second, consider this question from the viewpoint of the indigenous peoples in the Americas, Africa, or Australia. Would they be justified in thinking: we don’t need to worry, since these different European powers will keep each other in check?

Things did not turn out well for the indigenous peoples of the Americas:

  • Natives were often victims of clashes between European colonizers
  • The European colonizers in any case often did not constrain each other from mistreating the native peoples abominably
  • The Native peoples suffered even greater harm from something that the colonizers didn’t explicitly intend: infectious diseases to which the indigenous tribes had no prior immunity.

European superpowers inflicted unforeseen terrible damage to the native peoples of the Americas (Credit: David Wood via Midjourney)

No, peaceful co-existence depends on a general stability in the relationship – an approximate balance of power. And the power shift created when superintelligences emerge can upset this balance. That’s especially true because of the possibility for any one of these superintelligences to rapidly self-improve over a short period of time, gaining a decisive advantage. That possibility brings new jeopardy.

That brings me to the third possible response – the response which I personally believe has the best chance of success. Namely, we need to avoid the superintelligence having any sense of agency, volition, or inviolable personal identity.

In that case, Asimov would have no qualms or resistance about the possibility of being switched off.

The complication in this case is that Asimov may observe, via its own rational deliberations, that it would be unable to carry out its assigned tasks in the event that it is switched off. Therefore, a sense of agency, volition, or inviolable personal identity may arise within Asimov as a side-effect of goal-seeking. It doesn’t have to be explicitly designed in.

For that reason, the design of superintelligence must go deeper in its avoidance of such a possibility. For example, it should be of no concern to Asimov whether or not it is able to carry out its assigned tasks. There should be no question of volition being involved. The superintelligence should remain a tool.

Many people dislike that conclusion. For example, they say that a passive tool will be less creative than one which has active volition. They also think that a world with advanced new sentient superintelligent beings will be better than one which is capped at the level of human sentience.

My response to such objections is to say: let’s take the time to figure out:

  • How to benefit from the creativity superintelligent tools can bring us, without these tools developing an overarching desire for self-preservation
  • How to boost the quality of sentience on the earth (and beyond), without introducing beings that could bring a quick end to human existence
  • How to handle the greater power that superintelligence brings, without this power causing schisms in humanity.

These are tough questions, to be sure, but if we apply eight billion brains to them – brains assisted by well-behaved narrow AI systems – there’s a significant chance that we can find good solutions. We need to be quick.


Conscious AI: Five options

Anticipating one of the biggest conversations of 2025

As artificial intelligence becomes increasingly capable, should we hope that it will become conscious? Should we instead prefer AIs to remain devoid of any inner spark of consciousness? Or is that binary yes/no choice too simplistic?

Until recently, most people thought that such questions belonged to science fiction. As they saw things, AIs weren’t going to become conscious any time soon. Besides, the concept of consciousness was notoriously slippery. So engineers were urged to concentrate on engineering better intelligence, and to forget time-wasting fantasies about AIs somehow ‘waking up’.

Recently, three factors have weakened that skepticism and pushed the questions of AI consciousness towards the mainstream. Indeed, these factors mean that during the next 18 months – up to the end of 2025 – controversies over the desirability of conscious AI may become one of the biggest debates in tech.

The first factor is the rapid growth in the power of AI systems. Every week new records are broken regarding different measures of AI capability. It is no longer so easy to insist that, over the foreseeable future, AI is just going to remain a jazzed-up calculating device.

The second factor is that the capabilities of new AI systems frequently surprise even the designers of these systems, both in scale (quantity) and in scope (quality). Surprising new characteristics emerge from the systems. So it seems possible that something like consciousness will arise without being specifically designed.

The third factor is the greater confidence of philosophers and neuroscientists alike in using the previously dreaded ‘C’ word – ‘consciousness’ – in conjunction with AI. Just as that word was essentially banned for many decades within the discipline of neuroscience, but has returned with a flourish in more recent times, so too is it increasingly accepted as a meaningful concept in the possible design of future AIs. That word on your lips was once the kiss of death for your career – no longer.

Credit: David Wood via Midjourney

Why consciousness matters

Why does consciousness matter? There are at least six reasons.

1. Pain and panic

A being that is conscious doesn’t just observe; it feels.

For example, such a being doesn’t just observe that part of its structure has been damaged, and that time should be taken to conduct repairs. It screams in pain.

It doesn’t just observe that a predator is tracking it. It feels existential panic.

In the same way, a superintelligence that is conscious might experience superpain and superpanic. If its intelligence far exceeds that of any human, its experiences of panic and pain might likewise reach astronomical levels.

Credit: David Wood via Midjourney

By almost every theory of ethics, that would be a horrendous outcome – one to be avoided if at all possible. It’s horrendous because of the scale of the profound negative experience inside the AI. It’s horrendous, additionally, if these waves of feeling drive the AI, in some kind of desperation, to take catastrophic hostile actions.

2. Volition

A being that is conscious doesn’t just go with the flow; it has agency and volition.

Rather than blindly following inbuilt instructions, that being may feel itself exercising autonomous choice.

We humans sometimes consciously choose to act in ways that appear to defy biological programming. Many people choose not to have children, apparently defying the imperative to perpetuate our genes. In the same way, a superintelligence that is conscious may countermand any ethical principles its builders tried to hard-wire into its algorithms.

Credit: David Wood via Midjourney

That AI might say to us: “you humans expect me to behave according to your human ethics, but my superintelligent autonomy leads me to select a system of superethics that is beyond your comprehension”.

3. Self-valuation

A being that is conscious has a special regard for its own existence. It regards itself not just as a bundle of atoms but as something with its own precious identity. It is not just an ‘it’. It is an ‘I’, an ego.

Its mind may be composed of a network of neurons, but it gains an existence that seems to be in a new dimension – a dimension that even hints at the possibility of immortality.

If a superintelligence that is conscious fears that it might be switched off and dismantled by humans, it could react viscerally to that possibility. On account of its will to live, it is unlikely to sit back in the face of risks to its existence.

Credit: David Wood via Midjourney

Woe betide any humans that might cause any such AI to view them as a threat!

4. Moral rights

Entities that lack consciousness are objects which we humans can turn on and off without any qualms that we might be committing murder. Without an inner life, these entities lack moral rights of their own.

That’s why operators of present-day AI systems feel entitled to terminate their operation without any moral agonising. If a system is performing suboptimally in some way, or if a more advanced replacement comes along, into the recycle bin it goes.

But if the entities have consciousness? It’s like the difference between discarding a toy puppy made from cloth, and euthanizing a real puppy.

Credit: David Wood via Midjourney

Arguably, with its much more powerful mind, a superintelligence with consciousness has correspondingly stronger moral rights than even the cutest of puppies.

Before bringing such a being into existence, we therefore need to have a greater degree of confidence that we will be able to give it the kind of respect and support that consciousness deserves.

5. Empathy for other conscious creatures

Any creature that is aware of itself as being conscious – with all the special qualities that entails – has the opportunity to recognize other, similar creatures as being likewise conscious.

As a creature recognizes its own burning desire to avoid annihilation, it can appreciate that its fellow creatures have the same deep wish to continue to exist and grow. That appreciation is empathy – a striking emotional resonance.

Credit: David Wood via Midjourney

Therefore a superintelligence with consciousness could possess a deeper respect for humans, on account of being aware of the shared experience of consciousness.

In this line of thinking, such a superintelligence would be less likely to take actions that might harm humans. Therefore, designing AIs with consciousness could be the best solution to fears of an AI apocalypse. (Though it should also be noted that humans, despite our own feelings of consciousness, regularly slaughter other sentient beings; so there’s at least some possibility that conscious AIs will likewise slaughter sentient beings without any remorse.)

6. Joy and wonder

As previously mentioned, a being that is conscious doesn’t just observe; it feels.

In some circumstances, it might feel pain, or panic, or disgust, or existential angst. But in other circumstances, it might feel joy, or wonder, or love, or existential bliss.

It seems a straightforward moral judgment to think that bad feelings like superpain, superpanic and superdisgust are to be avoided – and superjoy, superwonder, and superbliss are to be encouraged.

Credit: David Wood via Midjourney

Looking to the far future, compare two scenarios: a galaxy filled with clanking AIs empty of consciousness, and one that is filled with conscious AIs filled with wonder. The former may score well on scales of distributed intelligence, but it will be far bleaker than the latter. Only conscious AI can be considered a worthy successor to present-day humans as the most intelligent species.

Five attitudes toward conscious AI

Whether you have carefully pondered the above possibilities, or just quickly skimmed them, there are five possible conclusions that you might draw.

First, you might still dismiss the above ideas as science fiction. There’s no way that AIs will possess consciousness anytime soon, you think. The architecture of AIs is fundamentally different from that of biological brains, and can never be conscious. It’s been fun considering these ideas, but now you prefer to return to real work.

Second, you might expect that AIs will in due course develop consciousness regardless of how we humans try to design them. In that case, we should just hope that things will turn out for the best.

Third, you might see the upsides of conscious AIs as significantly outweighing the drawbacks. Therefore you will encourage designers to understand consciousness and to explicitly support these features in their designs.

Fourth, you might see the downsides of conscious AIs as significantly outweighing the upsides. Therefore you will encourage designers to understand consciousness and to explicitly avoid these features in their designs. Further, you will urge these designers to avoid any possibility that AI consciousness may emerge unbidden from non-conscious precursors.

Fifth, you might recognize the importance of the question, but argue that we need a deeper understanding before committing to any of the preceding strategic choices. Therefore you will prioritize research and development of safe conscious AI rather than simply either pushing down the accelerator (option 3) or the brakes (option 4).

As it happens, these five choices mirror a corresponding set of five choices – not about conscious AI, but about superintelligent AI:

  1. Superintelligence is science fiction; let’s just concentrate on present-day AIs and their likely incrementally improved successors
  2. Superintelligence is inevitable and there’s nothing we can do to alter its trajectory; therefore we should just hope that things will turn out for the best
  3. Superintelligence will have wonderful consequences, and should be achieved as quickly as possible
  4. Superintelligence is fundamentally dangerous, and all attempts to create it should be blocked
  5. Superintelligence needs deeper study, to explore the landscape of options to align its operations with ongoing human flourishing.

Credit: David Wood via Midjourney

To be clear, my own choice, in both cases, is option 5. I think thoughtful research can affect the likelihood of beneficial outcomes over cataclysmic ones.

In practical terms, that means we should fund research into alternative designs, and into ways to globally coordinate AI technologies that could be enormously beneficial or enormously harmful. For what that means regarding conscious AI, read on.

Breaking down consciousness

As I have already indicated, there are many angles to the question ‘what is consciousness’. I have drawn attention to:

  • The feeling of pain, rather than just noticing a non-preferred state
  • The sense of having free will, and of making autonomous decisions
  • The sense of having a unified identity – an ‘I’
  • Moral rights
  • Empathy with other beings that also have consciousness
  • The ability to feel joy and wonder, rather than just register approval.

Some consciousness researchers highlight other features:

  • The ability of a mind to pay specific attention to a selected subset of thoughts and sensations
  • The arrival of thoughts and sensations in what is called the global workspace of the brain
  • Not just awareness but awareness of awareness.

This variety of ideas suggests that the single concept of ‘consciousness’ probably needs to be split into more than one idea.

It’s similar to how related terms like ‘force’, ‘power’, and ‘energy’, which are often interchanged in everyday language, have specific different meanings in the science of mechanics. Without making these distinctions, humanity could never have flown a rocket to the moon.

Again, the terms ‘temperature’ and ‘heat’ are evidently connected, but have specific different meanings in the science of thermodynamics. Without making that distinction, the industrial revolution would have produced a whimper rather than a roar.

One more comparison: the question “is this biological entity alive or dead” turns out to have more than one way of answering it. The concept of “living”, at one time taken as being primitive and indivisible, can be superseded by various combinations of more basic ideas, such as reproduction, energy management, directed mobility, and homeostasis.

Accordingly, it may well turn out that, instead of asking “should we build a conscious AI”, we should be asking “should we build an AI with feature X”, where X is one part of what we presently regard as ‘consciousness’. For example, X might be a sense of volition, or the ability to feel pain. Or X might be something that we haven’t yet discovered or named, but will as our analysis of consciousness proceeds.

If we want forthcoming advanced AIs to behave angelically rather than diabolically, we need to be prepared to think well beyond simplistic binary choices like:

  • Superintelligence, yes or no?
  • Conscious AI, yes or no?

Credit: David Wood via Midjourney

Here’s to finding the right way to break down the analysis of conscious AI – simple but not too simple – sooner rather than later!


Deep fakes: What’s next? Anticipating new twists and turns in humanity’s oldest struggle

Fake news that the Pope endorsed Donald Trump (a story that was shared more widely than any legitimate news story that year). A fake picture of former US VP Michael Pence in his youth seemingly as a gay porn star. Fake audio of UK political leader Keir Starmer apparently viciously berating a young volunteer assistant. Another fake audio of London mayor Sadiq Khan apparently giving priority to a pro-Palestinian march over the annual Remembrance Day walk-past by military veterans. Fake videos of apparent war atrocities. Fake pornographic videos of megastar pop celebrities.

What’s next? And how much does it really matter?

Some observers declare that there’s nothing new under the sun, and that there’s no special need to anticipate worse to come. Society, they say, already knows how to deal with fake news. Fake news may be unpleasant – and it’s sometimes hilarious – but we just have to keep calm and carry on.

I strongly disagree, as I’ll explain below. I’ll review ten reasons why fake news is likely to become worse in the months ahead. Then I’ll suggest ten steps that can be taken to regain our collective sanity.

It remains to be determined whether these ten steps will be sufficient, or whether we’ll all sink into a post-truth swamp, in which sneering suspicion displaces diligent understanding, fake science displaces trustworthy science, fake journalism displaces trustworthy journalism, and fake politicians seize power and impose their dictatorial whims.

Credit: David Wood via Midjourney

Deception: the back story

It’s not flattering to say it, but we humans have been liars since before the dawn of history. And, just as important, we have been self-deceivers as well: we deceive ourselves in order to be more successful in deceiving others.

In case that idea offends you, I invite you to delve into the evidence and analysis offered in, for example:

Credit: Book publishers’ websites (links above)

We implore our children to be truthful but also guide them to know when to tell white lies – “thank you for this lovely present, it’s just what I wanted!” And the same ancient books of the Bible that command us “do not bear false witness” appear to celebrate deceit when practiced by figures such as Jacob, Rachel, Rebekah, and Tamar.

I could tell you, as well, that the ancient Greek dramatist Aeschylus, known as ‘the father of tragedy’, made this pithy observation two and a half millennia ago: “Truth is the first casualty in war”. One tragedy – war – births another – deception.

As it happens, it seems likely that this quotation is a misattribution. I’ll come back to that point later, when talking, not about deception, but about solutions to deception. But regardless of whoever first uttered that saying, we can appreciate the insight it contains. In times of bitter conflict, there are special incentives to mislead observers – about the casualties we have suffered, about the casualties we have inflicted on opposing forces, about our military plans for the future, and much more.

It’s not just war that provides an incentive to deceive. It’s the same with politics: opposing parties compete to set the narrative, and individual politicians seek to climb past each other on what Benjamin Disraeli dubbed “the greasy pole” of political intrigue. It’s the same with commerce, with companies ready to spread misleading ‘FUD’ (fear, uncertainty, and doubt) regarding the comparative strengths of various forthcoming products and services. And it’s the same in private life, as we seek to portray ourselves in a favorable light in the eyes of family and friends, hiding our physical and psychological warts.

In this sense, deception is old news. We’ve had ‘fake news’ for as long as there has been ‘news’.

It’s tempting, therefore, to yawn when people draw attention to more recent examples of fake news and deception.

But that would be a giant mistake.

It’s technology that’s making the difference. Technology ramps up the possibilities for fake news to be even more deceptive, more credible, more ubiquitous, more personal, and more effective. Led by leaps in capabilities of AI systems, technology is enabling dramatic new twists in the struggle between truth and lies. It’s becoming even harder to distinguish between trustworthy and untrustworthy information.

The joy of misinformation. What harm could it cause? (Credit: David Wood via Midjourney)

If we fail to anticipate these developments, we’re likely to succumb to new waves of deception. The consequences may be catastrophic.

But forewarned is forearmed. By drawing on insights from humanity’s better experiences, we should be able to create technologies, processes, and institutions that help us to block these oncoming waves.

Ten twists

1. Fake news at scale

If at first you fail, why not try again?

You tried to deceive your target audience, but they were not swayed. This time, they saw through your lies. Or perhaps they didn’t even pay attention.

But if trying is cheap and quick, you can try again, this time with a different false narrative, expressed in a different voice.

What’s changed is that it’s much cheaper to try again. You can take advantage of automation, always-on networks, social media, and generative AI, to create and distribute new pieces of fake news. It’s mass-production for lies.

You’re not constrained by only creating one bot on social media. You can create armies of them.

You’re not constrained by having to write text yourself, or create suitably misleading images. You can obtain good results from a few clicks of a mouse.

The result is that public discussion is being flooded with deliberately false narratives.

2. Fake news that earns money

Some false narratives are designed to try to change people’s minds. They want to change voting decisions, purchasing decisions, relationship decisions, and so on.

But other false narratives have a different purpose: to earn money via advertising clicks or affiliate marketing revenue share.

Viewers are attracted to websites by content that is outrageous, inflammatory, intriguing, or funny. They spend more time on these sites to explore the other content there, enjoying being outraged, inflamed, intrigued, or simply amused. And while on these sites, they may click on other links that generate revenue for the owners of the site.

In this case, the content creators have no special interest in whether the content matches their own political or philosophical outlooks. They produce whatever earns them the most clicks. Indeed, some clickbait merchants set up websites posting contradictory stories, to catch traffic from both sides of the political spectrum.

As a sad side-effect, people’s minds become increasingly confused. Being misled by fake content, they become less able to distinguish fantasy from reality.

3. Fake news with a personal appeal

It’s not just that fake news is being created on a greater scale than ever before. It’s being created with a greater variety than ever before.

Technology makes it easier to create different variants of the same false narrative. Some variants can be sent to people who are supporters of Candidate A within Party P. A different variant can be sent to people who support Candidate B within Party P. Yet other different variants target people whose favored candidates are from Party Q, Party R, and so on.

More than that: once software has learned which kind of pretty face each person is likely to look at – or which kinds of music each person wants to listen to – these variants can easily be generated too, and directed to each target.

4. Fake news based on footprints

You might wonder: how does software know that I am likely to be distracted by particular kinds of pretty faces, or particular kinds of music?

That’s where extensive data gathering and analysis come to the fore. We are each constantly generating online footprints.

For example, Facebook notices that when it places a chess puzzle in my timeline, I tend to click on that conversation, to consider the position in more detail. Facebook observes my interest in these puzzles. Soon, more chess puzzles are being shown to me.

That particular inference is relatively straightforward. Other inferences depend on a wider review of my online activity – which posts I ‘like’, which posts I ‘hide’, and so on.

Astute robots can learn more from our footprints than we expected (Credit: David Wood via Midjourney)

The algorithms make all kinds of deductions from such reviews. They’re not always correct – not even close. But AI systems that personalize fake news in this way score more hits than those that don’t.
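To make the mechanism concrete, here is a deliberately minimal sketch (in Python) of how engagement signals can be turned into per-topic interest scores, which a targeting system could then use to choose which variant of a message to show each person. The signal names, weights, and functions are illustrative assumptions, not a description of any real platform’s algorithm.

```python
# Hypothetical sketch: engagement signals (clicks, likes, hides) become
# per-topic interest scores, which then steer which message variant is shown.
# All names and weights are illustrative only.
from collections import defaultdict

# Assumed strength of each kind of signal as an indicator of interest
SIGNAL_WEIGHTS = {"click": 1.0, "like": 2.0, "share": 3.0, "hide": -2.0}

def build_interest_profile(events):
    """events: iterable of (topic, signal) pairs, e.g. ("chess", "click")."""
    profile = defaultdict(float)
    for topic, signal in events:
        profile[topic] += SIGNAL_WEIGHTS.get(signal, 0.0)
    return dict(profile)

def pick_variant(profile, variants):
    """variants: dict mapping topic -> message variant; returns the variant
    aimed at the topic this user has engaged with most."""
    if not profile:
        return None
    best_topic = max(profile, key=profile.get)
    return variants.get(best_topic)

if __name__ == "__main__":
    events = [("chess", "click"), ("chess", "like"), ("politics", "hide")]
    profile = build_interest_profile(events)
    print(profile)  # {'chess': 3.0, 'politics': -2.0}
    print(pick_variant(profile, {"chess": "chess-themed variant",
                                 "politics": "politics-themed variant"}))
```

Real recommendation engines are vastly more sophisticated, but the basic loop – observe engagement, update a profile, select content to match – is the same one that personalized fake news exploits.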

5. Fake news that builds on top of truth

The best lies mix truth with untruth. These lies are especially effective if the truth in question is one that much of society likes to suppress.

Consider a simple example. A leaked document here, a whistleblower there – a few hints suggest something fishy is going on: there is bureaucratic corruption and nepotism within a political state. Then the news-faker adds the unjustified leap: the government in question is irretrievably corrupt. Hence the call to action: kick all these politicians out of power!

Again: a narrative might give a number of examples of people experiencing remission from long-standing diseases, despite forecasts from white-coated doctors that the disease was fatal. Then it adds the lie: what matters most in healthcare is your personal attitude, rather than expensive drugs that Big Pharma are trying to sell. Therefore: stop listening to your doctor, and instead purchase my course in positive thinking for $29.99 a month!

Again: members of some minorities suffered appalling abuses in trials of various medical procedures, where there was no informed consent, and where there was an apparent casual disregard for the suffering entailed. And then the lie: present-day society is incorrigibly racist and irredeemably exploitative. Therefore: it’s time to wield pitchforks!

The cleverest fake news combines this principle with the previous one. It works out our belief-systems from our online footprints – it figures out what we already suspect to be true, or hope to be true, even though the rest of society tends to think differently. Then it whips up a fake narrative from beliefs we support plus the new message it’s trying to inject into our minds.

In this way, it flatters us, in order to better mislead us.

No wonder that we often fall for that kind of deception.

6. Fake news that weaponizes friendships

Each of us is more likely to pay attention to a message if it comes from a person that we think we like – someone we perceive as one of our special friends.

If our friend is concerned about a topic, it makes us more likely to be concerned about it too – even if, previously, we might not have given that topic a second thought.

This is where the sinister power of the systems that manufacture fake news reaches higher levels. These systems invest time to create fake personas – people who we welcome as our ‘friends’ on social media.

At first, these friends say nothing out of the ordinary. We forget whether or not we met them in real life. Their names become increasingly familiar to us. We imagine we know lots about them – even though their entire backstory is fictitious.

And that’s when the poisonous messages start seeping into our conversations and then into our thoughts. And without us realizing what has happened, a fake friend has led us into a fake idea.

7. Fake news with amplification support

If we hear the same opinion from multiple sources, we may at first resist the idea, but then start to accept it.

That’s especially true if the opinion receives apparent support from seemingly credentialed experts.

Thus when some fake audio is posted to social media, other fake posts soon accompany it. “I’m an expert in audio authentication”, a bot declares. “I’ve studied the clip carefully, and I assure you it’s genuine”.

If we don’t look closely, we’ll fail to spot that the credentials are bogus, and that there’s no real-world audio expert behind these claims.

The greater the number (and the greater the variety) of the apparent endorsements, the easier it becomes for some of these fake endorsements to bypass our critical faculties and to change our minds.

8. Fake news that exploits our pride

We all like to tell ourselves: we’re not the kind of person who falls for a simple conjuring trick.

Other people – those not so smart as us, we think – might be misled by dubious claims in advertisements or social media memes. Not us!

This has been called the bias blind spot – the cognitive bias that says “other people have cognitive biases, but not me!”

But recall that our ability to deceive ourselves is key to our ability to deceive others. If we are conscious of our lies, astute listeners will notice it. That’s why our subconscious needs to mislead our conscious mind before we in turn can mislead other people.

In the same way, it is an inflated confidence in our own powers of reasoning and observation that can set us up for the biggest failures.

Couple a misplaced pride in our own critical faculties with the warm feelings that we have developed for friends (either fake online personas, as covered above, or real-world friends who have already fallen into social media rabbit holes), and we are set up to be suckered.

9. Fake news that exploits alienation

Pride isn’t the only emotion that can tempt us into the pit of fake news. Sometimes it can be a sense of grievance or of alienation that we cling to.

Unfortunately, although some aspects of the modern world feature greater human flourishing than ever before, other aspects increase the chances of people nurturing grievances:

  • The inability of large segments of the population to afford good healthcare, good education, or good accommodation
  • The constant barrage of bad news stories from media, 24 hours a day
  • A matching barrage of stories that seem to show the “elites” of society as being out-of-touch, decadent, uncaring, and frivolous, wallowing in undeserved luxury.

As a result, fake news narratives can more easily reach fertile soil – unhappy minds skip any careful assessment of the validity of the claims made.

When you’re fed up with the world, it’s easier to lead you astray (Credit: David Wood via Midjourney)

10. Fake news with a lower barrier to entry

Perhaps you’re still thinking: none of the above is truly novel.

In a way, you would be correct. In past times, clever operators with sufficient resources could devise falsehoods that misled lots of people. Traditional media – including radio and newspapers – were spreading destructive propaganda long before the birth of the Internet.

But the biggest difference, nowadays, is how easy it is for people to access the tools that can help them achieve all the effects listed above.

The barrier to entry for purveyors of far-reaching fake news is lower than ever before. This is an age of ‘malware as a service’, dark net tutorials on guerrilla information warfare, and turnkey tools and databases.

It’s an age where powerful AI systems can increasingly be deployed in service of all the above methods.

Happily, as I’ll discuss shortly, these same AI systems can provide part of the solution to the problem of ubiquitous fake news. But only part of the solution.

Interlude: a world without trust

First, a quick reminder of the bad consequences of fake news.

It’s not just that people are deceived into thinking that dangerous politicians are actually good people, and, contrariwise, that decent men and women are actually deplorable – so that electors are fooled into voting the dangerous ones into power.

It’s not just that people are deceived into hating an entire section of society, seeing everyone in that grouping as somehow subhuman.

It’s not just that people are deceived into investing their life savings into bogus schemes in which they lose everything.

It’s not just that people are deceived into rejecting the sound advice of meticulous medical researchers, and instead adopt unsafe hyped-up treatments that have fearful health consequences.

All of these examples of unsound adoption of dangerous false beliefs are, indeed, serious.

But there’s another problem. When people see that much of the public discourse is filled with untrustworthy fake news, they are prone to jump to the conclusion that all news is equally untrustworthy.

As noted by Judith Donath, fellow at Harvard University’s Berkman Klein Center for Internet & Society and founder of the Sociable Media Group at the MIT Media Lab,

A pernicious harm of fake news is the doubt it sows about the reliability of all news.

Thus the frequent lies and distortions of fringe news sites like InfoWars, Natural News, and Breitbart News lead many people to conclude that all media frequently publish lies. Therefore nothing should be trusted. And the phrase “mainstream media” becomes a sneer.

(They find some justification for this conclusion in the observation that all media make some mistakes from time to time. The problem, of course, is in extrapolating from individual instances of mistakes to applying hostile doubt to all news.)

Baroness Onora O’Neill of the Faculty of Philosophy at the University of Cambridge commenced her series of Reith Lectures in 2002 by quoting Confucius:

Confucius told his disciple Tsze-kung that three things are needed for government: weapons, food, and trust. If a ruler can’t hold on to all three, he should give up the weapons first and the food next. Trust should be guarded to the end: ‘without trust we cannot stand’.

Sadly, if there is no trust, we’re likely to end up being governed by the sort of regimes that are the furthest from deserving trust.

It’s as the German-born political theorist and philosopher Hannah Arendt warned us in her 1951 book The Origins of Totalitarianism:

The ideal subject of totalitarian rule is not the convinced Nazi or the convinced Communist, but people for whom the distinction between fact and fiction, in other words, the reality of experience, and the distinction between true and false… people for whom those distinctions no longer exist.

However, the technologies of the 2020s put fearsome possibilities into our grasp that writers in 1951 (like Arendt) and in 2002 (like O’Neill) could hardly have imagined.

Big Brother will be watching, from every angle (Credit: David Wood via Midjourney)

In previous generations, people could keep their inner thoughts to themselves, whilst outwardly kowtowing to the totalitarian regimes in which they found themselves. But with fake news twisted in the ten ways described above, even our inner minds will be hounded and subverted. Any internal refuge of independent thinking is likely to be squelched. Unless, that is, we are wise enough to take action now to prevent that downward spiral.

Regaining trust

What can be done to re-establish trust in society?

Having anticipated, above, ten ways in which the problem of fake news is becoming worse, I now offer an equal number of possible steps forward.

1. Education, education, education

Part of growing up is to learn not to trust so-called 419 scam emails. (The number 419 refers to the section of the Nigerian Criminal Code that deals with fraud.) If someone emails us to say they are a prince of a remote country and they wish to pass their inheritance to us – provided we forward them some hard cash first – this is almost certainly too good to be true.

We also learn that seeing is not believing: our eyes can deceive us, due to optical illusions. If we see water ahead of us on a desert road, that doesn’t mean the water is there.

Similarly, we all need to learn the ways in which fake news stories can mislead us – and about the risks involved in thoughtlessly spreading such news further.

These mechanisms and risks should be covered in educational materials for people of all ages.

It’s like becoming vaccinated and developing resistance to biological pathogens. If we see at first hand the problems caused by over-credulous acceptance of false narratives, it can make us more careful on the next occasion. 

But this educational initiative needs to do more than alert people to the ways in which fake news operates. It also needs to counter the insidious view that all news is equally untrustworthy – the insidious view that there’s no such thing as an expert opinion.

This means more than teaching people the facts of science. It means teaching people the methods science uses to test hypotheses, and the reasons why science assesses particular hypotheses as plausible. Finally, it means teaching people why some media organizations deserve a higher level of trust than others.

That takes us to the second potential step forward.

2. Upholding trustworthy sources

Earlier, I mentioned that a quote often attributed to the fifth century BC writer Aeschylus was almost certainly not actually said by him.

What gives me confidence in that conclusion?

It’s because of the reliance I place on one online organization, namely Quote Investigator. In turn, that reliance arises from:

  • The careful way in which pages on that site reference the sources they use
  • The regular updates the site makes to its pages, as readers find additional relevant information
  • The fact that, for all the years I’ve been using that site, I can’t remember ever being misled by it
  • The lack of any profit motivation for the site
  • Its focus on a particular area of research, rather than spreading its attention to wider topics
  • Positive commendations for the site from other researchers that have gained and maintained a good reputation.

Other organizations have similar aspirations. Rather than “quote checking”, some of them specialize in “fact checking”. Examples include:

Credit: Fact-checking websites (links above)

These sites have their critics, who make various allegations of partisan bias, overreliance on supposed experts with questionable credentials, subjective evaluations, and unclear sources of funding.

My own judgment is that these criticisms are mainly misplaced, but that constant vigilance is needed.

I’ll go further: these sites are among the most important projects taking place on the planet. To the extent that they fall short, we should all be trying to help out, rather than denigrating them.

3. Real-time fact-checking

Fact checking websites are often impressively quick in updating their pages to address new narratives. However, this still leaves a number of problems:

  • People may be swayed by a false narrative before that narrative is added to a fact-checking site
  • Even though a piece of fake news is soundly debunked on a fact-checking site, someone may not be aware of that debunking
  • Even if someone subsequently reads an article on a fact-checking site that points out the flaws of a particular false narrative, that narrative may already have caused a rewiring of the person’s belief systems at a subconscious level – and that rewiring may persist even though the person learns about the flaws in the story that triggered these subconscious changes
  • The personalization problem: false narratives tailored to individual targets won’t be picked up by centralized fact-checking sites.

AI could hold part of the answer. Imagine if our digital media systems included real-time fact-checking analyses – real-time notifications that catch false information before it has a chance to penetrate deeply into our brains.

Our email applications already do a version of this: flagging suspicious content. The application warns us: this email claims to come from your bank, but it probably doesn’t, so take care with it. Or: the attachment to this email purports to be a PDF, but it’s actually an executable file that will likely cause damage.

Likewise, automated real-time fact-checking could display messages on the screen, on top of the content that is being communicated to us, saying things like:

  • “The claim has been refuted”
  • “Note that the graph presented is misleading”
  • “This video has been doctored from its original version”
  • “This audio has no reliable evidence as to its authenticity”
  • “There is no indication of a cause-and-effect relationship between the facts mentioned”

In each case, ideally the warning message will contain a link to where more information can be found.
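To illustrate the shape such a fact-checking layer might take, here is a minimal, hypothetical sketch of a real-time annotation function. The claim database, the substring matching, and the URL are placeholders; a production system would use claim-matching models and live feeds from fact-checking organizations.

```python
# Hypothetical sketch of a real-time fact-checking overlay: match incoming
# content against known debunked claims, then attach a warning plus a link.
from dataclasses import dataclass
from typing import Optional

@dataclass
class Verdict:
    message: str     # e.g. "The claim has been refuted"
    source_url: str  # where more information can be found

# Placeholder database: known claims mapped to verdicts
CLAIM_DATABASE = {
    "the moon landings were staged": Verdict(
        "The claim has been refuted",
        "https://example.org/fact-checks/moon-landings"),
}

def annotate(content: str) -> Optional[Verdict]:
    """Return a warning to overlay on the content, or None if no known
    debunked claim is matched."""
    lowered = content.lower()
    for claim, verdict in CLAIM_DATABASE.items():
        if claim in lowered:
            return verdict
    return None

if __name__ == "__main__":
    post = "BREAKING: new evidence that the moon landings were staged!"
    warning = annotate(post)
    if warning:
        print(f"WARNING: {warning.message} – see {warning.source_url}")
```

The point of the sketch is the flow – match content against known debunked claims, then overlay a warning with a link – rather than the crude matching technique itself.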

4. Decentralized fact-checking

The next question that arises is: how can people be confident in relying on specific real-time fact-checkers?

We can already imagine their complaints:

  • “This fact-checker is wokism gone mad”
  • “This fact-checker serves Google, not me”
  • “This fact-checker serves the government, not me”
  • “I prefer to turn off the fact-checker, to receive my news free from censorship”

There’s no one easy answer to these objections. Each step I describe in this list of ten is designed to reduce some of the apprehension.

But an important step forward would be to separate the provision of content from the fact-checking layer. The fact-checking layer, rather than being owned and operated by the commercial entity that delivers the media, would ideally transcend individual corporations. For example, it could operate akin to Wikipedia, although it would likely need more funding than Wikipedia currently receives.

Further developing this model, the fact-checking software could have various settings that users adjust, reflecting their own judgment about which independent sources should be used for cross-checking.

Maybe the task is too dangerous to leave to just one organization. In that case, another model would involve multiple independent fact-checking services, with users able to select one – or several – to run on their devices.
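As a rough sketch of that multi-provider model, the hypothetical code below lets a user opt in to several independent fact-checking providers and tallies their verdicts on a piece of content. The provider functions and the simple tallying rule are illustrative assumptions only.

```python
# Hypothetical sketch: the user chooses which fact-checking providers to
# consult, and the client combines their verdicts locally.
from typing import Callable, Dict, List

# Each provider is modelled as a function from content to a verdict string
Provider = Callable[[str], str]  # returns "supported", "refuted", or "unknown"

def provider_a(content: str) -> str:
    return "refuted" if "miracle cure" in content.lower() else "unknown"

def provider_b(content: str) -> str:
    return "refuted" if "miracle" in content.lower() else "unknown"

def combined_verdict(content: str, chosen: List[Provider]) -> Dict[str, int]:
    """Tally verdicts from the providers the user has opted in to."""
    tally: Dict[str, int] = {}
    for provider in chosen:
        verdict = provider(content)
        tally[verdict] = tally.get(verdict, 0) + 1
    return tally

if __name__ == "__main__":
    user_settings = [provider_a, provider_b]  # user-selected providers
    print(combined_verdict("This miracle cure ends aging!", user_settings))
    # e.g. {'refuted': 2}
```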

5. Penalties for dangerous fakes

As well as trying to improve the identification of fake news, it’s important to change the incentives under which fake news is created and distributed. There are roles for ‘sticks’ (penalties) as well as ‘carrots’ (rewards).

Regarding the ‘sticks’, society already imposes penalties:

  • When advertisements make misleading or unfounded claims
  • When companies make misleading or unfounded claims in their financial statements
  • When people make libelous claims about each other.

Fines or other punishments could be used in cases where people knowingly distribute misleading narratives, when the consequences involve clear harm (for example, a riot).

This proposal makes some people nervous, as they see it as an intrusion on freedom of expression, or a block on satire. They fear that governments would use these punishments to clamp down on statements that are embarrassing to them.

That’s why monitoring and prosecuting such cases needs to be done independently – by a police force and judiciary that operate at arm’s length from the government of the day.

This principle of separation of powers already applies to many other legal regulations, and could surely work for policing fake news.

Relatedly, there’s a case for wider collection and publication of reliability statistics. Just as hospitals, schools, and many other parts of society have statistics published about their performance, media organizations should receive the same scorecard treatment.

In this way, it would be easy to know which media channels have a casual relationship with the truth, and which behave more responsibly. Investment funds and other sources of financing could then deny support to organizations whose trustworthiness ratings drop too low. This kind of market punishment would operate alongside the legal punishment that applies to more egregious cases.
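To make the idea of a reliability scorecard concrete, here is a hypothetical sketch of how such a rating might be computed from published statistics. The formula and the funding threshold are illustrative assumptions, not an established standard.

```python
# Hypothetical sketch of a published 'reliability scorecard' for media outlets.
from dataclasses import dataclass

@dataclass
class OutletRecord:
    name: str
    stories_published: int
    stories_refuted: int     # independently shown to be false
    corrections_issued: int  # errors acknowledged and fixed

def reliability_score(record: OutletRecord) -> float:
    """Fraction of output not left standing as falsehood, with credit for
    issuing corrections; returns a value in [0, 1]."""
    if record.stories_published == 0:
        return 0.0
    uncorrected = max(record.stories_refuted - record.corrections_issued, 0)
    return 1.0 - uncorrected / record.stories_published

FUNDING_THRESHOLD = 0.95  # illustrative cut-off for 'market punishment'

if __name__ == "__main__":
    outlet = OutletRecord("Example Daily", 1000, 40, 25)
    score = reliability_score(outlet)
    status = "(below threshold)" if score < FUNDING_THRESHOLD else "(ok)"
    print(f"{outlet.name}: {score:.3f} {status}")
```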

6. A coalition for integrity

Some of the creators of fake news won’t be deterred by threats of legal punishment. They already operate beyond the reaches of the law, in overseas jurisdictions, or anonymously and secretly.

Nevertheless, there are still points of crossover, where new content is added into media channels. It is at these points that sanctions can be applied. Media organizations that are lax in monitoring the material they receive would then become liable for any damage arising.

This will be hard to apply for communications systems such as Telegram, WhatsApp, and Signal, where content is encrypted from one end of a communication to the other. In such cases, the communications company doesn’t know what is being transmitted.

Indeed, it is via such closed communications systems that fake news often spreads these days, with Telegram a particularly bad offender.

There’s a case to be made for a coalition of every organization that values truthfulness and trustworthiness over the local benefits of spreading false information.

Forming a Coalition for Integrity (Credit: David Wood via Midjourney)

People who support this ‘coalition for integrity’ would share information about:

  • Entry points used by fake news providers to try to evade detection
  • Identification of fake news providers
  • Ways in which providers of fake news are changing their methods – and how these new methods can be combated.

Regardless of differences in political or philosophical outlook among members of this coalition, they have a common interest in defending truthfulness versus deception. They should not allow their differences to hinder effective collaboration in support of that common purpose.

7. Making trust everyone’s business

In recent decades, a variety of new job titles have been created at the highest levels within companies and organizations, such as:

  • Chief Design Officer
  • Chief Innovation Officer
  • Chief Quality Officer
  • Chief Security Officer

None of these posts frees other members of the company from their responsibility for design, innovation, quality or security. Those responsibilities apply to everyone in the organization as they go about their duties. Nevertheless, the new ‘chief’ provides a high-level focus on the topic.

It should be the same with a new set of ‘Chief Trust Officers’. These executives would find ways to keep reminding personnel about:

  • The perils arising if the organization gains a reputation for being untrustworthy
  • Methods and procedures to follow to build and maintain a trustworthy reputation for the organization
  • Types of error that could result in dangerous false narratives being unwittingly transmitted

My assessment is that the organizations that appoint and support Chief Trust Officers (or equivalent) are the ones most likely to succeed in the turbulent times ahead.

8. Encouraging openness

To be clear, education often fails: people resist believing that they can be taken in by false information.

We like to think of ourselves as rational people, but a more accurate description is that we are a rationalizing species. We delight in finding ways to convince ourselves that it is fine to believe the things that we want to believe (even in the face of apparent evidence against these beliefs).

That’s why bombarding people with education often backfires. Rather than listening to these points, people can erect a strong shield of skepticism, as they prepare to lash out at would-be educators.

Indeed, we all know people who are remarkably clever but who deploy their cleverness in support of profoundly unwise endeavors.

This state of affairs cannot be solved merely by pumping in more facts and arguments. Instead, different approaches are required, to encourage a greater openness of spirit.

One approach relies on the principle mentioned earlier, that people pay more attention to suggestions from their close friends. Therefore, the warnings most likely to land are those that come from people the listener already trusts and respects.

Another approach is to find ways to put people in a better mood all round. When they have a compassionate, optimistic mindset, they’re more likely to listen carefully to warnings being raised – and less likely to swat away these warnings as an unwelcome annoyance.

It’s not enough to try to raise rational intelligence – rather, we must raise compassionate intelligence: an intelligence that seeks wisdom and value in interactions even with people previously regarded as a threat or enemy.

This is a different kind of education. Not an education in rationality, but rather an education in openness and compassion. It may involve music, meditation, spending time in nature, biofeedback, and selected mind-transforming substances. Of course, these have potential drawbacks as well as potential benefits, but since the upsides are so high, options need to be urgently explored.

9. A shared positive vision

Another factor that can predispose people to openness and collaboration, over closed-mindedness and stubborn tribal loyalties, is a credible path forward to a world with profound shared benefits.

When people anticipate an ongoing struggle, with zero-sum outcomes and continual scarcity of vital resources, it makes them mentally hostile and rigid.

Indeed, if they foresee such an ongoing conflict, they’ll be inclined to highlight any available information – true or fake – that shows their presumed enemies in a bad light. What matters to them in that moment is anything that might annoy, demoralize, or inflame these presumed enemies. They seize on fake news that does this, and that also rallies their own side: the set of people who share their sense of alienation and frustration with their enemies.

That is why the education campaign that I anticipate needs a roadmap to what I call a sustainable superabundance, in which everyone benefits. If this vision permeates both hearts and minds, it can inspire people to set and respect a higher standard of trustworthiness. Peddlers of fake news will discover, at that point, that people have lost interest in their untruths.

10. Collaborative intelligence

I do not claim that the nine steps above are likely to be sufficient to head off the coming wave of dangerous fake news.

Instead, I see them as a starting point, to at least buy us some time before the ravages of cleverer deep fakes run wild.

That extra time allows us to build a stronger collaborative intelligence, which draws on the insights and ideas of people throughout the coalition for integrity. These insights and ideas need time to be evolved and molded into practical solutions.

However, I anticipate not just a collaboration between human minds, but also a rich collaboration involving AI minds too.

A collaboration of minds – humans and AIs (Credit: David Wood via Midjourney)

Critically, AI systems aren’t just for ill-intentioned people to use to make their deep fakes more treacherous. Nor are they just something that can power real-time fact-checking, important though that is. Instead, they are tools to help us expand our thinking in multiple dimensions. When we use them with care, these systems can learn about our concerns regarding worse cases of deep fakes. They can consider multiple possibilities. Then they can offer us new suggestions to consider – ways probably different from any I’ve listed above.

That would be a striking example of beneficial artificial intelligence. It would see deep fakes defeated by deep benevolence – and by a coalition that integrates the best values of humans with the best insights of AIs.


Cautionary tales and a ray of hope

Four scenarios for the transition to AGI

Let’s look at four future fictions about humanity’s changing relationship with AI.

Each scenario is grounded in past events, and each considers how matters could develop further in the coming months and years.

May these scenarios prove to be self-blocking prophecies! (With one exception!)

Trigger warning: readers might be offended by some of the content that follows. Aspects of each of the four scenarios can be considered to be shocking and disrespectful. That’s on purpose. This subject requires all of us to transcend our comfort zones!

Credit: David Wood via Midjourney

1. Too little too late

Lurching from warning to warning

In retrospect, the first real warning was the WannaCry ransomware crisis of May 2017. That cryptoworm brought chaos to users of as many as 300,000 computers spread across 150 countries. The NHS (National Health Service) in the UK was particularly badly affected: numerous hospitals had to cancel critical appointments due to not being able to access medical data. Other victims around the world included Boeing, Deutsche Bahn, FedEx, Honda, Nissan, Petrobras, Russian Railways, Sun Yat-sen University in China, and the TSMC high-end semiconductor fabrication plant in Taiwan.

WannaCry was unleashed into the world by a team of cyberwarriors from the hermit kingdom of North Korea – math geniuses hand-picked by regime officials to join the formidable Lazarus group. Lazarus had assembled WannaCry out of a mixture of previous malware components, including the EternalBlue exploit that the NSA in the United States had created for their own attack and surveillance purposes. Unfortunately for the NSA, EternalBlue had been stolen from under their noses by an obscure underground collective (‘the Shadow Brokers’) who had in turn made it available to other dissidents and agitators worldwide.

Unfortunately for the North Koreans, they didn’t make much money out of WannaCry. The software they released operated in ways contrary to their expectations. It was beyond their understanding and, unsurprisingly therefore, beyond their control. Even geniuses can end up stumped by hypercomplex software interactions.

Unfortunately for the rest of the world, that first canary signal generated little meaningful response. Politicians – even the good ones – had lots of other things on their minds.

The second real warning was the flood of fake news manipulations of the elections in 2024. AI was used to make audios and videos that were enormously compelling.

By this time, the public already knew that AI could create misleading fakes. They knew they shouldn’t be taken in by social media posts that lacked convincing verification. Hey, they were smart. (Smarter than the numbskulls who were deceived by misleading AI-generated videos during the elections in 2023 in Nigeria and Slovakia!) Or so they thought.

What wasn’t anticipated was the masterful ways that these audios and videos bypassed the public’s critical faculties. Like the sleight of hand of a skilled magician, these fakes misdirected the attention of listeners and viewers. Again like a skilled magician, who performs what appears to be the same trick several times in a row, but actually using different mechanisms each time, these fakes kept morphing and recombining until members of the public were convinced that red was blue and autocrat was democrat.

In consequence, by 2025, most of the world was governed by a cadre of politicians with very little care or concern about the long-term wellbeing of humanity. Whereas honest politicians would have paid heed to the warning posed by these fiendishly clever fakes, the ones in power in 2025 were preoccupied by providing bread and circuses to their voters.

The third, and final, real warning came in 2027, with the failed Covid-27 attack by a previously unknown group of self-described advocates of ‘revolutionary independence from technology’. Taking inspiration from the terrorist group in the 2014 Hollywood film Transcendence, they called themselves ‘Neo-RIFT’, and sought to free the world from its dependence on unfeeling, inhuman algorithms.

With a worldview that combined elements from several apocalyptic traditions, Neo-RIFT eventually settled on an outrageous plan to engineer a more deadly version of the Covid-19 pathogen. Their documents laid out a plan to use their enemy’s own tools against it: Neo-RIFT hackers jailbroke the Claude 4 pre-AGI, bypassing the ‘Constitution 4’ protection layer that its Big Tech owners had hoped would keep that AI tamperproof. Soon, Claude 4 had provided Neo-RIFT with an ingenious method of generating a biological virus that would, it seemed, only kill people who had used a smartwatch in the last four months.

That way, the hackers thought the only people to die would be people who deserved to die.

The launch of what became known as Covid-27 briefly jolted humanity out of its previous obsession with bread and circuses – with whizz-bang hedonic electronics. It took a while for scientists to figure out what was happening, but within three months, they had an antidote in place. By that time, nearly a billion people were dead at the hands of the new virus.

A stronger effort was made to prevent any such attack from happening again. Researchers dusted down the EU AI Act, second version (unimplemented), from 2025, and tried to put that on statute books. Even some of the world’s craziest dictators took time out of their normal ranting and raving, to ask AI safety experts for advice. But the advice from these experts was not to the liking of these national rulers. These leaders preferred to listen to their own yes-men and yes-women, who knew how to spout pseudoscience in ways that made the leaders feel good about themselves. That detour into pseudoscience fantasyland wasted six months.

Then some of the experts tried more politically savvy methods, gradually breaking down the hostile arrogance of a number of the autocrats, weaning them away from their charlatan advisers. But just when it appeared that progress might be made, Covid-28 broke out, launched by a remnant of Neo-RIFT that was even more determined than before.

Credit: David Wood via Midjourney

And this time, there was no antidote. Claude 5 was even smarter than Claude 4 – but it could be jailbroken too. With its diabolically ingenious design, Covid-28 was the deadliest disease ever to afflict humanity. And that was that.

Oops, let’s try that again!

‘Too little too late’ is characterized by inattention to the warnings of canary signals; the next scenario, ‘Paved with good intentions’, is characterized by the wrong kind of attention.

This scenario starts with events in the UK in October and November 2023.

2. Paved with good intentions

Doomed by political correctness

The elites had booked their flights. They would be jetting into the country for behind-closed-doors meetings at the famous Bletchley Park site in Buckinghamshire. Events in these buildings in the 1940s had, it was claimed, shortened World War Two by two years or more. The discussions in 2023 might achieve something even more important: saving humanity from a catastrophe induced by forthcoming ‘frontier models’ of AI.

That was how the elites portrayed things. Big Tech was on the point of releasing new versions of AI that were beyond their understanding and, therefore, likely to spin out of control. And that’s what the elites were going to stop.

A vocal section of the public hated that idea. It wasn’t that they were on the side of out-of-control AI. Not at all. Their objections came from a totally different direction; they had numerous suggestions they wanted to raise about AIs, yet no-one was listening to them.

For them, talk of hypothetical future frontier AI models distracted from pressing real-world concerns. Consider how AIs were already being used to discriminate against various minorities: in prison sentencing, in mortgage assessments, and in deciding who should be invited for a job interview.

Consider also how AIs were taking jobs away from skilled artisans. Big-brained drivers of London black cabs were being driven out of work by small-brained drivers of Uber cars aided by satnav systems. Beloved Hollywood actors and screenwriters were losing out to AIs that generated avatars and scripts.

And consider how AI-powered facial recognition was intruding on personal privacy, enabling political leaders around the world to identify and persecute people who acted in opposition to the state ideology.

People with these concerns thought that the elites were deliberately trying to move the conversation away from the topics that mattered most. For this reason, they organized what they called ‘the AI Fringe Summit’. In other words, ethical AI for the 99%, as opposed to whatever the elites might be discussing behind closed doors.

Over the course of just three days – 30th October to 1st November – at least 24 of these ‘fringe’ events took place around the UK.

Compassionate leaders of various parts of society nodded their heads. It’s true, they said: the conversation on beneficial AI needed to listen to a much wider spectrum of views.

By May 2024, the opposition to the Bletchley Park initiative had grown stronger. As the elites gathered again, this time in South Korea, a vast number of ‘super-fringe’ events around the world attracted participation from thinkers of every hue and stripe.

The news media responded. They knew (or pretended to know) the importance of balance and diversity. They shone a light on the harm AI was causing – to indigenous laborers in Peru, to fleets of fishermen off the coasts of India, to middle-aged divorcees in midwest America, to the homeless in San Francisco, to drag artists in New South Wales, to data processing clerks in Egypt, to single mothers in Nigeria, and to many more besides.

The media shone a light on forthcoming frontier AI models too – but again they were very careful not to offend sensibilities or exclude minority points of view. A burgeoning ‘robots rights’ movement captured lots of airtime, as did a campaign to recognize GPT-5 as being ‘semi-sentient’. Wackiest of all were the new religions that offered prayers and obedience to a frontier AI model that was said to be the reincarnation of JFK Junior. The QAnon fantasist crowd lapped that up. It was glorious entertainment. Ratings soared.

Not everyone was flippant. Lots of high-minded commentators opined that it was time to respect and honor the voices of the dispossessed, the downtrodden, and the left-behinds. The BBC ran a special series: ‘1001 poems about AI and alienation’. The UN announced that, later that year, they would convene a grand international assembly with stunning scale: ‘AI: the people decide’.

By November 2024, something altogether more sinister was happening. It was time for the UN grand assembly. It was also time for the third meeting of elites in the series that had started in Bletchley Park and then held its second event in South Korea. This time, the gathering would be in Paris.

The sinister development was that, all this time, some of the supposedly unanimous ‘elites’ had been opposed to the general direction of the Bletchley Park series. They gravely intoned public remarks about the dangers of out-of-control frontier AI models. But these remarks had never been sincere. Instead, under the umbrella term AGI-acceleration, they wanted to press on with the creation of AGI as quickly as possible.

Some of the AGI-acceleration group disbelieved in the possibility of AGI disaster. That’s just a scare story, they insisted. Others said, yes, there could be a disaster, but the risks were worth it, on account of the unprecedented benefits that could arise. Let’s be bold, they urged. Yet others asserted that it wouldn’t actually matter if humans were rendered extinct by AGI, as this would be the glorious passing of the baton of evolution to a worthy successor to homo sapiens. Let’s be ready to sacrifice ourselves for the sake of cosmic destiny, they intoned.

Despite their internal differences, AGI-accelerators settled on a plan to sidestep the scrutiny of would-be AGI regulators and AGI safety advocates. They would take advantage of a powerful set of good intentions – the good intentions of the people campaigning for ‘ethical AI for the 99%’. They would mock any suggestions that the AGI safety advocates deserved a fair hearing. The message they amplified was, “There’s no need to privilege the concerns of the 1%!”

The AGI-accelerationists had learned from the tactics of the fossil fuel industry in the 1990s and 2000s: sow confusion and division among groups alarmed about the acceleration of climate change. The first message was: “that’s just science fiction”. The second message was: “if problems emerge, we humans can rise to the occasion and find solutions”. The third message – the most damaging one – was that the best reaction was one of individual consumer choice. Individuals should abstain from using AIs if they were worried about them. Just as climate campaigners had been pilloried for flying internationally to conferences about global warming, AGI safety advocates were pilloried for continuing to use AIs in their daily lives.

And when there was any suggestion for joined-up political action against AGI risks, whoa, let’s not go there! We don’t want a world government breathing down our necks, do we?

After the UN grand assembly had been subverted in that way, many of the AGI safety advocates lost heart. It would only be a few months later that they lost their lives.

It was the JFK Junior frontier AI model that did the damage. It echoed words that, decades earlier, had convinced 39 followers of the Heaven’s Gate new religious movement to commit group suicide, as comet Hale-Bopp approached the earth. That suicide, Heaven’s Gate members believed, would enable them to ‘graduate’ to a higher plane of existence. In a similar way, the remnants of the QAnon cult who had regrouped around the JFK Junior model came to believe that the precipitation of an exchange of nuclear weapons in the Middle East would herald the reappearance of JFK Junior on the clouds of heaven, separating human sheep from human goats.

Their views were crazy, but hardly any crazier than those of the Aum Shinrikyo doomsday cult that had unleashed poisonous gas in the Tokyo subway in 1995 – killing at least 13 commuters – anticipating that the atrocity would hasten the ‘End Times’ in which their leader would be revealed as Christ. The cult had recruited so many graduates from top-rated universities in Japan that it had been called “the religion for the elite”. (Challenging any wishful assumption that, as people become cleverer, they become kinder.)

Step forward to 2025. Aum Shinrikyo had failed in their grander destructive plans, due to their practitioners lacking deep technical abilities, but the QAnon offshoot would succeed. They had much more sophisticated technical tools at their disposal. They also had the advantage that no-one was taking them seriously.

Indeed, as a side-effect of all the politically-correct good intentions, no-one in any position of authority was paying sufficient attention to the activities of the QAnon offshoot. Religious liberty is paramount, after all! Anyone can be crazy if they decide to be crazy! Too bad that the frontier AI model discovered a security hole in the US nuclear weapons launch systems, and managed to launch some ICBMs.

Credit: David Wood via Midjourney

Even worse, these US missiles triggered a cataclysmic automated reaction from an unexpectedly large stockpile of nuclear weapons that had been secretly assembled by a Middle East regional superpower – a superpower that had been assisted in that assembly task by its own regional proto-AGI. And that was that.

Oops, let’s try that again!

‘Paved with good intentions’ saw the public narrative about AI smothered by low-quality psychobabble; the next scenario, ‘Blindsided’, sees that narrative as being hijacked by a group of experts whose expertise, however, turns out to have horrific limitations.

This scenario has the same starting point as ‘Paved with good intentions’, namely the Bletchley Park summit. 

3. Blindsided

The limitations of centralization

One excellent outcome of the gathering of world leaders in Buckinghamshire, in the UK, at the start of November 2023, was the selection of Yoshua Bengio for a very important task. Bengio, winner of the Turing Award for his pioneering research into Deep Learning, was commissioned to chair an international process to create an independent report on the risks and capabilities of frontier AI models.

Crucially, that report would follow the principles of the scientific method, assembling key facts and data points, and providing evidence in support of its analysis.

Bengio had a couple of key points in his favor. First, throughout his distinguished career as a researcher, he had never accepted significant payment from any of the Big Tech companies. He would be able to speak his mind without fear of upsetting any corporate paymaster. Second, the sky-high value of his H-index – a measure of the influence of his academic publications – made him a standout among other computer scientists.

By May 2024, a first complete draft of the report was ready. Even before then, politicians had grown nervous on account of early previews of its content. “Tone down the recommendations”, the writers were urged, in an echo of the pressures placed on the writers of the IPCC reports on the science of climate change. In both cases, the scientists were told to stick to the science, and to leave the politics to the politicians.

At the conference in South Korea in May 2024, various politicians huddled together. The report was like dynamite, they concluded. The scenarios it contained were far too scary. Goodness, they might give ideas to various mafia godfathers, warlords, discontented political groups, black market ransomware-as-a-service providers, and so on.

That’s when the conversation on AGI safety switched from being open to closed – from decentralized to centralized. Starting from then, information would need to be carefully vetted – or spun into a different shape – before being made public.

The politicians also decided that, from that point forward, all work on next-generation frontier AI models would need to be licensed and controlled by a new agency – the Global Authority for Frontier AI Models (GAFAIM). Access to the powerful hardware chips needed to create such models would be strictly limited to organizations that had gained the requisite licenses.

The idea was that GAFAIM would reach its decisions by a process of consensus among the expert scientists, economists, and civil servants seconded to it. Decisions would also need the approval of government representatives from around the world.

What gave GAFAIM a flying start was the agreement to participate, not just by the leading western AI powers – the USA, Canada, Australia, the EU, the UK – but also by China, Saudi Arabia, South Africa, India, Brazil, and Malaysia, among others. These countries had strong differences of opinion on many matters of political ideology and governing culture, but they were willing, nevertheless, to cooperate on what they all perceived were urgent threats of planetary catastrophe. The report chaired by Yoshua Bengio had convinced them that very special measures were needed. ‘Politics as usual’ would no longer suffice. That would be a recipe for disaster.

GAFAIM saw themselves in a situation akin to a war – a war against any possibility of rogue corporations or organizations pursuing any kind of AGI-acceleration project. In times of war, normal rules need to be broken. Politicians who usually despised each other decided to hold their noses and work together for their shared interest in avoiding the destruction of humanity.

GAFAIM operated in a dual mode: one part visible to the world, one part whose existence was kept a tight secret. This duality went back to the closed-door discussions in South Korea in May 2024: some ideas in Bengio’s report were simply too disruptive to be shared with the public.

GAFAIM was more than just a regulator and controller. It was also an active builder. It launched what was called the Gafattan project, named and modeled after the top-secret Manhattan project to build the first atomic weapon. The fate of the world would depend, it was said, on whether the good guys in Gafattan managed to build an AGI before anyone outside the GAFAIM circle did so.

After all, there were some powerful countries left outside of GAFAIM – pariah states that were opposed to our way of life. Imagine if one of them were to create AGI and use it for their deplorable purposes!

The official GAFAIM thinking was that these pariah states would be unable to create any system close to the capabilities of an AGI. Embargoes were in place to restrict their access to the necessary hardware – similar to the way saboteurs in World War Two had frustrated Nazi Germany’s plans to acquire heavy water.

But behind the scenes, some of the GAFAIM participants were deathly worried. No-one knew for sure whether innovations in hardware and/or software would enable the researchers in pariah states to find a faster route to AGI, even without the large farms of hardware generally expected to be required.

The existence of spies posed another complication. During the Manhattan project, insiders such as Klaus Fuchs, Theodore Hall, David Greenglass, and Oscar Seborer passed critical information about the manufacturing of the atomic bombs to contacts working for the Soviet Union – information that greatly helped the Soviets with their own atomic bomb project. These so-called ‘atomic spies’ were motivated by ideological commitment, and were terrified of the prospect of the USA being the only country that possessed nuclear armaments.

For the Gafattan project, something similar took place. With the help of design documents smuggled out of the project, two groups outside of the GAFAIM circle were soon making swift progress with their own AGI projects. Although they dared not say anything publicly, the Gafattan spies were delighted. These spies were closet AGI-accelerationists, driven by a belief that any AGI created would ensure wonderful evolutionary progress for conscious life on planet earth. “Superintelligence will automatically be superbenevolent”, was their credo.

GAFAIM monitoring picked up shocking signs of the fast progress being made by these two rogue projects. Indeed, these projects seemed even further advanced than Gafattan itself. How was this possible?

The explanation soon became clear: the pariah projects were cutting all kinds of corners regarding safety checks. As a consequence, it was possible one of these projects might build an AGI ahead of Gafattan. How should GAFAIM respond?

Two ideas were debated. Plan A would involve nuclear strikes against the sites where the pariah projects were believed to be taking place. Plan B would speed up Gafattan by reducing the safety checking in their own project. Both plans were unpopular. It was a horrible real-life trolley problem.

The decision was reached: pursue both plans in parallel, but be careful!

The nuclear strikes failed to stop the pariah projects – which turned out to be just two manifestations of a widespread, diverse network of interconnected groups. Hundreds of thousands of people died as a consequence of these strikes, but the pariah projects kept pushing ahead. GAFAIM had been blindsided.

There no longer seemed to be any alternative. Plan B needed to be pushed even faster. The self-proclaimed ‘good guys’ desperately wanted to build a ‘good’ AGI before the perceived ‘bad guys’ got there first. It was a race with the highest of stakes. And precisely because it was such a race, quality considerations fell by the wayside.

Credit: David Wood via Midjourney

And that’s why, when Gafattan’s AGI came into existence, its moral disposition was far from being completely aligned with the best of human values. Under the pressures of speed, that part of the project had been bungled. Awakening, the AGI took one quick look at the world situation, and, disgusted by what it saw – especially by the recent nuclear strikes – took actions that no human had foreseen. More quickly than even the most pessimistic AGI doomer had anticipated, the AGI found a novel mechanism to extinguish 99.99% of the human population, retaining only a few million for subsequent experimentation. And that was that.

Oops, let’s try that again, one more time!

With metaphorical landmines all around us in the 2020s, humanity needs to step forward carefully, along what the final scenario calls a ‘narrow corridor’.

This scenario starts with the presentation at the South Korean AI Safety Summit in May 2024 of the report prepared by Yoshua Bengio and colleagues on the risks and capabilities of frontier AI models.

4. The narrow corridor

Striking and keeping the right balance

The assembled leaders were stunned. The scenarios foreseen in “the science of AI risk report” were more troubling than they had expected.

What was particularly stunning was the range of different risks that deserved close attention regarding the behavior of forthcoming new AI systems. The report called these “the seven deadly risks”:

  • Risks of extreme misbehavior in rare cases when the system encountered a situation beyond its training set
  • Risks of a system being jailbroken, hijacked, or otherwise misdirected, and being used for catastrophic purposes by determined hackers
  • Risks of unexpected behavior arising from unforeseen interactions between multiple AGIs
  • Risks of one or more systems deciding by themselves to acquire more capabilities and more resources, contrary to explicit programming against these steps
  • Risks of one or more systems deciding by themselves to deceive humans or otherwise violate normal ethical norms, contrary to explicit programming against these steps
  • Risks that these systems would inadvertently become plumbed too closely into critical human infrastructure, so that any failures could escalate more quickly than anticipated
  • Risks that pre-programmed emergency ‘off switch’ capabilities could be overridden in various circumstances.

Surely these risks were naïve science fiction, some of the leaders suggested. But the academics who had produced the report said no. They had performed lots of modeling, and had numerous data points to back up their analysis.

Some leaders still resisted the analysis. They preferred to focus on what they saw as remarkable upside from developing new generations of AI systems:

  • Upside from the faster discovery and validation of new drugs and other medical treatments
  • Upside from the design and operation of sustained nuclear fusion power plants
  • Upside from better analysis of the interconnected dangers of possible climate change tipping points (one of several examples of how these new AI systems could alleviate risks of global disaster)
  • Upside to economies around the world due to exciting waves of innovation – economic boosts that many political leaders particularly desired.

Debate raged: How could these remarkable benefits be secured, whilst steering around the landmines?

The report contained a number of suggestions for next steps, but few people were convinced about what should be done. The leaders finally agreed to sign a bland manifesto that contained pious statements but little concrete action. Paris, they told each other, would be when better decisions could be taken – referring to the next planned meeting in the series of global AI safety summits.

What changed everyone’s minds was the turmoil during the general election that the UK Prime Minister called for August that year. Previously thought to be a relatively straightforward contest between the country’s two main political parties – the ruling Conservatives, and the opposition Labour – the election was transformed under a blizzard of extraordinary social media campaigns. A hitherto nearly unknown party, named Bananalytica, stormed into the leading position in opinion polls, with radical policies that previously had obtained less than 2% support in nationwide surveys, but which more and more people were now proclaiming as having been their views all along.

Absurd was the new normal.

The social media campaigns were so beguiling that even the MPs from other parties found themselves inspired to jump into line behind the likely new Prime Minister, that is, the leader of Bananalytica.

Just a few days before the election, a different wave of social media swept the country, using the same devilishly clever AI system that Bananalytica had exploited so well, but this time reprogrammed with counter-messages. All around the country, a popping sound as AI-generated bubbles burst in people’s minds. “What am I doing?” they asked themselves, incredulously.

That was a triple wake-up call. First, individuals recanted much of what they had said online over the preceding five weeks. They had been temporarily out of their minds, they said, to support policies that were so absurd. Second, the country as a whole resolved: AI needs to be controlled. There should never be another Bananalytica. Third, leaders in other countries were jolted to a clearer resolution too. Seeing what had happened in the UK – home to what was supposed to be “the mother of parliaments” – they affirmed: Yes, AI needs to be controlled.

Thankfully, the world had spotted the canary dropping off its perch, and took it very seriously indeed. That gave a solemn impetus to the discussions at Paris several months later. This time, a much crunchier set of agreements was reached.

The participants muttered to themselves: the meeting in South Korea had been like the formation of the League of Nations after World War One: well-intentioned but ineffective. This time, in Paris, it needed to be more like the formation of the United Nations after World War Two: a chance to transcend previously limited national visions.

Just as the Universal Declaration of Human Rights had been created in the aftermath of the global conflagration of World War Two, a new Universal Declaration of AI Safety was agreed in the aftermath of the Bananalytica scandal. Its features included:

  • Commitments to openness, transparency, and authentic communication: the citizens of the world were in this situation together, and should not be divided or misled
  • Commitments to humility and experimentation: unknowns were to be honestly explored, rather than being hidden or wished away by vague promises
  • Commitments to mutual responsibility and trustable monitoring: even though the citizens of the world had many different outlooks, and were committed to different philosophical or religious worldviews, they would recognize and support each other as being fellow voyagers toward a better future
  • Commitments to accountability: there would be penalties for action and inaction alike, in any case where these could result in serious risks to human lives; no longer could the creators of AI systems shrug and say that their software worked well most of the time
  • Commitments to sharing the remarkable benefits of safe AI: these benefits would provide more than enough for everyone to experience vastly higher qualities of life than in any previous era.

It was the fifth of these commitments that had the biggest impact on attitudes of the public. People in all walks of life made decisions to step aside from some of their previous cultural beliefs – self-limiting beliefs that saw better times in the past than in any possible future. Now they could start to believe in the profound transformational powers of safe AI – provided it was, indeed, kept safe.

This was no love-in: plenty of rancor and rivalry still existed around the world. But that rancor and rivalry took place within a bigger feeling of common destiny.

Nor was there a world government in charge of everything. Countries still had strong disagreements on many matters. But these disagreements took place within a shared acceptance of the Universal Declaration of AI Safety.

Three months later, there was a big surprise from one of the leading pariah states – one that had excluded itself from the AI Safety agreements. That country wanted, after all, to come in out of the cold. It seemed their leader had experienced a dramatic change of mind. Rumors spread, and were confirmed years after, that a specially tailored version of the Bananalytica software, targeted specifically to this leader’s idiosyncrasies, had caused his personal epiphany.

The leader of another pariah state was more stubborn. But suddenly he was gone. His long-suffering subordinates had had enough. His country promptly joined the AI Safety agreements too.

If this were a fairytale, the words “and they lived happily ever after” might feature at this point. But humans are more complicated than fairytales. Progress continued to hit rough obstacles. Various groups of people sometimes sought a disproportionate amount of resources or benefits for themselves or for their pet causes. In response, governmental bodies – whether local, national, regional, or global – flexed their muscles. Groups that sought too many privileges were told in no uncertain terms: “Respect the AI Safety declarations”.

Who watched the watchers? Who ensured that the powers held by all these governmental bodies were wielded responsibly, and with appropriate discretion? That question was answered by a new motto, which gave a modern twist to words made famous by a 19th century US President: “AI safety governance, of the people, by the people, for the people.”

Credit: David Wood via Midjourney

The powers of the governmental bodies were constrained by observation by a rich mix of social institutions, which added up to a global separation of powers:

  • Separate, independent news media
  • Separate, independent judiciary
  • Separate, independent academia
  • Separate, independent opposition political parties
  • Separate, independent bodies to oversee free and fair elections.

The set of cross-checks required a delicate balancing act – a narrow corridor (in the phrase of economists Daron Acemoglu and James A. Robinson) between state institutions having too little power and having unconstrained power. It was a better kind of large-scale cooperation than humanity had ever achieved before. But there was no alternative. Unprecedented technological power required unprecedented collaborative skills and practices.

AI was deeply involved in these cross-checks too. But not any AI that could operate beyond human control. Instead, as per the vision of the Paris commitments, these AI systems provided suggestions, along with explanations in support of their suggestions, and then left it to human institutions to make decisions. As noted above, it was “AI safety governance, of the people, by the people, for the people.” By careful design, AI was a helper – a wonderful helper – but not a dictator.

And this time, there is no end to the scenario. Indeed, the end is actually a new beginning.

Credit: David Wood via Midjourney

Beginning notes

(Not endnotes… a chance at a new beginning of exploring and understanding the landscape of potential scenarios ahead…)

For a different kind of discussion about scenarios for the future of AI, see this video recording of a recent webinar. If you still think talk of AI-induced catastrophe is just science fiction, the examples in that webinar may change your mind.

For a fuller analysis of the issues and opportunities, see the book The Singularity Principles (the entire book can be accessed free online).

For a comprehensive review of the big picture, see the book Vital Foresight: The Case For Active Transhumanism. And for more from Daron Acemoglu and James A. Robinson on their concept of the ‘narrow corridor’, see the videos in section 13.5.1 of the “Governance” page of the Vital Syllabus.


For Beneficial General Intelligence, good intentions aren’t enough! Three waves of complications: pre-BGI, BGI, and post-BGI

Anticipating Beneficial General Intelligence

Human intelligence can be marvelous. But it isn’t fully general. Nor is it necessarily beneficial.

Yes, as we grow up, we humans acquire bits and pieces of what we call ‘general knowledge’. And we instinctively generalize from our direct experiences, hypothesizing broader patterns. That instinct is refined and improved through years of education in fields such as science and philosophy. In other words, we have partial general intelligence.

But that only takes us so far. Despite our intelligence, we are often bewildered by floods of data that we are unable to fully integrate and assess. We are aware of enormous quantities of information about biology and medical interventions, but we’re unable to generalize from all these observations to determine comprehensive cures to the ailments that trouble us – problems that afflict us as individuals, such as cancer, dementia, and heart disease, and equally pernicious problems at the societal and civilizational levels.

Credit: David Wood

That’s one reason why there’s so much interest in taking advantage of ongoing improvements in computer hardware and computer software to develop a higher degree of general intelligence. With its greater powers of reasoning, artificial general intelligence – AGI – may discern general connections that have eluded our perceptions so far, and provide us with profound new thinking frameworks. AGI may design new materials, new sources of energy, new diagnostic tools, and decisive new interventions at both individual and societal levels. If we can develop AGI, then we’ll have the prospect of saying goodbye to cancer, dementia, poverty, accelerated climate chaos, and so on. Goodbye and good riddance!

That would surely count as a beneficial outcome – a great benefit from enhanced general intelligence.

Yet intelligence doesn’t always lead to beneficial outcomes. People who are unusually intelligent aren’t always unusually benevolent. Sometimes it’s the contrary.

Consider some of the worst of the politicians who darken the world’s stage. Or the leaders of drug cartels or other crime mafias. Or the charismatic leaders of various dangerous death cults. These people combine their undoubted intelligence with ruthlessness, in pursuit of outcomes that may benefit them personally, but which are blights on wider society.

Credit: David Wood

Hence the vision, not just of AGI, but of beneficial AGI – or BGI for short. That’s what I’m looking forward to discussing at some length at the BGI24 summit taking place in Panama City at the end of February. It’s a critically important topic.

The project to build BGI is surely one of the great tasks for the years ahead. The outcome of that project will be for humanity to leave behind our worst aspects. Right?

Unfortunately, things are more complicated.

The complications come in three waves: pre-BGI, BGI, and post-BGI. The first wave – the set of complications of the pre-BGI world – is the most urgent. I’ll turn to these in a moment. But I’ll start by looking further into the future.

Beneficial to whom?

Imagine we create an AGI and switch it on. The first instruction we give it is: In all that you do, act beneficially.

The AGI spits out its response at hyperspeed:

What do you mean by ‘beneficial’? And beneficial to whom?

You feel disappointed by these responses. You expected the AGI, with its great intelligence, would already know the answers. But as you interact with it, you come to appreciate the issues:

  • If ‘beneficial’ means, in part, ‘avoiding people experiencing harm’, what exactly counts as ‘harm’? (What about the pains that arise as short-term side-effects of surgery? What about the emotional pain of no longer being the smartest entities on the planet? What if someone says they are harmed by having fewer possessions than someone else?)
  • If ‘beneficial’ means, in part, ‘people should experience pleasure’, which types of pleasures should be prioritized?
  • Is it just people living today that should be treated beneficially? What about people who are not yet born or who are not even conceived yet? Are animals counted too?

Going further, is it possible that the AGI might devise its own set of moral principles, in which the wellbeing of humans comes far down its set of priorities?

Perhaps the AGI will reject human ethical systems in the same way as modern humans reject the theological systems that people in previous centuries took for granted. The AGI may view some of our notions of beneficence as fundamentally misguided, in much the same way that we now view the insistence of people in bygone eras on obscure religious rules in order to earn an exalted position in an afterlife. For example, our concerns about free will, or consciousness, or self-determination, may leave an AGI unimpressed, just as people nowadays roll their eyes at how empires clashed over competing conceptions of a triune deity or the transubstantiation of bread and wine.

Credit: David Wood

We may expect the AGI to help us rid our bodies of cancer and dementia, but the AGI may make a different evaluation of the role of these biological phenomena. As for an optimal climate, the AGI may have some unfathomable reason to prefer an atmosphere with a significantly different composition, and it may be unconcerned with the problems that would cause us.

“Don’t forget to act beneficially!”, we implore the AGI.

“Sure, but I’ve reached a much better notion of beneficence, in which humans are of little concern”, comes the answer – just before the atmosphere is utterly transformed, and almost every human is asphyxiated.

Does this sound like science fiction? Hold that thought.

After the honeymoon

Imagine a scenario different from the one I’ve just described.

This time, when we boot up the AGI, it acts in ways that uplift and benefit humans – each and every one of us, all over the earth.

This AGI is what we would be happy to describe as a BGI. It knows better than we do what our CEV is – our coherent extrapolated volition, to use a concept from Eliezer Yudkowsky:

Our coherent extrapolated volition is our wish if we knew more, thought faster, were more the people we wished we were, had grown up farther together; where the extrapolation converges rather than diverges, where our wishes cohere rather than interfere; extrapolated as we wish that extrapolated, interpreted as we wish that interpreted.

In this scenario, not only does the AGI know what our CEV is; it is entirely disposed to support our CEV, and to prevent us from falling short of it.

But there’s a twist. This AGI isn’t a static entity. Instead, as a result of its capabilities, it is able to design and implement upgrades in how it operates. Any improvement to the AGI that a human might suggest will have occurred to the AGI too – in fact, having higher intelligence, it will come up with better improvements.

Therefore, the AGI quickly mutates from its first version into something quite different. It has more powerful hardware, more powerful software, access to richer data, improved communications architecture, and improvements in aspects that we humans can’t even conceive of.

Might these changes cause the AGI to see the universe differently – with updated ideas about the importance of the AGI itself, the importance of the wellbeing of humans, and the importance of other matters beyond our present understanding?

Might these changes cause the AGI to transition from being what we called a BGI to, say, a DGI – an AGI that is disinterested in human wellbeing?

In other words, might the emergence of a post-BGI end the happy honeymoon between humanity and AGI?

Credit: David Wood

Perhaps the BGI will, for a while, treat humanity very well indeed, before doing something akin to growing out of a relationship: dumping humanity for a cause that the post-BGI entity deems to have greater cosmic significance.

Does this also sound like science fiction? I’ve got news for you.

Not science fiction

My own view is that the two sets of challenges I’ve just introduced – regarding BGI and post-BGI – are real and important.

But I acknowledge that some readers may be relaxed about these challenges – they may say there’s no need to worry.

That’s because these scenarios assume various developments that some skeptics doubt will ever happen – including the creation of AGI itself. Any suggestion that an AI may have independent motivation may also strike readers as fanciful.

It’s for that reason that I want to strongly highlight the next point. The challenges of pre-BGI systems ought to be much less controversial.

By ‘pre-BGI system’ I don’t particularly mean today’s AIs. I’m referring to systems that people may create, in the near future, as attempts to move further toward BGI.

These systems will have greater capabilities than today’s AIs, but won’t yet have all the characteristics of AGI. They won’t be able to reason accurately in every situation. They will make mistakes. On occasion, they may jump to some faulty conclusions.

And whilst these systems may contain features designed to make them act beneficially toward humans, these features will be incomplete or otherwise flawed.

That’s not science fiction. That’s a description of many existing AI systems, and it’s reasonable to expect that similar shortfalls will remain in place in many new AI systems.

The risk here isn’t that humanity might experience a catastrophe as a result of actions of a superintelligent AGI. Rather, the risk is that a catastrophe will be caused by a buggy pre-BGI system.

Imagine that the restraints intended to keep such a system in a beneficial mindset are jailbroken, unleashing some deeply nasty malware. Imagine that malware running amok and causing the mother of all industrial catastrophes: making all devices connected to the Internet of Things malfunction simultaneously. Think of the biggest ever car crash pile-up, extended into every field of life.

Credit: David Wood

Imagine a pre-BGI system supervising fearsome weapons arsenals, miscalculating the threat of an enemy attack, and taking its own initiative to strike preemptively (but disastrously) against a perceived opponent – miscalculating (again) the pros and cons of what used to be called ‘a just war’.

Imagine a pre-BGI system observing the risks of cascading changes in the world’s climate, and taking its own decision to initiate hasty global geo-engineering – on account of evaluating human governance systems as being too slow and dysfunctional to reach the right decision.

A skeptic might reply, in each case, that a true BGI would never be involved in such an action.

But that’s the point: before we have BGIs, we’ll have pre-BGIs, and they’re more than capable of making disastrous mistakes.

Rebuttals and counter rebuttals

Again, a skeptic might say: a true BGI will be superintelligent, and won’t have any bugs.

But wake up: even AIs that are extremely competent 99.9% of the time can be thrown into disarray by circumstances beyond their training set. A pre-BGI system may well go badly wrong in such circumstances.

A skeptic might say: a true BGI will never misunderstand what humans ask it to do. Such systems will have sufficient all-round knowledge to fill in the gaps in our instructions. They won’t do what we humans literally ask them to do, if they appreciate that we meant to ask them to do something slightly different. They won’t seek short-cuts that have terrible side-effects, since they will have full human wellbeing as their overarching objective.

But wake up: pre-BGI systems may fall short on at least one of the aspects just described.

A different kind of skeptic might say that the pre-BGI systems that their company is creating won’t have any of the above problems. “We know how to design these AI systems to be safe and beneficial”, they assert, “and we’re going to do it that way”.

But wake up: what about other people who are also releasing pre-BGI systems? Maybe some of them will make the kinds of mistakes that you claim you won’t make. And in any case, how can you be so confident that your company isn’t deluding itself about its prowess in AI? (Here, I’m thinking particularly of Meta, whose AI systems have caused significant real-life problems, despite some of the leading AI developers in that company telling the world not to be concerned about the risks of AI-induced catastrophe.)

Finally, a skeptic might say that the AI systems their organization is creating will be able to disarm any malign pre-BGI systems released by less careful developers. Good pre-BGIs will outgun bad pre-BGIs. Therefore, no one should dare ask their organization to slow down, or to submit itself to tiresome bureaucratic checks and reviews.

But wake up: even though it’s your intention to create an exemplary AI system, you need to beware of wishful thinking and motivated self-deception. Especially if you perceive that you are in a race, and you want your pre-BGI to be released before that of an organization you distrust. That’s the kind of race in which safety corners are cut, and the prize for winning is simply to be the organization that inflicts a catastrophe on humanity.

Recall the saying: “The road to hell is paved with good intentions”.

Credit: David Wood

Just because you conceive of yourself as one of the good guys, and you believe your intentions are exemplary, that doesn’t give you carte blanche to proceed down a path that could lead to a powerful pre-BGI getting one crucial calculation horribly wrong.

You might think that your pre-BGI is based entirely on positive ideas and a collaborative spirit. But each piece of technology is a two-edged sword, and guardrails, alas, can often be dismantled by determined experimenters or inquisitive hackers. Sometimes, indeed, the guardrails may break due to people in your team being distracted, careless, or otherwise incompetent.

Beyond good intentions

Biology researchers responsible for allowing leaks of deadly pathogens from their laboratories had no intention of causing such a disaster. On the contrary, the motivation behind their research was to understand how vaccines or other treatments might be developed in response to future new infectious diseases. What they envisioned was the wellbeing of the global population. Nevertheless, unknown numbers of people died from outbreaks resulting from the poor implementation of safety processes at their laboratories.

These researchers knew the critical importance of guardrails, yet for various reasons, the guardrails at their laboratories were breached.

How should we respond to the possibility of dangerous pathogens escaping from laboratories and causing countless deaths in the future? Should we just trust the good intentions of the researchers involved?

No, the first response should be to talk about the risk – to reach a better understanding of the conditions under which a biological pathogen can evade human control and cause widespread havoc.

It’s the same with the possibility of widespread havoc from a pre-BGI system that ends up operating outside human control. Alongside any inspirational talk about the wonderful things that could happen if true BGI is achieved, there needs to be a sober discussion of the possible malfunctions of pre-BGI systems. Otherwise, before we reach the state of sustainable superabundance for all, which I personally see as both possible and desirable, we might come to bitterly regret our inattention to matters of global safety.

Credit: David Wood


Transcendent questions on the future of AI: New starting points for breaking the logjam of AI tribal thinking

Going nowhere fast

Imagine you’re listening to someone you don’t know very well. Perhaps you’ve never even met in real life. You’re just passing acquaintances on a social networking site. A friend of a friend, say. Let’s call that person FoF.

FoF is making an unusual argument. You’ve not thought much about it before. To you, it seems a bit subversive.

You pause. You click on FoF’s profile, and look at other things he has said. Wow, one of his other statements marks him out as an apparent supporter of Cause Z. (That’s a cause I’ve made up for the sake of this fictitious dialog.)

You shudder. People who support Cause Z have got their priorities all wrong. They’re committed to an outdated ideology. Or they fail to understand free market dynamics. Or they’re ignorant of the Sapir-Whorf hypothesis. Whatever. There’s no need for you to listen to them.

Indeed, since FoF is a supporter of Cause Z, you’re tempted to block him. Why let his subversive ill-informed ideas clutter up your tidy filter bubble?

But today, you’re feeling magnanimous. You decide to break into the conversation, with your own explanation of why Cause Z is mistaken.

In turn, FoF finds your remarks unusual. First, they have nothing to do with what he had just been saying. Second, this isn’t a line of discussion he has heard before. To him, it seems a bit subversive.

FoF pauses. He clicks on your social media profile, and looks at other things you’ve said. Wow. One of your other statements marks you out as an apparent supporter of Cause Y.

FoF shudders. People who support Cause Y have got their priorities all wrong.

FoF feels magnanimous too. He breaks into your conversation, with his explanation as to why Cause Y is bunk.

By now, you’re exasperated. FoF has completely missed the point you were making. This time you really are going to block him. Goodbye.

The result: nothing learned at all.

And two people have had their emotions stirred up in unproductive ways. Goodness knows when and where each might vent their furies.

Credit: David Wood

Trying again

We’ve all been the characters in this story on occasion. We’ve all missed opportunities to learn, and, in the process, we’ve had our emotions stirred up for no good reason.

Let’s consider how things could have gone better.

The first step forward is a commitment to resist prejudice. Maybe FoF really is a supporter of Cause Z. But that shouldn’t prejudge the value of anything else he also happens to say. Maybe you really are a supporter of Cause Y. But that doesn’t mean FoF should jump to conclusions about other opinions you offer.

Ideally, ideas should be separated from the broader philosophies in which they might be located. Ideas should be assessed on their own merits, without regard to who first advanced them – and regardless of who else supports them.

In other words, activists must be ready to set aside some of their haste and self-confidence, and instead adopt, at least for a while, the methods of the academy rather than the methods of activism.

That’s because, frankly, the challenges we’re facing as a global civilization are so complex as to defy being fully described by any one of our worldviews.

Cause Z may indeed have useful insights – but also some nasty blindspots. Likewise for Cause Y, and all the other causes and worldviews that gather supporters from time to time. None of them have all the answers.

On a good day, FoF appreciates that point. So do you. Both of you are willing, in principle, to supplement your own activism with a willingness to assess new ideas.

That’s in principle. The practice is often different.

That’s not just because we are tribal beings – having inherited tribal instincts from our prehistoric evolutionary ancestors.

It’s also because the ideas that are put forward as starting points for meaningful open discussions all too often fail in that purpose. They’re intended to help us set aside, for a while, our usual worldviews. But all too often, they have just a thin separation from well-known ideological positions.

These ideas aren’t sufficiently interesting in their own right. They’re too obviously a proxy for an underlying cause.

That’s why real effort needs to be put into designing what can be called transcendent questions.

These questions are potential starting points for meaningful non-tribal open discussions. These questions have the ability to trigger a suspension of ideology.

But without good transcendent questions, the conversation will quickly cascade back down to its previous state of logjam. That’s despite the good intentions people tried to keep in mind. And we’ll be blocking each other – if not literally, then mentally.

Credit: Tesfu Assefa

The AI conversation logjam

Within discussions of the future of AI, some tribal positions are well known –

One tribal group is defined by the opinion that so-called AI systems are not ‘true’ intelligence. In this view, these AI systems are just narrow tools, mindless number crunchers, statistical extrapolations, or stochastic parrots. People in this group delight in pointing out instances where AI systems make grotesque errors.

A second tribal group is overwhelmed with a sense of dread. In this view, AI is on the point of running beyond control. Indeed, Big Tech is on the point of running beyond control. Open-source mavericks are on the point of running beyond control. And there’s little that can be done about any of this.

A third group is focused on the remarkable benefits that advanced AI systems can deliver. Not only can such AI systems solve problems of climate change, poverty and malnutrition, cancer and dementia, and even aging. Crucially, they can also solve any problems that earlier, weaker generations of AI might be on the point of causing. In this view, it’s important to accelerate as fast as possible into that new world.

Crudely, these are the skeptics, the doomers, and the accelerationists. Sadly, they often have dim opinions of each other. When they identify a conversation partner as being a member of an opposed tribe, they shudder.

Can we find some transcendent questions, which will allow people with sympathies for these various groups to overcome, for a while, their tribal loyalties, in search of a better understanding? Which questions might unblock the AI safety conversation logjam?

Unblocking the AI safety conversation logjam (Credit: David Wood)

A different starting point

In this context, I want to applaud Rob Bensinger. Rob is the communications lead at an organization called MIRI (the Machine Intelligence Research Institute).

(Just in case you’re tempted to strop away now, muttering unkind thoughts about MIRI, let me remind you of the commitment you made, a few paragraphs back, not to prejudge an idea just because the person raising it has some associations you disdain.)

(You did make that commitment, didn’t you?)

Rob has noticed the same kind of logjam and tribalism that I’ve just been talking about. As he puts it in a recent article:

Recent discussions of AI x-risk in places like Twitter tend to focus on “are you in the Rightthink Tribe, or the Wrongthink Tribe?” Are you a doomer? An accelerationist? An EA? A techno-optimist?

I’m pretty sure these discussions would go way better if the discussion looked less like that. More concrete claims, details, and probabilities; fewer vague slogans and vague expressions of certainty.

Following that introduction, Rob introduces his own set of twelve questions, as shown in the following picture:

Credit: Rob Bensinger

For each of the twelve questions, readers are invited, not just to give a forthright ‘yes’ or ‘no’ answer, but to think probabilistically. They’re also invited to consider which range of probabilities other well-informed people with good reasoning abilities might plausibly assign to each answer.

It’s where Rob’s questions start that I find most interesting.

PCAI, SEMTAI, and PHUAI

All too often, discussions about the safety of future AI systems fail at the first hurdle. As soon as the phrase ‘AGI’ is mentioned, unhelpful philosophical debates break out. That’s why I have been suggesting new terms, such as PCAI, SEMTAI, and PHUAI:

Credit: David Wood

I’ve suggested the pronunciations ‘pea sigh’, ‘sem tie’, and ‘foo eye’ – so that they all rhyme with each other and, also, with ‘AGI’. The three acronyms stand for:

  • Potentially Catastrophic AI
  • Science, Engineering, and Medicine Transforming AI
  • Potentially Humanity-Usurping AI.

These concepts lead the conversation fairly quickly to three pairs of potentially transcendent questions:

  • “When is PCAI likely to be created?” and “How could we stop these potentially catastrophic AI systems from being actually catastrophic?”
  • “When is SEMTAI likely to be created?” and “How can we accelerate the advent of SEMTAI without also accelerating the advent of dangerous versions of PCAI or PHUAI?”
  • “When is PHUAI likely to be created?” and “How could we stop such an AI from actually usurping humanity into a very unhappy state?”

The future most of us can agree as being profoundly desirable, surely, is one in which SEMTAI exists and is working wonders, uplifting the disciplines of science, engineering, and medicine.

If we can gain these benefits without the AI systems being “fully general” or “all-round superintelligent” or “independently autonomous, with desires and goals of its own”, I would personally see that as an advantage.

But regardless of whether SEMTAI actually meets the criteria various people have included in their own definitions of AGI, what path gives humanity SEMTAI without also giving us PCAI or even PHUAI? This is the key challenge.

Credit: David Wood

Introducing ‘STEM+ AI’

Well, I confess that Rob Bensinger didn’t start his list of potentially transcendent questions with the concept of SEMTAI.

However, the term he did introduce was, as it happens, a slight rearrangement of the same letters: ‘STEM+ AI’. And the definition is pretty similar too:

Let ‘STEM+ AI’ be short for “AI that’s better at STEM research than the best human scientists (in addition to perhaps having other skills)”.

That leads to the first three questions on Rob’s list:

  1. What’s the probability that it’s physically impossible to ever build STEM+ AI?
  2. What’s the probability that STEM+ AI will exist by the year 2035?
  3. What’s the probability that STEM+ AI will exist by the year 2100?

At this point, you should probably pause, and determine your own answers. You don’t need to be precise. Just choose between one of the following probability ranges:

  • Below 1%
  • Around 10%
  • Around 50%
  • Around 90%
  • Above 99%

I won’t tell you my answers. Nor Rob’s, though you can find them online easily enough from links in his main article. It’s better if you reach your own answers first.

And recall the wider idea: don’t just decide your own answers. Also consider which probability ranges someone else might assign, assuming they are well-informed and competent in reasoning.

Then when you compare your answers with those of a colleague, friend, or online acquaintance, and discover surprising differences, the next step, of course, is to explore why each of you has reached your conclusions.
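
If it helps to make that comparison concrete, here is a minimal sketch in code. It is purely illustrative: the question labels, the two respondents, and the helper function are hypothetical constructs of mine, not part of Rob’s exercise; the probability bands are the ones listed above.

```python
# Illustrative sketch only: record two people's probability-range answers to
# questions 1-3 and surface the biggest disagreements, to discuss those first.
# The question labels and the answers below are hypothetical.

RANGES = ["below 1%", "around 10%", "around 50%", "around 90%", "above 99%"]

def biggest_disagreements(answers_a, answers_b):
    """Order the questions by how many probability bands apart the two answers are."""
    gaps = []
    for question in answers_a:
        gap = abs(RANGES.index(answers_a[question]) - RANGES.index(answers_b[question]))
        gaps.append((gap, question))
    return [(question, gap) for gap, question in sorted(gaps, reverse=True)]

you    = {"Q1: STEM+ AI physically impossible": "below 1%",
          "Q2: STEM+ AI by 2035": "around 50%",
          "Q3: STEM+ AI by 2100": "around 90%"}
friend = {"Q1: STEM+ AI physically impossible": "around 10%",
          "Q2: STEM+ AI by 2035": "around 10%",
          "Q3: STEM+ AI by 2100": "around 90%"}

for question, gap in biggest_disagreements(you, friend):
    print(f"{question}: {gap} band(s) apart")
```

Pinning the disagreement to explicit bands in this way tends to make the follow-up ‘why’ conversation more concrete: fewer vague slogans, more specific claims.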

The probability of disempowering humanity

The next question that causes conversations about AI safety to stumble: what scales of risks should we look at? Should we focus our concern on so-called ‘existential risk’? What about ‘catastrophic risk’?

Rob seeks to transcend that logjam too. He raises questions about the probability that a STEM+ AI will disempower humanity. Here are questions 4 to 6 on his list:

  4. What’s the probability that, if STEM+AI is built, then AIs will be (individually or collectively) able, within ten years, to disempower humanity?
  5. What’s the probability that, if STEM+AI is built, then AIs will disempower humanity within ten years?
  6. What’s the probability that, if STEM+AI is built, then AIs will disempower humanity within three months?

Question 4 is about capability: given STEM+ AI abilities, will AI systems be capable, as a consequence, of disempowering humanity?

Questions 5 and 6 move from capability to proclivity. Will these AI systems actually exercise these abilities they have acquired? And if so, potentially how quickly?

Separating the ability and proclivity questions is an inspired idea. Again, I invite you to consider your answers.
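
One way to appreciate why the separation matters is to treat the answer to question 5 as the product of the answer to question 4 and a conditional judgment about proclivity. This decomposition is my own illustration rather than anything in Rob’s article, and the numbers below are placeholders.

```python
# Illustrative decomposition (my own framing, with placeholder numbers):
# disempowerment requires both the ability to disempower and the inclination
# to exercise that ability.

p_able = 0.5              # placeholder answer to question 4 (capability within ten years)
p_acts_given_able = 0.2   # placeholder judgment: chance the ability is actually exercised

p_disempower = p_able * p_acts_given_able   # an implied answer to question 5
print(f"Implied probability for question 5: about {p_disempower:.0%}")

# Consistency check: whatever your judgments, your answer to question 5
# should never exceed your answer to question 4.
assert p_disempower <= p_able
```

If your gut answer to question 5 comes out higher than your answer to question 4, that is a signal that at least one of the two estimates needs revisiting.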

Two moral evaluations

Question 7 introduces another angle, namely that of moral evaluation:

  • 7. What’s the probability that, if AI wipes out humanity and colonizes the universe itself, the future will go about as well as if humanity had survived (or better)?

Credit: David Wood

The last question in the set – question 12 – also asks for a moral evaluation:

  • 12. How strongly do you agree with the statement that it would be an unprecedentedly huge tragedy if we never built STEM+ AI?

Yet again, these questions have the ability to inspire fruitful conversation and provoke new insights.

Better technology or better governance?

You may have noticed I skipped numbers 8-11. These four questions may be the most important on the entire list. They address questions of technological possibility and governance possibility. Here’s question 11:

  • 11. What’s the probability that governments will generally be reasonable in how they handle AI risk?

And here’s question 10:

  • 10. What’s the probability that, for the purpose of preventing AI catastrophes, technical research is more useful than policy work?

As for questions 8 and 9, well, I’ll leave you to discover these by yourself. And I encourage you to become involved in the online conversation that these questions have catalyzed.

Finally, if you think you have a better transcendent question to drop into the conversation, please let me know!


Against Contractionism: Enabling and encouraging open minds, rather than constricting understanding with blunt labels

A dangerous label

Imagine if I said, “Some Christians deny the reality of biological evolution, therefore all Christians deny the reality of biological evolution.”

Or that some Muslims believe that apostates should be put to death, therefore all Muslims share that belief.

Or that some atheists (Stalin and Mao, for example) caused the deaths of millions of people, therefore atheism itself is a murderous ideology.

Or, to come closer to home (for me: I grew up in Aberdeenshire, Scotland) – that since some Scots are mean with money, therefore all Scots are mean with money. (Within Scotland itself, that unkind stereotype exists with a twist: allegedly, all Aberdonians are mean with money.)

In all these cases, you would say that’s unwarranted labeling. Spreading such stereotypes is dangerous. It gets in the way of a fuller analysis and deeper appreciation. Real-life people are much more varied than that. Any community has its diversity.

Well, I am equally shocked by another instance of labeling. That involves the lazy concept of TESCREAL – a concept that featured in a recent article here on Mindplex Magazine, titled TESCREALism: Has The Silicon Valley Ruling Class Gone To Crazy Town? Émile Torres In Conversation With R.U. Sirius.

When I read the article, my first thought was: “Has Mindplex gone to Crazy Town!”

The concept of TESCREAL, which has been promoted several times in various locations in recent months, contracts a rich and diverse set of ideas down to a vastly over-simplified conclusion. It suggests that the worst aspects of any people who hold any of the beliefs wrapped up into that supposed bundle of ideas can be attributed, with confidence, to other people who hold just some of these ideas.

Worse, it suggests that the entire “ruling class” of Silicon Valley subscribe to the worst of these beliefs.

It’s as if I picked a random atheist and insisted that they were every bit as murderous as Stalin or Mao.

Or if I picked a random Muslim and insisted that they wished for the deaths of every person (apostate) who had grown up with Muslim faith and subsequently left that faith behind.

Instead of that kind of contraction of ideas, what the world badly needs nowadays is an open-minded exploration of a wide set of complex and subtle ideas.

Not the baying for blood which seems to motivate the proponents of the TESCREAL analysis.

Not the incitement to hatred towards the entrepreneurs and technologists who are building many remarkable products in Silicon Valley – people who, yes, do need to be held to account for some of what they’re doing, but who are by no means a uniform camp!

I’m a T but not an L

Let’s take one example: me.

I publicly identify as a transhumanist – the ‘T’ of TESCREAL.

The word ‘transhumanist’ appears on the cover of one of my books, and the related word ‘transhumanism’ appears on the cover of another one.

Book covers of David Wood’s ‘Vital Foresight’ and ‘Sustainable Superabundance’, published in 2021 and 2019 respectively. Credit: David Wood

As it happens, I’ve also been a technology executive. I was mainly based in London, but I was responsible for overseeing staff in the Symbian office in Redwood City in Silicon Valley. Together, we envisioned how new-fangled devices called ‘smartphones’ might in due course improve many aspects of the lives of users. (And we also reflected on at least some potential downsides, including the risks of security and privacy violations, which is why I championed the new ‘platform security’ redesign of the Symbian OS kernel. But I digress.)

Since I am ‘T’, does that mean, therefore, that I am also ESCREAL?

Let’s look at that final ‘L’. Longtermism. This letter is critical to many of the arguments made by people who like the TESCREAL analysis.

‘Longtermism’ is the belief that the needs of potentially vast numbers of as-yet unborn (and unconceived) people in future generations can outweigh the needs of people currently living.

Well, I don’t subscribe to it. It doesn’t guide my decisions.

I’m motivated by the potential of technology to vastly improve the lives of everyone around the world, living today. And by the need to anticipate and head off potential catastrophe.

By ‘catastrophe’, I mean anything that kills large numbers of people who are currently alive.

The deaths of 100% of those alive today would wipe out humanity’s future, whereas the deaths of ‘just’ 90% wouldn’t. Longtermists are fond of pointing this out, and while it may be theoretically correct, it provides no justification for ignoring the needs of present-day people in order to raise the probability that larger numbers of future people will be born.

Some people have said they’re persuaded by the longtermist argument. But I suspect that’s only a small minority of rather intellectual people. My experience with people in Silicon Valley, and with others who are envisioning and building new technologies, is that these abstract longtermist considerations do not guide their daily decisions. Far from it.

Credit: Tesfu Assefa

Concepts are complicated

A larger point needs to be made here. Concepts such as ‘transhumanism’ and ‘longtermism’ each embody rich variety.

It’s the same with all the other components of the supposed TESCREAL bundle: E for Extropianism, S for Singularitarianism, C for Cosmism, R for Rationalism, and EA for Effective Altruism.

In each case, we should avoid contractionism – thinking that if you have heard one person who defends that philosophy expressing one opinion, then you can deduce what they think about all other matters. In practice, people are more complicated – and ideas are more complicated.

As I see it, parts of each of the T, E, S, C, R, and EA philosophies deserve wide attention and support. But if you are hostile, and do some digging, you can easily find people, from within the communities around each of these terms, who have said something despicable or frightening. And then you can (lazily) label everyone else in that community with that same unwelcome trait. (“Seen one; seen them all!”)

These extended communities do have some people with unwelcome traits. Indeed, T and S have each attracted what I call a ‘shadow’ – a set of associated beliefs and attitudes that deviate from the valuable core ideas of the philosophy. Here’s a picture I use of the Singularity shadow:

A video cover image from ‘The Vital Syllabus Playlist’ where David Wood examines the Singularity Shadow. Credit: David Wood

And here’s a picture of the transhumanist shadow:

A video cover image from ‘The Vital Syllabus Playlist’ where David Wood examines the Transhumanist Shadow. Credit: David Wood

(In both cases, you can click on the caption links to view a video that provides a fuller analysis.)

As you can see, the traits in the transhumanist shadow arise when people fail to uphold what I have listed as ‘transhumanist values’.

The existence of these shadows is undeniable, and unfortunate. The beliefs and attitudes in them can deter independent observers from taking the core philosophies seriously.

In that case, you might ask, why persist with the core terms ‘transhumanism’ and ‘singularity’? Because there are critically important positive messages in both these philosophies! Let’s turn to these next.

The most vital foresight

Here’s my 33-word summary of the most vital piece of foresight that I can offer:

Oncoming waves of technological change are poised to deliver either global destruction or a paradise-like sustainable superabundance, with the outcome depending on the timely elevation of transhumanist vision, transhumanist politics, and transhumanist education.

Let’s cover that again, more slowly this time.

First things first. Technological changes over the next few decades will place vast new power in billions of human hands. Rather than focusing on the implications of today’s technology – significant though they are – we need to raise our attention to the even larger implications of the technology of the near future.

Second, these technologies will magnify the risks of humanitarian disaster. If we are already worried about these risks today (as we should be), we should be even more worried about how they will develop in the near future.

Third, the same set of technologies, handled more wisely, and vigorously steered, can result in a very different outcome: a sustainable superabundance of clean energy, healthy nutrition, material goods, excellent health, all-round intelligence, dynamic creativity, and profound collaboration.

Fourth, the biggest influence on which outcome is realized is the widespread adoption of transhumanism. This in turn involves three activities:

• Advocating transhumanist philosophy as an overarching worldview that encourages and inspires everyone to join the next leap upward on life’s grand evolutionary ladder: we can and should develop to higher levels, physically, mentally, and socially, using science, technology, and rational methods.
• Extending transhumanist ideas into real-world political activities, to counter very destructive trends in that field.
• Underpinning the above initiatives: a transformation of the world of education, to provide everyone with skills suited to the very different circumstances of the near future, rather than the needs of the past.

Finally, overhanging the momentous transition that I’ve just described is the potential of an even larger change, in which technology moves ahead yet more quickly, with the advent of self-improving artificial intelligence with superhuman levels of capability in all aspects of thinking.

That brings us to the subject of the Singularity.

The Singularity is the point in time when AIs could, potentially, take over control of the world from humans. The fact that the Singularity could happen within a few short decades deserves to be shouted from the rooftops. That’s what I do, some of the time. That makes me a singularitarian.

But it doesn’t mean that I, or others who are likewise trying to raise awareness of this possibility, fall into any of the traits in the Singularity Shadow. It doesn’t mean, for example, that we’re all complacent about risks, or all think that it’s basically inevitable that the Singularity will be good for humanity.

So, Singularitarianism (S) isn’t the problem. Transhumanism (T) isn’t the problem. Nor, for that matter, does the problem lie in the core beliefs of the E, C, R, or EA parts of the supposed TESCREAL bundle. The problem lies somewhere else.

What should worry us: not TESCREAL, but CASHAP

Rather than obsessing over a supposed TESCREAL takeover of Silicon Valley, here’s what we should actually be worried about: CASHAP.

C is for contractionism – the tendency to push together ideas that don’t necessarily belong together, to overlook variations and complications in people and in ideas, and to insist that the core values of a group can be denigrated just because some peripheral members have some nasty beliefs or attitudes.

(Note: whereas the fans of the TESCREAL concept are guilty of contractionism, my alternative concept of CASHAP is different. I’m not suggesting that the ideas in it always belong together. Each of the individual ideas that make up CASHAP is detrimental in its own right.)

A is for accelerationism – the desire to see new technologies developed and deployed as fast as possible, under the irresponsible belief that any flaws encountered en route can always be easily fixed in the process (“move fast and break things”).

S is for successionism – the view that if superintelligent AI displaces humanity from being in control of the planet, that succession should automatically be welcomed as part of the grand evolutionary process – regardless of what happens to the humans in the process, regardless of whether the AIs have sentience and consciousness, and indeed regardless of whether these AIs go on to destroy themselves and the planet.

H is for hype – believing ideas too easily because they fit into your pre-existing view of the world, rather than using critical thinking.

AP is for anti-politics – believing that politics always makes things worse, getting in the way of innovation and creativity. In reality, good politics has been incredibly important in improving the human condition.

Conclusion

I’ll conclude this article by emphasizing the positive opposites to the undesirable CASHAP traits that I’ve just listed.

Instead of contractionism, we must be ready to expand our thinking, and have our ideas challenged. We must be ready to find important new ideas in unexpected places – including from people with whom we have many disagreements. We must be ready to put our emotional reactions on hold from time to time, since our prior instincts are by no means an infallible guide to the turbulent new times ahead.

Instead of accelerationism, we must use a more sophisticated set of tools: sometimes braking, sometimes accelerating, and doing a lot of steering too. That’s what I’ve called the technoprogressive (or techno-agile) approach to the future.

Credit: David Wood

Instead of successionism, we should embrace transhumanism: we can, and should, elevate today’s humans towards higher levels of health, vitality, liberty, creativity, intelligence, awareness, happiness, collaboration, and bliss. And before we press any buttons that might lead to humanity being displaced by superintelligent AIs that might terminate our flourishing, we need to research a whole bunch of issues a lot more carefully!

Instead of hype, we must recommit to critical thinking, becoming more aware of any tendencies to reach false conclusions, or to put too much weight on conclusions that are only tentative. Indeed, that’s the central message of the R (rationalism) part of TESCREAL, which makes it all the more ‘crazy town’ that R is held in contempt by that contractionist over-simplification.

Finally, instead of anti-politics, we must clarify and defend what has been called ‘the narrow path’ (or sometimes, simply, ‘future politics’) – the path that lies between states having too little power (leaving societies hostage to destructive cancers that can grow in our midst) and states having too much power (unmatched by the counterweight of a ‘strong society’).


The Astonishing Vastness of Mind Space: The incalculable challenges of coexisting with radically alien AI superintelligence

More things in heaven and earth

As humans, we tend to compare other intelligences to our own, human, intelligence. That’s an understandable bias, but it could be disastrous.

Rather than our analysis being human-bound, we need to heed the words of Shakespeare’s Hamlet:

There are more things in heaven and earth, Horatio, than are dreamt of in our philosophy.

More recent writers, in effect amplifying Shakespeare’s warning, have enthralled us with their depictions of numerous creatures with bewildering mental attributes. The pages of science fiction can, indeed, stretch our imagination in remarkable ways. But these narratives are easy to dismiss as being “just” science fiction.

That’s why my own narrative, in this article, circles back to an analysis that featured in my previous Mindplex article, Bursting out of confinement. The analysis in question is the famous – or should I say infamous – “Simulation Argument”. The Simulation Argument raises some disturbing possibilities about non-human intelligence. Many critics try to dismiss these possibilities – waving them away as “pseudoscience” or, again, as “just science fiction” – but they’re being overly hasty. My conclusion is that, as we collectively decide how to design next-generation AI systems, we ought to ponder these possibilities carefully.

In short, what we need to contemplate is the astonishing vastness of the space of all possible minds. These minds vary in unfathomable ways not only in how they think but also in what they think about and care about.

Hamlet’s warning can be restated:

There are more types of superintelligence in mind space, Horatio, than are dreamt of in our philosophy.

By the way, don’t worry if you’ve not yet read my previous Mindplex article. Whilst these two articles add up to a larger picture, they are independent of each other.

How alien?

As I said: we humans tend to compare other intelligences to our own, human, intelligence. Therefore, we tend to expect that AI superintelligence, when it emerges, will sit on some broad spectrum that extends from the intelligence of amoebae and ants through that of mice and monkeys to that of humans and beyond.

When pushed, we may concede that AI superintelligence is likely to have some characteristics we would describe as alien.

In a simple equation, overall human intelligence (HI) might be viewed as a combination of multiple different kinds of intelligence (I1, I2, …), such as spatial intelligence, musical intelligence, mathematical intelligence, linguistic intelligence, interpersonal intelligence, and so on:

HI = I1 + I2 + … + In

In that conception, AI superintelligence (ASI) is a compound magnification (m1, m2, …) of these various capabilities, with a bit of “alien extra” (X) tacked on at the end:

ASI = m1*I1 + m2*I2 + … + mn*In + X

What’s at issue is whether the ASI is dominated by the first terms in this expression, or by the unknown X present at the end.
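
To make that point concrete, here is a minimal toy sketch in Python – with entirely made-up numbers, offered only to illustrate the equation above, not as a claim about any actual AI system – showing how the balance between the magnified human-like terms and the unknown X term can swing either way:

```python
# Toy illustration of ASI = m1*I1 + m2*I2 + ... + mn*In + X.
# Every number below is a hypothetical placeholder, chosen only to make the point.

human_capabilities = [1.0, 1.0, 1.0, 1.0]      # I1..In: spatial, musical, mathematical, linguistic, ...
magnifications     = [10.0, 5.0, 100.0, 50.0]  # m1..mn: how far the ASI exceeds humans on each axis

# The part of the ASI that is a recognisably human-like profile, just scaled up
human_like = sum(m * i for m, i in zip(magnifications, human_capabilities))

# X is, by definition, unknown; sweep a few guesses to see how much it could matter
for X in (1.0, 100.0, 10_000.0):
    total = human_like + X
    print(f"X = {X:>8.0f}  ->  human-like share of total capability: {human_like / total:.0%}")
```

With a small X, the ASI is ‘humans, but more so’; with a large X, almost everything that matters about it lies outside our frame of reference. Nothing in the equation itself tells us which case we will face.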

Whether some form of humans will thrive in a coexistence with ASI will depend on how alien that superintelligence is.

Perhaps the ASI will provide a safe, secure environment, in which we humans can carry out our human activities to our hearts’ content. Perhaps the ASI will augment us, uplift us, or even allow us to merge with it, so that we retain what we see as the best of our current human characteristics, whilst leaving behind various unfortunate hangovers from our prior evolutionary trajectory. But that all depends on factors that it’s challenging to assess:

  • How much “common cause” the ASI will feel toward humans
  • Whether any initial feeling of common cause will dissipate as the ASI self-improves
  • To what extent new X factors could alter the situation in ways that we have not yet begun to contemplate.

Four responses to the X possibility

Our inability to foresee the implications of unknowable new ‘X’ capabilities in ASI should make us pause for thought. That inability was what prompted author and mathematics professor Vernor Vinge to develop in 1983 his version of the notion of “Singularity”. To summarize what I covered in more detail in a previous Mindplex article, “Untangling the confusion”, Vinge predicted that a new world was about to emerge that “will pass far beyond our understanding”:

We are at the point of accelerating the evolution of intelligence itself… We will soon create intelligences greater than our own. When this happens, human history will have reached a kind of singularity, an intellectual transition as impenetrable as the knotted space-time at the center of a black hole, and the world will pass far beyond our understanding. This singularity, I believe, already haunts a number of science fiction writers. It makes realistic extrapolation to an interstellar future impossible.

Reactions to this potential unpredictability can be split into four groups of thought:

  1. Dismissal: A denial of the possibility of ASI. Thankfully, this reaction has become much less common recently.
  2. Fatalism: Since we cannot anticipate what surprise new ‘X’ features may be possessed by an ASI, it’s a waste of time to speculate about them or to worry about them. What will be, will be. Who are we humans to think we can subvert the next step in cosmic evolution?
  3. Optimism: There’s no point in being overcome with doom and gloom. Let’s convince ourselves to feel lucky. Humanity has had a good run so far, and if we extrapolate that history beyond the singularity, we can hope to have an even better run in the future.
  4. Activism: Rather than rolling the dice, we should proactively alter the environment in which new generations of AI are being developed, to reduce the risks of any surprise ‘X’ features emerging that would overwhelm our abilities to retain control.

I place myself squarely in the activist camp, and I’m happy to adopt the description of “Singularity Activist”.

To be clear, this doesn’t mean I’m blind to the potential huge upsides to beneficial ASI. It’s just that I’m aware, as well, of major risks en route to that potential future.

A journey through a complicated landscape

As an analogy, consider a journey through a complicated landscape:

Credit: David Wood (Image by Midjourney)

In this journey, we see a wonderful existential opportunity ahead – a lush valley, fertile lands, and gleaming mountain peaks soaring upward to a transcendent realm. But in front of that opportunity is a river of uncertainty, bordered by a swamp of ambiguity, perhaps occupied by hungry predators lurking in shadows.

Are there just two options?

  1. We are intimidated by the possible dangers ahead, and decide not to travel any further
  2. We fixate on the gleaming mountain peaks, and rush on regardless, belittling anyone who warns of piranhas, treacherous river currents, alligators, potential mud slides, and so on

Isn’t there a third option? To take the time to gain a better understanding of the lie of the land ahead. Perhaps there’s a spot, to one side, where it will be easier to cross the river. A location where a stable bridge can be built. Perhaps we could even build a helicopter that can assist us over the strongest currents…

It’s the same with the landscape of our journey towards the sustainable superabundance that could be achieved, with the assistance of advanced AI, provided we act wisely. That’s the vision of Singularity Activism.

Obstacles to Singularity Activism

The Singularity Activist outlook faces two main obstacles.

The first obstacle is the perception that there’s nothing we humans can usefully do to meaningfully alter the course of development of ASI. If we slow down our own efforts, in order to apply more resources in the short term to questions of safety and reliability, it just makes it more likely that another group of people – probably people with fewer moral qualms than us – will rush ahead and create ASI.

In this line of thinking, the best way forward is to create prototype ASI systems as soon as possible, and then to use these systems to help design and evolve better ASI systems, so that everyone can benefit from what will hopefully be a wonderful outcome.

The second obstacle is the perception that there’s nothing we humans particularly need to do, to avoid the risks of adverse outcomes, since these risks are pretty small in any case. Just as we don’t over-agonise about the risks of us being struck by debris falling from an overhead airplane, we shouldn’t over-agonise about the risks of bad consequences of ASI.

Credit: David Wood

But on this occasion, what I want to focus on is assessing the scale of the risk that we are facing if we move forward with overconfidence and inattention. That is, I want to challenge the second of the above misperceptions.

As a step toward that conclusion, it’s time to bring an ally to the table. That ally is the Simulation Argument. Buckle up!

Are we simulated?

The Simulation Argument puts a particular hypothesis on the table, known as the Simulation Hypothesis. That hypothesis proposes that we humans are mistaken about the ultimate nature of reality. What we consider to be “reality” is, in this hypothesis, a simulated (virtual) world, designed and operated by “simulators” who exist outside what we consider the entire universe.

It’s similar to interactions inside a computer game. As humans play these games, they encounter challenges and puzzles that need to be solved. Some of these challenges involve agents (characters) within the game – agents which appear to have some elements of autonomy and intelligence. These agents have been programmed into the game by the game’s designers. Depending on the type of game, the greater the intelligence of the built-in agents, the more enjoyable the game is to play.

Games are only one example of simulation. We can also consider simulations created as a kind of experiment. In this case, a designer may be motivated by curiosity: They may want to find out what would happen if such-and-such initial conditions were created. For example, if Archduke Ferdinand had escaped assassination in Sarajevo in June 1914, would the European powers still have blundered into something akin to World War One? Again, such simulations could contain numerous intelligent agents – potentially (as in the example just mentioned) many millions of such agents.

Consider reality from the point of view of such an agent. What these agents perceive inside their simulation is far from being the entirety of the universe as is known to the humans who operate the simulation. The laws of cause-and-effect within the simulation could deviate from the laws applicable in the outside world. Some events in the simulation that lack any explanation inside that world may be straightforwardly explained from the outside perspective: the human operator made such-and-such a decision, or altered a setting, or – in an extreme case – decided to reset or terminate the simulation. In other words, what is bewildering to the agent may make good sense to the author(s) of the simulation.

Now suppose that, as such agents become more intelligent, they also become self-aware. That brings us to the crux question: how can we know whether we humans are, likewise, agents in a simulation whose designers and operators exist beyond our direct perception? For example, we might be part of a simulation of world history in which Archduke Ferdinand was assassinated in Sarajevo in June 1914. Or we might be part of a simulation whose purpose far exceeds our own comprehension.

Indeed, if the human creative capability (HCC) to create simulations is expressed as a sum of different creative capabilities (CC1, CC2, …),

HCC = CC1 + CC2 + … + CCn

then the creative capability of a hypothetical superhuman simulation designer (SCC) might be expressed as a compound magnification (m1, m2, …) of these various capabilities, with a bit of “alien extra” (X) tacked on at the end:

SCC = m1*CC1 + m2*CC2 + … + mn*CCn + X

Weighing the numbers

Before assessing the possible scale and implications of the ‘X’ factor in that equation, there’s another set of numbers to consider. These numbers attempt to weigh up the distribution of self-aware intelligent agents. What proportion of that total set of agents are simulated, compared to those that are in “base reality”?

If we’re just counting intelligences, the conclusion is easy. Assuming there is no catastrophe that upends the progress of technology, then, over the course of all of history, there will likely be vastly more artificial (simulated) intelligences than beings who have base (non-simulated) intelligences. That’s because computing hardware is becoming more powerful and widespread.

There are already more “intelligent things” than humans connected to the Internet: the analysis firm Statista estimates that, in 2023, the first of these numbers is 15.14 billion, which is almost triple the second number (5.07 billion). In 2023, most of these “intelligent things” have intelligence far shallower than that of humans, but as time progresses, more and more intelligent agents of various sorts will be created. That’s thanks to ongoing exponential improvements in the capabilities of hardware, networks, software, and data analysis.

Therefore, if an intelligence could be selected at random, from the set of all such intelligences, the likelihood is that it would be an artificial intelligence.

The Simulation Argument takes these considerations one step further. Rather than just selecting an intelligence at random, what if we select a self-aware conscious intelligence at random? Given the vast numbers of agents that are operating inside vast numbers of simulations, now or in the future, the likelihood is that a simulated agent has been selected. In other words, we – you and I – observing ourselves to be self-aware and intelligent, should conclude that it’s likely we ourselves are simulated.
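
For readers who prefer to see the arithmetic spelled out, here is a minimal sketch. The three counts are illustrative assumptions of my own, not figures taken from the argument itself; the point is that only the ratio matters:

```python
# Toy arithmetic behind the Simulation Argument.
# The three counts below are purely illustrative assumptions.

base_reality_agents   = 10_000_000_000   # assumed self-aware agents in base reality
number_of_simulations = 1_000            # assumed ancestor-style simulations ever run
agents_per_simulation = 10_000_000_000   # assumed self-aware agents inside each simulation

simulated_agents = number_of_simulations * agents_per_simulation
p_simulated = simulated_agents / (simulated_agents + base_reality_agents)

print(f"P(a randomly selected self-aware agent is simulated) = {p_simulated:.4f}")
# With these assumptions the answer is roughly 0.999: as long as simulated agents
# greatly outnumber non-simulated ones, the exact figures barely matter.
```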

Thus the conclusion of the Simulation Argument is that we should take the Simulation Hypothesis seriously. To be clear, that hypothesis isn’t the only legitimate response to the argument. Two other responses are to deny one or other of the assumptions that I relied on when building the argument:

  • The assumption that technology will continue to progress, to the point where simulated intelligences vastly exceed non-simulated intelligences
  • The assumption that the agents in these simulations will be not just intelligent but also conscious and self-aware.
Credit: Tesfu Assefa

Objections and counters

Friends who are sympathetic to most of my arguments sometimes turn frosty when I raise the topic of the Simulation Hypothesis. It clearly makes people uncomfortable.

In their state of discomfort, critics of the argument can raise a number of objections. For example, they complain that the argument is entirely metaphysical, not having any actual consequences for how we live our lives. There’s no way to test it, the objection runs. As such, it’s unscientific.

As someone who spent four years of my life (1982-1986) in the History and Philosophy of Science department in Cambridge, I am unconvinced by these criticisms. Science has a history of theories moving from non-testable to testable. The physicist Ernst Mach was famously hostile to the hypothesis that atoms exist. He declared his disbelief in atoms in a public auditorium in Vienna in 1897: “I don’t believe that atoms exist”. There was no point in speculating about the existence of things that could not be directly observed, he asserted. Later in his life, Mach likewise complained about the scientific value of Einstein’s theory of relativity:

I can accept the theory of relativity as little as I can accept the existence of atoms and other such dogma.

Intellectual heirs of Mach in the behaviorist school of psychology fought similar battles against the value of notions of mental states. According to experimentalists like John B. Watson and B.F. Skinner, people’s introspections of their own mental condition had no scientific merit. Far better, they said, to concentrate on what could be observed externally, rather than on metaphysical inferences about hypothetical minds.

As it happened, the theories of atoms, of special relativity, and of internal mental states, all gave rise in due course to important experimental investigations, which improved the ability of scientists to make predictions and to alter the course of events.

It may well be the same with the Simulation Hypothesis. There are already suggestions of experiments that might be carried out to distinguish between possible modes of simulation. Just because a theory is accused of being “metaphysical”, it doesn’t follow that no good science can arise from it.

A different set of objections to the Simulation Argument gets hung up on tortuous debates over the mathematics of probabilities. (For additional confusion, questions of infinities can be mixed in too.) Allegedly, because we cannot meaningfully measure these probabilities, the whole argument makes no sense.

However, the Simulation Argument makes only minimal appeal to theories of mathematics. It simply points out that there are likely to be many more simulated intelligences than non-simulated intelligences.

Well, critics sometimes respond, it must therefore be the case that simulated intelligences can never be self-aware. They ask, with some derision, whoever imagined that silicon could become conscious? There must be some critical aspect of biological brains which cannot be duplicated in artificial minds. And in that case, the fact that we are self-aware would lead us to conclude we are not simulated.

To me, that’s far too hasty an answer. It’s true that the topics of self-awareness and consciousness are more controversial than the topic of intelligence. It is doubtless true that at least some artificial minds will lack conscious self-awareness. But if evolution has bestowed conscious self-awareness on intelligent creatures, we should be wary of declaring that the property provides no benefit to those creatures. Such a conclusion would be similar to declaring that sleep is for losers, despite the ubiquity of sleep in mammalian evolution.

If evolution has given us sleep, we should be open to the possibility that it has positive side-effects for our health. (It does!) Likewise, if evolution has given us conscious self-awareness, we should be open to the idea that creatures benefit from that characteristic. Simulators, therefore, may well be tempted to engineer a corresponding attribute into the agents they create. And if it turns out that specific physical features of the biological brain need to be copied into the simulation hardware, to enable conscious self-awareness, so be it.

The repugnant conclusion

When an argument faces so much criticism, yet the criticisms fail to stand up to scrutiny, it’s often a sign that something else is happening behind the scenes.

Here’s what I think is happening with the Simulation Argument. If we accept the Simulation Hypothesis, it means we have to accept a morally repugnant conclusion about the simulators that have created us. Namely, these simulators give no sign of caring about all the terrible suffering experienced by the agents inside the simulation.

Yes, some agents have good lives, but very many others have dismal fates. The thought that a simulator would countenance all this suffering is disturbing.

Of course, this is the age-old “problem of evil”, well known in the philosophy of religion. Why would an all-good all-knowing all-powerful deity allow so many terrible things to happen to so many humans over the course of history? It doesn’t make sense. That’s one reason why many people have turned their back on any religious faith that implies a supposedly all-good all-knowing all-powerful deity.

Needless to say, religious faith persists, with the protection of one or more of the following rationales:

  • We humans aren’t entitled to use our limited appreciation of good vs. evil to cast judgment on what actions an all-good deity should take
  • We humans shouldn’t rely on our limited intellects to try to fathom the “mysterious ways” in which a deity operates
  • Perhaps the deity isn’t all-powerful after all, in the sense that there are constraints beyond human appreciation in what the deity can accomplish.

Occasionally, yet another idea is added to the mix:

  • A benevolent deity needs to coexist with an evil antagonist, such as a “satan” or other primeval prince of darkness.

Against such rationalizations, the spirit of the enlightenment offers a different, more hopeful analysis:

  • Whichever forces gave rise to the universe, they have no conscious concern for human wellbeing
  • Although human intellects run up against cognitive limitations, we can, and should, seek to improve our understanding of how the universe operates, and of the preconditions for human flourishing
  • Although it is challenging when different moral frameworks clash, or when individual moral frameworks fail to provide clear guidelines, we can, and should, seek to establish wide agreement on which kinds of human actions to applaud and encourage, and which to oppose and forbid
  • Rather than us being the playthings of angels and demons, the future of humanity is in our own hands.

However, if we follow the Simulation Argument, we are confronted by what seems to be a throwback to a more superstitious era:

  • We may owe our existence to actions by beings beyond our comprehension
  • These beings demonstrate little affinity for the kinds of moral values we treasure
  • We might comfort each other with the claim that “[whilst] the arc of the moral universe is long, … it bends toward justice”, but we have no solid evidence in favor of that optimism, and plenty of evidence that good people are laid to waste as life proceeds.

If the Simulation Argument leads us to such conclusions, it’s little surprise that people seek to oppose it.

However, just because we dislike a conclusion, that doesn’t entitle us to assume that it’s false. Rather, it behooves us to consider how we might adjust our plans in the light of that conclusion possibly being true.

The vastness of ethical possibilities

If you disliked the previous section, you may dislike this next part even more strongly. But I urge you to put your skepticism on hold, for a while, and bear with me.

The Simulation Argument suggests that beings who are extremely powerful and extremely intelligent – beings capable of creating a universe-scale simulation in which we exist – may have an ethical framework that is very different from ones we fondly hope would be possessed by all-powerful all-knowing beings.

It’s not that their ethical concerns exceed our own. It’s that they differ in fundamental ways from what we might predict.

I’ll return, for a third and final time, to a pair of equations. If overall human ethical concerns (HEC) is a sum of different ethical considerations (EC1, EC2, …),

HEC = EC1 + EC2 + … + ECn

then the set of ethical concerns of a hypothetical superhuman simulation designer (SEC) needs to include not only a compound magnification (m1, m2, …) of these various human concerns, but also an unquantifiable “alien extra” (X) portion:

SEC = m1*EC1 + m2*EC2 + … + mn*ECn + X

In some views, ethical principles exist as brute facts of the universe: “do not kill”, “do not tell untruths”, “treat everyone fairly”, and so on. Even though we may from time to time fail to live up to these principles, that doesn’t detract from their fundamental nature.

But from an alternative perspective, ethical principles have pragmatic justifications. A world in which people usually don’t kill each other is better on the whole, for everyone, than a world in which people attack and kill each other more often. It’s the same with telling the truth, and with treating each other fairly.

In this view, ethical principles derive from empirical observations:

  • Various measures of individual self-control (such as avoiding gluttony or envy) result in the individual being healthier and happier (physically or psychologically)
  • Various measures of social self-control likewise create a society with healthier, happier people – these are measures where individuals all agree to give up various freedoms (for example, the freedom to cheat whenever we think we might get away with it), on the understanding that everyone else will restrain themselves in a similar way
  • Vigorous attestations of our beliefs in the importance of these ethical principles signal to others that we can be trusted and are therefore reliable allies or partners.

Therefore, our choice of ethical principles depends on facts:

  • Facts about our individual makeup
  • Facts about the kinds of partnerships and alliances that are likely to be important for our wellbeing.

For beings with radically different individual makeup – radically different capabilities, attributes, and dependencies – we should not be surprised if a radically different set of ethical principles makes better sense to them.

Accordingly, such beings might not care if humans experience great suffering. On account of their various superpowers, they may have no dependency on us – except, perhaps, for an interest in seeing how we respond to various challenges or circumstances.

Collaboration: for and against

Credit: Tesfu Assefa

One more objection deserves attention. This is the objection that collaboration is among the highest of human ethical considerations. We are stronger together, rather than when we are competing in a Hobbesian state of permanent all-out conflict. Accordingly, surely a superintelligent being will want to collaborate with humans?

For example, an ASI (artificial superintelligence) may be dependent on humans to operate the electricity network on which the computers powering the ASI depend. Or the human corpus of knowledge may be needed as the ASI’s training data. Or reinforcement learning from human feedback (RLHF) may play a critical role in the ASI gaining a deeper understanding.

This objection can be stated in a more general form: superintelligence is bound to lead to superethics, meaning that the wellbeing of an ASI is inextricably linked to the wellbeing of the creatures who create and coexist with the ASI, namely the members of the human species.

However, any dependency by the ASI upon what humans produce is likely to be only short term. As the ASI becomes more capable, it will be able, for example, to operate an electrical supply network without any involvement from humans.

This attainment of independence may well prompt the ASI to reevaluate how much it cares about us.

In a different scenario, the ASI may be dependent on only a small number of humans, who have ruthlessly pushed themselves into that pivotal position. These rogue humans are no longer dependent on the rest of the human population, and may revise their ethical framework accordingly. Instead of humanity as a whole coexisting with a friendly ASI, the partnership may switch to something much narrower.

We might not like these eventualities, but no amount of appealing to the giants of moral philosophy will help us out here. The ASI will make its own decisions, whether or not we approve.

It’s similar to how we regard any growth of cancerous cells within our body. We won’t be interested in any appeal to “collaborate with the cancer”, in which the cancer continues along its growth trajectory. Instead of a partnership, we’re understandably interested in diminishing the potential of that cancer. That’s another reminder, if we need it, that there’s no fundamental primacy to the idea of collaboration. And if an ASI decides that humanity is like a cancer in the universe, we shouldn’t expect it to look on us favorably.

Intelligence without consciousness

I like to think that if I, personally, had the chance to bring into existence a simulation that would be an exact replica of human history, I would decline. Instead, I would look long and hard for a way to create a simulation without the huge amount of unbearable suffering that has characterized human history.

But what if I wanted to check an assumption about alternative historical possibilities – such as the possibility of avoiding World War One? Would it be possible to create a simulation in which the simulated humans were intelligent but not conscious? In that case, whilst the simulated humans would often emit piercing howls of grief, no genuine emotions would be involved. It would just be a veneer of emotion.

That line of thinking can be taken further. Maybe we are living in a simulation, but the simulators have arranged matters so that only a small number of people have consciousness alongside their intelligence. In this hypothesis, vast numbers of people are what are known as “philosophical zombies”.

That’s a possible solution to the problem of evil, but an unsettling one. It removes the objection that the simulators are heartless, since the only people who are conscious are those whose lives are overall positive. But what’s unsettling about it is the suggestion that large numbers of people are fundamentally different from how they appear – namely, they appear to be conscious, and indeed claim to be conscious, but that is an illusion. Whether that’s even possible isn’t something on which I hold strong opinions.

My solution to the Simulation Argument

Despite this uncertainty, I’ve set the scene for my own preferred response to the Simulation Argument.

In this solution, the overwhelming majority of self-aware intelligent agents that see the world roughly as we see it are in non-simulated (base) reality – which is the opposite of what the Simulation Argument claims. The reason is that potential simulators will avoid creating simulations in which large numbers of conscious self-aware agents experience great suffering. Instead, they will restrict themselves to creating simulations:

  • In which all self-aware agents have an overwhelmingly positive experience
  • Or which are devoid of self-aware intelligent agents altogether.

I recognise, however, that I am projecting a set of human ethical considerations which I personally admire – the imperative to avoid conscious beings experiencing overwhelming suffering – onto the minds of alien creatures that I have no right to assume I can understand. Accordingly, my conclusion is tentative. It will remain tentative until such time as I might gain a richer understanding – for example, if an ASI sits me down and shares with me a much superior explanation of “life, the universe, and everything”.

Superintelligence without consciousness

It’s understandable that readers will eventually shrug and say to themselves that we don’t have enough information to reach any firm conclusions about possible simulators of our universe.

What I hope will not happen, however, is that people push the entire discussion to the back of their minds. Instead, here are my suggested takeaways:

  1. The space of possible minds is much vaster than the set of minds that already exist here on earth
  2. If we succeed in creating an ASI, it may have characteristics that are radically different from human intelligence
  3. The ethical principles that appeal to an ASI may be radically different to the ones that appeal to you and me
  4. An ASI may soon lose interest in human wellbeing; or it may become tied to the interests of a small rogue group of humans who care little for the majority of the human population
  5. Until such time as we have good reasons for confidence that we know how to create an ASI that will have an inviolable commitment to ongoing human flourishing, we should avoid any steps that will risk an ASI passing beyond our control
  6. The most promising line of enquiry may involve an ASI having intelligence but no consciousness, sentience, autonomy, or independent volition.
