SCI ‘It will change everything’: DeepMind’s AI makes gigantic leap in solving protein structures

Melodi

Disaster Cat
I ran this by Nightwolf because I wasn't sure what it was - after he read it he said: "Probably the most important discovery in medicine since the finding of cellular structures or the discovery of DNA..."

He also said that like when DNA was discovered, this is just so amazing and has so many implications that probably most researchers don't have a clue what exactly to do with it yet.

In a cliff notes version - this process gives medical researchers the ability to take any protein and get the "structure" so they can build it, Nightwolf said it was liking have the lego frame with instructions and even numbers on each piece and a diagram on how to connect them.

Although, the way the on-line media were suggesting this morning it was "discovered" by an AI, and I asked Nightwolf, "So who gets the noble prize, the AI or the scientist who programmed it?"....Melodi


NEWS
30 NOVEMBER 2020

‘It will change everything’: DeepMind’s AI makes gigantic leap in solving protein structures
Google’s deep-learning program for determining the 3D shapes of proteins stands to transform biology, say scientists.

Ewen Callaway


T1037, part of a protein from (Cellulophaga baltica crAss-like) phage phi14:2, a virus that infects bacteria.

A protein’s function is determined by its 3D shape.Credit: DeepMind
An artificial intelligence (AI) network developed by Google AI offshoot DeepMind has made a gargantuan leap in solving one of biology’s grandest challenges — determining a protein’s 3D shape from its amino-acid sequence.
DeepMind’s program, called AlphaFold, outperformed around 100 other teams in a biennial protein-structure prediction challenge called CASP, short for Critical Assessment of Structure Prediction. The results were announced on 30 November, at the start of the conference — held virtually this year — that takes stock of the exercise.
“This is a big deal,” says John Moult, a computational biologist at the University of Maryland in College Park, who co-founded CASP in 1994 to improve computational methods for accurately predicting protein structures. “In some sense the problem is solved.”

AI protein-folding algorithms solve structures faster than ever
The ability to accurately predict protein structures from their amino-acid sequence would be a huge boon to life sciences and medicine. It would vastly accelerate efforts to understand the building blocks of cells and enable quicker and more advanced drug discovery.
AlphaFold came top of the table at the last CASP — in 2018, the first year that London-based DeepMind participated. But, this year, the outfit’s deep-learning network was head-and-shoulders above other teams and, say scientists, performed so mind-bogglingly well that it could herald a revolution in biology.
“It’s a game changer,” says Andrei Lupas, an evolutionary biologist at the Max Planck Institute for Developmental Biology in Tübingen, Germany, who assessed the performance of different teams in CASP. AlphaFold has already helped him find the structure of a protein that has vexed his lab for a decade, and he expects it will alter how he works and the questions he tackles. “This will change medicine. It will change research. It will change bioengineering. It will change everything,” Lupas adds.
In some cases, AlphaFold’s structure predictions were indistinguishable from those determined using ‘gold standard’ experimental methods such as X-ray crystallography and, in recent years, cryo-electron microscopy (cryo-EM). AlphaFold might not obviate the need for these laborious and expensive methods — yet — say scientists, but the AI will make it possible to study living things in new ways.
The structure problem
Proteins are the building blocks of life, responsible for most of what happens inside cells. How a protein works and what it does is determined by its 3D shape — ‘structure is function’ is an axiom of molecular biology. Proteins tend to adopt their shape without help, guided only by the laws of physics.
For decades, laboratory experiments have been the main way to get good protein structures. The first complete structures of proteins were determined, starting in the 1950s, using a technique in which X-ray beams are fired at crystallized proteins and the diffracted light translated into a protein’s atomic coordinates. X-ray crystallography has produced the lion’s share of protein structures. But, over the past decade, cryo-EM has become the favoured tool of many structural-biology labs.
Scientists have long wondered how a protein’s constituent parts — a string of different amino acids — map out the many twists and folds of its eventual shape. Early attempts to use computers to predict protein structures in the 1980s and 1990s performed poorly, say researchers. Lofty claims for methods in published papers tended to disintegrate when other scientists applied them to other proteins.
Moult started CASP to bring more rigour to these efforts. The event challenges teams to predict the structures of proteins that have been solved using experimental methods, but for which the structures have not been made public. Moult credits the experiment — he doesn’t call it a competition — with vastly improving the field, by calling time on overhyped claims. “You’re really finding out what looks promising, what works, and what you should walk away from,” he says.
Infographic: Structure solver. DeepMind's AlphaFold 2 algorithm outperformed other teams at the CASP14 protein folding contest.

Source: DeepMind
DeepMind’s 2018 performance at CASP13 startled many scientists in the field, which has long been the bastion of small academic groups. But its approach was broadly similar to those of other teams that were applying AI, says Jinbo Xu, a computational biologist at the University of Chicago, Illinois.
The first iteration of AlphaFold applied the AI method known as deep learning to structural and genetic data to predict the distance between pairs of amino acids in a protein. In a second step that does not invoke AI, AlphaFold uses this information to come up with a ‘consensus’ model of what the protein should look like, says John Jumper at DeepMind, who is leading the project.
The team tried to build on that approach but eventually hit the wall. So it changed tack, says Jumper, and developed an AI network that incorporated additional information about the physical and geometric constraints that determine how a protein folds. They also set it a more difficult, task: instead of predicting relationships between amino acids, the network predicts the final structure of a target protein sequence. “It’s a more complex system by quite a bit,” Jumper says.
Startling accuracy
CASP takes place over several months. Target proteins or portions of proteins called domains — about 100 in total — are released on a regular basis and teams have several weeks to submit their structure predictions. A team of independent scientists then assesses the predictions using metrics that gauge how similar a predicted protein is to the experimentally determined structure. The assessors don’t know who is making a prediction.

The computational protein designers
AlphaFold’s predictions arrived under the name ‘group 427’, but the startling accuracy of many of its entries made them stand out, says Lupas. “I had guessed it was AlphaFold. Most people had,” he says.
Some predictions were better than others, but nearly two-thirds were comparable in quality to experimental structures. In some cases, says Moult, it was not clear whether the discrepancy between AlphaFold’s predictions and the experimental result was a prediction error or an artefact of the experiment.
AlphaFold’s predictions were poor matches to experimental structures determined by a technique called nuclear magnetic resonance imaging, but this could be down to how the raw data is converted into a model, says Moult. The network also struggles to model individual structures in protein complexes, or groups, whereby interactions with other proteins distort their shapes.
Overall, teams predicted structures more accurately this year, compared with the last CASP, but much of the progress can be attributed to AlphaFold, says Moult. On protein targets considered to be moderately difficult, the best performances of other teams typically scored 75 on a 100-point scale of prediction accuracy, whereas AlphaFold scored around 90 on the same targets, says Moult.
About half of the teams mentioned ‘deep learning’ in the abstract summarizing their approach, Moult says, suggesting that AI is making a broad impact on the field. Most of these were from academic teams, but Microsoft and the Chinese technology company Tencent also entered CASP14.
Mohammed AlQuraishi, a computational biologist at Columbia University in New York City and a CASP participant, is eager to dig into the details of AlphaFold’s performance at the contest, and learn more about how the system works when the DeepMind team presents its approach on 1 December. It’s possible — but unlikely, he says — that an easier-than-usual crop of protein targets contributed to the performance. AlQuraishi’s strong hunch is that AlphaFold will be transformational.
“I think it’s fair to say this will be very disruptive to the protein-structure-prediction field. I suspect many will leave the field as the core problem has arguably been solved,” he says. “It’s a breakthrough of the first order, certainly one of the most significant scientific results of my lifetime.”
British artificial intelligence scientist and entrepreneur Demis Hassabis, 2019.

Demis Hassabis, DeepMind’s chief executive, says that the company is learning what biologists want from AlphaFold.Credit: OLI SCARFF/AFP/Getty
Faster structures
An AlphaFold prediction helped to determine the structure of a bacterial protein that Lupas’s lab has been trying to crack for years. Lupas’s team had previously collected raw X-ray diffraction data, but transforming these Rorschach-like patterns into a structure requires some information about the shape of the protein. Tricks for getting this information, as well as other prediction tools, had failed. “The model from group 427 gave us our structure in half an hour, after we had spent a decade trying everything,” Lupas says.
Demis Hassabis, DeepMind’s co-founder and chief executive, says that the company plans to make AlphaFold useful so other scientists can employ it. (It previously published enough details about the first version of AlphaFold for other scientists to replicate the approach.) It can take AlphaFold days to come up with a predicted structure, which includes estimates on the reliability of different regions of the protein. “We’re just starting to understand what biologists would want,” adds Hassabis, who sees drug discovery and protein design as potential applications.
In early 2020, the company released predictions of the structures of a handful of SARS-CoV-2 proteins that hadn’t yet been determined experimentally. DeepMind’s predictions for a protein called Orf3a ended up being very similar to one later determined through cryo-EM, says Stephen Brohawn, a molecular neurobiologist at the University of California, Berkeley, whose team released the structure in June. “What they have been able to do is very impressive,” he adds.
Real-world impact
AlphaFold is unlikely to shutter labs, such as Brohawn’s, that use experimental methods to solve protein structures. But it could mean that lower-quality and easier-to-collect experimental data would be all that’s needed to get a good structure. Some applications, such as the evolutionary analysis of proteins, are set to flourish because the tsunami of available genomic data might now be reliably translated into structures. “This is going to empower a new generation of molecular biologists to ask more advanced questions,” says Lupas. “It’s going to require more thinking and less pipetting.”
“This is a problem that I was beginning to think would not get solved in my lifetime,” says Janet Thornton, a structural biologist at the European Molecular Biology Laboratory-European Bioinformatics Institute in Hinxton, UK, and a past CASP assessor. She hopes the approach could help to illuminate the function of the thousands of unsolved proteins in the human genome, and make sense of disease-causing gene variations that differ between people.
AlphaFold’s performance also marks a turning point for DeepMind. The company is best known for wielding AI to master games such Go, but its long-term goal is to develop programs capable of achieving broad, human-like intelligence. Tackling grand scientific challenges, such as protein-structure prediction, is one of the most important applications its AI can make, Hassabis says. “I do think it’s the most significant thing we’ve done, in terms of real-world impact.”
doi: https://doi.org/10.1038/d41586-020-03348-4
 

mzkitty

I give up.
Well......... soon the pseudo-humans. It will figure out intelligence in humans, and then apply proteins and viola'

:lol:


AlphaFold’s performance also marks a turning point for DeepMind. The company is best known for wielding AI to master games such Go, but its long-term goal is to develop programs capable of achieving broad, human-like intelligence. Tackling grand scientific challenges, such as protein-structure prediction, is one of the most important applications its AI can make, Hassabis says. “I do think it’s the most significant thing we’ve done, in terms of real-world impact.”
 

Melodi

Disaster Cat
Ah, but you see they are combining their stories, they want the public to focus on the AI aspects of this story as in "gee aren't they wonderful 'they' will change everything."

Except what is really going on here, if I understood Nightwolf correctly, is that one of the most important discoveries in the history of modern medicine has just been made by a big "computer" that was PROGRAMED and CREATED by a human being (or a team of human beings).

The Discovery has nothing to do with AI, other than it was an "AI" that was used to discover it - do you remember ages ago, oh about five whole years when instead of AI this would have said:

"Dr. So and So and his team using a SUPERCOMPUTER discovered that..blah, blah blah..."

Except now, we don't even get the name of the scientists (likey to get a noble prize(s) until further down the page, it was an AI or as this article calls him: Deep Mind.

So my question to Nightwolf was:

Does the scientific team that works on this project get the noble prize or does the AI/Supercomputer?

------------------------
Meanwhile, Nightwolf said this actual discovery has vast potential applications from the treating of some genetic diseases to 3D printed organs or new ways of making vaccines. He said there were so many things possible researchers probably don't even quite know where to start.

Like all technologies, he pointed out it can probably do amazingly wonderful things and some pretty horrific things that he could think up off the top of his head.

But the news media wants you to think AI is Wonderful! AI discovers things! Google is Good!...etc..
 

BassMan

Veteran Member
I saw this too: awesome!

What blows my mind, is that at the level of DNA, RNA, proteins, etc, to "photograph" something, you need an electron microscope, and pretty-much everything is blurry, because you are down to the quantum level.
 

Faroe

Un-spun
Definitely enormous amount of applications. Proteins aren't like sugars or fats with a simple ring structure with a few off-shoots, or three arms stuck together at one end. They are sort of the workhorses of biology for structure and catalysts, yet they are huge complicated molecules twisted up in snarled tangles. Sort of like unraveling a dreadlock?

The AI part strikes me as hype - how does Deep Mind's Alpha Fold program differ from a "super computer," or quantum computer?? I can't say, except for maybe the applied learning part, but supposedly IBM's chess master Deep Blue was doing that twenty years ago? I didn't find anything specific to that in the article. This seems like more of a problem of raw number crunching of probabilities - try every possible combination for the tangled protein until you hit on one that works. Humans don't have the life span to find these solutions, so the computer compresses the time by calculating out all the possibilities at once. Not getting what is "human-like" in that - the computer works differently. The program's success is a breakthrough, but I think the programmers are trying to make this achievement more philosophically profound than it actually is.

Anyway, feels like First World society is about to blow up, so who knows if anyone will get to use it. We'll see.
 
Last edited:
Top