14.05.2023

A natural language model initiates protein design by creating active enzymes.

The advantage of artificial protein sequence mutation was not enough to stop it working and partially resembled some known natural protein. 

The AI managed to learn about enzymes’ shapes by studying the crude arrangement data. The atomic structures were similar, the arrangement was different. 

The ProGen algorithm was trained similarly to the English language text program was.

Their background taught them that AI could teach itself grammar, lexicology, writing and grammar.

When the system in hand knows a lot about data, it is very capable of learning structure and rules, according to Nikhil Naik, Ph.D. The system learns that words can co-occur and compositionality.

Proteins open limitless opportunities. Lysozymes are smaller than proteins, but there are 20,300 possible combinations next to the 20 amino acids, which is a lot. 

AI- based enzymes show efficacy identical to natural enzymes, even if unnaturally created amino acids arrangement are different from any available natural protein. 

According to the experiment, the natural language processing for the purposes of text reading, writing can take hold of biological fundamentals. ProGen, AI algorithm uses next-token prediction to make artificial proteins out of amino acid arrangements.

Scholars say, the new technology is potentially more powerful than directed evolution. It can energize the long-standing protein production by speeding manufacturing of new proteins, used in virtually every field. 

According to James Fraser from UCSF School of Pharmacy, the artificial composition shows better results than the evolutionary process. 

The language model differs from the normal evolutionary process, although it learns aspects of evolution. We can program those properties for specific effects, like thermostability or acid resistance.

The learning machine has been fed by a large number of amino acid sequences of different kinds and has been left for a few weeks. The next step was coupling those with 56,000 arrangements from 5 lysozyme families with some information about the proteins.

Once the model had generated a million of sequences, the research team picked 100 for a test, based on their proximity to the arrangements of natural proteins.

The first part of 100 proteins became a part of 5 artificial proteins for testing and comparison to enzymes in the chicken eggs whites. Similar substances are in human tears, saliva, milk fighting with bacteria (HEWL).

Two artificial enzymes managed to break into the bacteria cell wall similar to the activity with HEWL with sequence similarity no more than 18%. The artificial arrangements constituted 70 and 90% identity to natural ones.

Yasmin Anderson

AI Catalog's chief editor

Share on social networks:

Similar news

Stay up to date with the latest news and developments in AI tools at our AI Catalog. From breakthrough innovations to industry trends, our news section covers it all.

29.05.2023

Fashion Brands use AI to create a variety of models. To complete the idea of the diff...

30.05.2023

Country’s Spring Budget is directed towards supporting the AI industry. In the recent...

30.05.2023

Facial recognition tool Clearview AI has revealed that it reached almost a million sea...