First genetic sequencing of Brazilian pit viper is completed

Publicado em 17 agosto 2021

Researchers describe the genome of Bothrops jararaca and suggest the origin of genes responsible for toxins in its venom. Credit: Rafael Marques Porto/Instituto Butantan

A group led by researchers at Butantan Institute and funded by São Paulo Research Foundation—FAPESP has completed the first sequencing of a Brazilian snake's genome. The study is reported in an article published in the journal PNAS. It suggests that the nine genes that encode toxins produced by the jararaca pit viper Bothrops jararaca probably originated in genes that had different functions in the ancestral species.

"In sequencing the snake's genome, we identified markers that enabled us to compare toxin genes with genes in the same position in the genomes of other animals, such as snakes without venom, lizards and amphibians. We found nine of the 12 toxin genes in the jararaca to be highly similar to those occupying the same position in the DNA of these other species. We concluded that most of the toxin genes probably arose from elements that already existed in the same part of the genome of the ancestor common to all these animals," said Inácio Junqueira de Azevedo, a researcher at Butantan Institute and last author of the article.

Azevedo is a principal investigator at the Center for Research on Toxins, Immune Response and Cell Signaling (CeTICS), one of the Research, Innovation and Dissemination Centers (RIDCs) funded by FAPESP.

"These genes had physiological functions in the common ancestor of all these species. At some point they probably began playing a role similar to toxin genes or were selected for this route and lost their original functions. Our study located elements that will help scientists understand the evolution of toxins and the mechanisms that led to the recruitment of certain genes to perform this new function in the production of venom," said Diego Dantas Almeida, first author of the article. The study was conducted during his Ph.D. research, which was supported by FAPESP.

The sequencing also showed that two genes that encode important toxins probably originated in duplication. In any organism, a gene normally evolves more freely and ends up performing different functions when a copy performs its original functions.

In the jararaca, the copies must have come under selective pressure to produce two families of toxins that account for most of the venom's action: snake venom metalloproteinases (SVMPs) and phospholipases A2 (PLA2). Most of the genes that encode toxins in this snake were already thought to have originated in this manner. It was impossible to determine the origin of only one of the 12 gene families that encode its toxins.

"We were able to show that non-toxic 'ancestral' genes exist in these two families. They're still present in the DNA, right alongside the toxin genes. The ancestral genes have completely disappeared from the other families. They've probably been transformed into toxin genes," said Vincent Louis Viala, a co-author of the article and former recipient of a postdoctoral fellowship from FAPESP.

Research effort

The Butantan Institute group began sequencing this snake's genome in 2013. B. jararaca is responsible for a large proportion of snake-bite accidents in Brazil and is one of the most studied snakes for this reason. The sequencing produced the fundamental information on the origins of its venom they had lacked hitherto.

In addition to enhancing knowledge of the genes in an organism, sequencing its genome assembles them in the right order. This is one of the most complex parts of the task because sequencing generates a vast amount of data, which has to be processed using computational tools.

Only in recent years, after combining several methods, did the group succeed in assembling the genome satisfactorily, with the collaboration of researchers at Ohio State University in the United States. The complete genome sequence is available online to anyone who wishes to study it.

The project produced answers to several other key questions. In 2009, an analysis by Japanese researchers of certain toxin genes from Protobothrops flavoviridis, which belongs to the same family as the jararaca (Viperidae), suggested the gene that encodes the toxin VEGF-F, also present in the Brazilian snake, probably resulted from duplication of the gene VEGF-A. The Brazilian group have now shown that it is more likely to have originated in a different gene family known in the scientific literature as "PGF-like".

The group also assembled more evidence that bradykinin-potentiating peptides (BPP), which are the basis for the anti-hypertensive drug captopril, probably originated in the gene CNP, which encodes C-type natriuretic peptides present in other vertebrates, including humans.

"The study illustrates the need to identify the context in which genes are inserted in order to understand their origin and evolution correctly," Azevedo said.

The researchers are now working on more refined versions of the jararaca's genome and those of other venomous snake families, hoping to find novel toxins and link them to proteins of relevance in the physiology of other organisms. ... viper.html

Journal Reference:

Diego Dantas Almeida et al, Tracking the recruitment and evolution of snake toxins using the evolutionary context provided by the Bothrops jararaca genome, Proceedings of the National Academy of Sciences (2021). DOI: 10.1073/pnas.2015159118


Venom is a key adaptive innovation in snakes, and how nonvenom genes were co-opted to become part of the toxin arsenal is a significant evolutionary question. While this process has been investigated through the phylogenetic reconstruction of toxin sequences, evidence provided by the genomic context of toxin genes remains less explored. To investigate the process of toxin recruitment, we sequenced the genome of Bothrops jararaca, a clinically relevant pitviper. In addition to producing a road map with canonical structures of genes encoding 12 toxin families, we inferred most of the ancestral genes for their loci. We found evidence that 1) snake venom metalloproteinases (SVMPs) and phospholipases A2 (PLA2) have expanded in genomic proximity to their nonvenomous ancestors; 2) serine proteinases arose by co-opting a local gene that also gave rise to lizard gilatoxins and then expanded; 3) the bradykinin-potentiating peptides originated from a C-type natriuretic peptide gene backbone; and 4) VEGF-F was co-opted from a PGF-like gene and not from VEGF-A. We evaluated two scenarios for the original recruitment of nontoxin genes for snake venom: 1) in locus ancestral gene duplication and 2) in locus ancestral gene direct co-option. The first explains the origins of two important toxins (SVMP and PLA2), while the second explains the emergence of a greater number of venom components. Overall, our results support the idea of a locally assembled venom arsenal in which the most clinically relevant toxin families expanded through posterior gene duplications, regardless of whether they originated by duplication or gene co-option.