**1.1 Eukaryotic ribosome**

A eukaryote ribosome where Plasmodium species falls is designated as 80S and contains two subunits. The smaller subunit 40s is comprised of 18 s rRNA and 33 proteins whereas the large subunit 60S (**Figure 1**) is comprised of the 28S, 5S and 5.8S rRNA and 49 proteins [7–9].

X-ray crystallography and Cryo-electron microscopy methods have been used to solve the three dimensional structures of the ribosome with or without complexed cofactors examples be tRNA, mRNA among other macro molecules successfully [10]. The very first models of eukaryotic ribosome's at resolution between 6.1 and 15 Å were provided by Cryo-electron microscopy, revealed the location and the shapes of the RNA expansion segments and indicated the position of additional protein moieties [11–13]. Later, a crystalline structure of the complete eukaryotic ribosome from *Saccharomyces cerevisiae* was determined at 4.15 Å and later 3.0 Å resolutions [9, 14]. These structures with more clarity gave more insights on understanding the process of translation which captured the ratcheted states of the ribosome which had been postulated over 40 years [15, 16]. Recently the overall crystal structure of the eukaryotic ribosome of *S. cerevisiae* obtained revealed basic architectural similarity, but the larger assembly compared with the prokaryotic counterpart [14].

In addition this structure shows the E-site, A-site, the ribosomal proteins of both the 60S and the 40S subunits together with the expansion segments which gave more knowledge about eukaryotic protein synthesis process. This followed through earlier studies that were done and showed both interfaces of the 60S and 40S subunits views with numbered bridges [17].

### **1.2 18S rRNA structure**

RNA molecules are polymers of nucleotides comprised of 3′-5′ phosphor-diester linked ribose sugars attached to the four bases, pyrimidines: cytosine and uracil and

**Figure 1.**

*A graphical presentation of the components of the 40S ribosomal subunit of* Plasmodium falciparum.

*rRNA Platform Technology for Drug Discovery Methods for Identifying Ligands That Target… DOI: http://dx.doi.org/10.5772/intechopen.98776*

purines: adenine and guanine in contrast with the DNA [18]. RNA is single stranded containing ribose sugars and uracil base in place of thymine.

While the G-U wobble pair is somewhat different in shape. Base pairing occurs in both DNA and RNA using the hydrogen bonds formed this pairing patterns are governed by Watson and crick rules [19], where adenine complements uracil in RNA (thymine in DNA) and guanine with cytosine in both known as conical base pairing. In places where this pairing does not happen it's known as non-Watson crick structures among these are the sheared GA, GA imino, AU reverse Hoogsteen, and the GU and AC wobble pairs (**Figure 2**) [19, 20]. Determining the RNA secondary structure is the first step of understanding its mechanism of action which is defined by the canonical base paring [21]. The secondary structure of the RNA can adopt elements such as internal loops, mismatches bulges, multi-branched junctions, hairpins and pseudo knots (**Figure 3**).

#### **Figure 2.**

*Watson–Crick G-C and A-U base pairs with a similar angle of ~54. The G-U "wobble base pair" is also shown [9, 18].*

#### **Figure 3.** *RNA secondary structure motifs showing representation of Watson crick base paring [9, 18].*

Presence of these diverse motifs that can be adopted by the RNA makes it play its functional roles such as recognition, interaction, metal binding and other enzymatic activities [20]. The 18S fold primarily defines the eukaryotic 40S subunit structure which can be divided into features of the small ribosomal subunit including the head, platform, body, beak, shoulder, right foot and left foot (**Figure 4A**) [22]. The secondary structure of the eukaryotic ribosome forms a structure with four major domains which are named according to the region of the sequence from 5′ to the 3′ end. They are 5'major, central, 3'major, and finally 3'minor (**Figure 4B**).

**Figure 2** Watson–Crick G-C and A-U base pairs with a similar angle of ~54. The G-U —Wobble base pair‖ is also shown [20]. While the G-U wobble pair is somewhat different in shape. **Figure 3** RNA secondary structure motifs showing representation of Watson crick base paring [20].

The 18S rRNA is composed of a region homologous to the prokaryotic 16S rRNA with several eukaryotic specific ESs [23] as shown in **Figure 5**. The ESs helical elements in eukaryotes display variable lengths; however their architecture is found to be preserved [22]. Both segments of ES3 are located on the 5′ domain, ES6 and ES7do form insertions in the central domain, and ES9 and ES12 are found in the 3′ major and 3'minor domain of the 18S RNA, respectively [22].

Eukaryotic 18S rRNA ES6, is the longest consisting of ~250 nucleotides which form five helices that replace the bacterial helix 21. ES6 is inserted between h20

*rRNA Platform Technology for Drug Discovery Methods for Identifying Ligands That Target… DOI: http://dx.doi.org/10.5772/intechopen.98776*

#### **Figure 4.**

*Architectural tertiary structure of* Plasmodium falciparum *40S front and back view. The 18S rRNA is colored differently depending with domains (5'major –red, central-yellow, 3'major-blue and 3'minor- green). Also shown are the 40S ribosomal 34 proteins of* Plasmodium falciparum *interacting with the 18S rRNA to make the total subunit.*

and h22 in the lower back region of the 40S body. A, B, C, and D helices form a large portion of the back of the 40S body and their structure differs considerably from a previous model [24]. C and D Helices are located at an equivalent position to bacterial h21. They stretch across the back of 40S and are buried underneath helix A.ES6B apical loop region is exposed and disordered in the structure, which makes it prone to chemical modification and cleavage by nucleases, as previously observed [25]. The loop region ES3B forms a base pair with helix ES6E loop region which yields to an extended helix which leads to projection from the center of the back toward the left foot of the 40S. Earlier demonstration by computational and biochemical experiments showed this rather unusual quaternary interactions.

The left foot along with ES6E is formed when the ES3B apical region packs against ES3A. ES3 and ES6 quaternary eukaryotic interaction, together with several proteins form a new domain, responsible for more prominent left foot features of the 40S and a broader back. Situated directly below the bleak is helix h16, which is shifted relative to the position in bacteria by as much as 40 Å [25].

In the 40S crystal structure the position of h16 is consistence with those observed in the solution by Cryo-EM of the empty yeast 40S, the 40S-eIF1 or40SeIF1A complex [26], and the canine and *Thermomyces lanuginosus* 80S [24, 27]. Formation of a connection between the head and the body of the 40S subunit involves h16 upon binding of the initiation factors eIF1 and eIF1A, which might point to a role of this helix in initiation [26].

#### **1.3 RNA as a drug target**

The pharmaceutical industries together with the researches have always forecast their efforts more on protein, rather than the nucleic acids as suitable targets of drugs. But the acceleration and advances of studies on the RNA synthesis, structure determination and therapeutic target identification has blown open the question of using RNA as a drug target a very genuine area [28]. In this age due to the wealth of the three dimensional structure of the RNA in various repositories, it is possible they could be used for drug design by observing the target design. It has been a

#### **Figure 5.**

*18S rRNA secondary structure of* Plasmodium falciparum *showing the four expansion segment regions (ES) located at the central and 3'major domain [9].*

difficult task in the past to obtain a three-dimensional structure for large RNAs using the crystal and magnetic resonance experimental techniques, which was just restricted to very small fragments [28–30] but newer techniques and more input to the older experimental methods are leading to accurate much larger structures. The RNA on its chemical basis does not show a promising drug target in that it's made up of four different planer bases and negatively charged nucleus [29, 31]. But upon the RNA adopting its conformational architecture, that shows the presence of cavities and pockets which could bind to shape specific rather than sequence specific molecules [31]. The RNA cavities compels the phosphate groups to be in close proximity, that lead to an intensified importance of tightly bound water molecules, electrostatic forces and ions, magnesium divalent ions in particular that can be partly dehydrated [30, 32]. Presence of non-Watson Crick pairs and bulged residues

*rRNA Platform Technology for Drug Discovery Methods for Identifying Ligands That Target… DOI: http://dx.doi.org/10.5772/intechopen.98776*

leads the formation of pockets and enlarged grooves that infer function through assembly of existing RNA motifs which are architecturally clearly diverse [30, 33]. The secondary structure of the RNA consist regular double stranded helices that gives it most of its energy content which in turn folds to subsequent three dimensional structure free energy content of between 5-10kcalmol [34]. This gives a Nano molar range binding constant which can be achieved for a small molecule usable to compete with the final step of RNA folding [30, 35]. From the discussed above projections with a number of ways of inserting non Watson crick pairs within helices is very limited leading to the restriction in the number of RNA motifs that are picked as drug target [30]. RNA motifs appear as decreasing size placed one inside the other, with the smaller motif associated with the larger motif [36, 37]. It has been shown that some antibiotics bind structurally in different regions of RNA molecules in the ribosome, such as streptomycin binding to the shallow groove, hygromycin binds to the deep groove of the helix, aminoglycosides at three adenine bulges and macrolides in the tunnel of the nascent polypeptide chain [8, 38–40].

Streptomycin interacts with only phosphate groups of many [40]. In conclusion there are various other ways that ligands interact with the RNA that are not known yet and the growth of a rich three dimensional structural of both RNA and ligands leads to newer technique of drug discovery that may include binding and docking experiments of the structures to obtain newer and stronger intervenes.

## **2. Methodology**

Functional RNA elements have specific RNA sequences and structures that work together as a functional unit. From the perspective of bioinformatics, homology-based searching for both conserved sequences and secondary structures is effective at finding structured RNA motifs [33]. These strategies to determine RNA structures provide the means to investigate the potential of these RNA motifs as drug targets. Although drug development can be accomplished without knowing how a compound works in the cell, hit-to-lead optimization (during which small molecule hits from a high-throughput screen undergo optimization in order to identify promising lead compounds) is greatly facilitated if the target is known [34].

Here, we present a set of in-silico based methods for developing reliable atomic level 3D-models of RNA structures efficiently and cheaply, using sequence information, to accelerate the drug discovery process for plasmodium. Our methodology employs molecular modeling and structure based drug design approach in which we use the 3D experimental ribosome structures information of Plasmodium to identify RNA motifs that could be potential drug targets. Using the information of the rRNA structure and movement we have identified and docked a set of promising anti-infective-like compounds that target biologically functional ribosomal RNA motifs that could be potential drugs against malaria.

#### **2.1 RNA tertiary structure prediction**

To determine the spatial structure of RNA, researchers can use experimental techniques, such as bio-crystallography or NMR spectroscopy. However, the experimental techniques, are tiresome, expensive, and require specialized equipment [41]. An alternative to the experimental techniques are computer modeling methods. Although the computer modeling methods are not as accurate as mentioned above experiments, they can be successfully used to investigate the function and mechanism of action of the RNA molecules [42, 43].

Therefore, there is a need for computational methods that can provide reliable models of RNA structures efficiently and cheaply, using only information on a nucleotide sequence [44]. The goal of computational structural bioinformatics is not to replace experimental techniques, but to compliment them especially when the answer for scientific questions are beyond their reach [45]. Unfortunately, despite the fact that computational methods are being continuously improved, they not always predict the correct structures of RNA [44].

The secondary structure determination (or prediction) is often the starting point for the spatial (3D) structure determination of RNA [46]. Methods for predicting tertiary structure of RNA are generally represented in two categories: The first category comprising of the methods that are based on the laws of physics known as ab initio modeling and the second category comprising of the methods based on experimental data.

Methods based on experimental data are further divided to either those that extrapolate knowledge of experimentally solved structures known as de novo modeling and the methods that extrapolate the fragments of already solved structures which include assembly-based methods, comparative or homology modeling and manual building structures based on figments [47–51].

Our methodology focused on using the homology and de novo prediction modeling methods of RNA 3D structure development to produce the 3D structure of the ribosomal RNA (rRNA) of the plasmodium. The implementation was done using ModeRNA [50], RNA123 [20], SimRNA [52, 53]. Any refinement was done by QRNAS software [54].

#### *2.1.1 Modeling of the rRNA model*

The ribosomal core model is used as an anchor point for modeling the expansion segments and variable regions. RNA sequences from the structure based sequence alignment were loaded in FASTA format and the cleaned template structure also loaded. The data is submitted for structure modeling as shown by **Figure 6**.

The modeling proceeds by getting a sequence which is obtained from atomic coordinates in one-letter-code, getting a secondary structure by extracting the 2D structure from 3D and return it in the Vienna format and finally analyzing the geometry to check whether the residues of the structure have any unusual features like strange bond and angle values.

#### *2.1.2 Structure evaluation*

The WebRASP server is used to compute energy scores for assessing the stability of the modeled RNA structure. The server receives as input a PDB file containing the atomic coordinates of the RNA structure and calculates the energy profile and total energy score of the molecule. The server displays the results graphically and the visualization can be modified by the user. The server relies on RASP [55] for the calculation which is an all-atom knowledge-based potential assessment for scoring of RNA 3D structures based on distance-dependent pairwise atomic interactions.

Further validation is done using MolProbity [56, 57] which is a structure validation server used for all-atom contact analysis. The web service provides broadspectrum model diagnosis and repair. It evaluates the model quality at both the global and local levels. It relies heavily on the power and sensitivity provided by optimized hydrogen placement and all-atom contact analysis, complemented by updated versions of covalent-geometry and torsion-angle criteria.

*rRNA Platform Technology for Drug Discovery Methods for Identifying Ligands That Target… DOI: http://dx.doi.org/10.5772/intechopen.98776*

**Figure 6.** *The schematic diagram of RNA modeling.*

### **2.2 Molecular dynamics simulation**

SimRNA [52, 53] was used to analyze conformational landscapes and identify potential alternative structures using the default parameters; 500 simulation steps and 1% of the lowest energy frames of the input subjected to clustering. The software employs a statistical potential to approximate the energy and identifies conformations that correspond to biologically relevant structures.

### *2.2.1 Visualization and structure analysis*

The sequence based and structure-based alignments are visualized using GNU emacs with RALEE [58] and Sublime text3 [59]. The secondary structures are visualized in Varna [60] and xRNA. The 3D model is visualized using USF Chimera [61, 62] and PyMol [63]. Video representations of the model are made using BIOVIA Discovery Studio [64].

## **2.3 Identification of rRNA motifs**

As functional components in three-dimensional conformation of an rRNA, structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms.

One major computational approach for studying RNA structural motifs is to search homologous instances of known motifs by using comparative methods. Traditionally, the motifs are modeled with their 3D geometric features, such as backbone conformations or torsion angles. Although a number of typical tools

primarily rely on the 3D atomic coordinates and do perform well for some simple motifs, they may not work for complex ones since the underlying computational methods are too rigid to identify the flexible variations in structures. Using thin formation we integrate pairwise interactions as constraints into the screening for RNA structural motifs besides 3D information, using the FR3D platform [65]. However, as the most critical character of RNAs, the base–base interactions should be used as key factors in the assessment of structural discrepancy directly. Based on this idea, we use RNA Motif Scan to search new motif candidates that share non-canonical base–base interaction patterns with the query [66, 67].

#### **2.4 Structure based molecular docking**

Molecular docking is also referred to as small molecular docking is a study of how two or more molecular structures, for instance, drug and catalyst or macromolecule receptor, match along to be a perfect fit [68]. The study employed both the Shape complementarity approach using PatchDock [69].

Shape complementary approach was successful in producing the docking results from compound in the Malaria Pathogen Box.

### **3. Conclusions**

Malaria is a disease spread by the female anopheles mosquito that contains protozoan organisms of the plasmodium genus that actually cause malaria [13]. These thrive in tropical and subtropical areas which predominant in Africa. According to WHO Sub-Saharan Africa carries a disproportionately high share of the global malaria burden.

Malaria is transmitted by the anopheles female mosquito that carries *the P falciparum* that causes malaria. There are about 100 types of plasmodium parasites which can infect a variety of species. Scientist have identified five types that specifically affect humans but of all *P. falciparum* stands out. *P falciparum* found worldwide in tropical and rural ranges, however predominately in Africa. An expected 1 million individuals are killed by this strain each year. The strain can multiply quickly and can adhere to blood vessels in the brain, bringing causing rapid onset of severe malaria including cerebral malaria [9].

The rRNA is a very core component of any cell for the *P falciparum* the 18S rRNA is where we draw our attention to, this is because even after the *P falciparum* undergoes mutation the 18S rRNA of it rarely undergoes any change thus if targeted there will not mutate to resist the drugs. In designing drugs that only attack the rRNA of the *P falciparum* not the proteins therefore we are able to make drugs that are more effective and have a higher efficacy. By docking a large library of compounds into a high-resolution structures of the target receptor 18S rRNA, fewer compounds typically need to be experimentally screened to identify compounds that are active against the target. These rRNA are the most abundant species of RNA in the living cells. They are the largest component of the ribosomes; large RNA-protein particles that form sites for synthesizes proteins and form the bulk of it. rRNAs have evolved as a hub of protein biosynthesis in all living cells performing both catalytic, regulatory and organizational roles [70, 71].

This spurred significant progress in the understanding biology of ribosomes. For example; the translational mechanism of the ribosome and the mode by which the function of the ribosome is altered by antibiotic inhibitors [4, 72]. These results have stimulated new interest in extending our understanding to the more complicated eukaryotic ribosome.

*rRNA Platform Technology for Drug Discovery Methods for Identifying Ligands That Target… DOI: http://dx.doi.org/10.5772/intechopen.98776*

As we begin to get more insights in the working of the prokaryotic ribosomes, the question we are also seeking to answer is "How does the complex eukaryotic ribosome assemble inside living cells? A number of approaches are beginning to address this question. The approaches comprise of a combination of genetic, biochemical, and structural approaches.
