Phytoconstituents of Artemisia annua as potential inhibitors of SARS CoV2 main protease: an in silico study

Background In November 2019, the world faced a pandemic called SARS-CoV-2, which became a major threat to humans and continues to be. To overcome this, many plants were explored to find a cure. Methods Therefore, this research was planned to screen out the active constituents from Artemisia annua that can work against the viral main protease Mpro as this non-structural protein is responsible for the cleavage of replicating enzymes of the virus. Twenty-five biocompounds belonging to different classes namely alpha-pinene, beta-pinene, carvone, myrtenol, quinic acid, caffeic acid, quercetin, rutin, apigenin, chrysoplenetin, arteannunin b, artemisinin, scopoletin, scoparone, artemisinic acid, deoxyartemisnin, artemetin, casticin, sitogluside, beta-sitosterol, dihydroartemisinin, scopolin, artemether, artemotil, artesunate were selected. Virtual screening of these ligands was carried out against drug target Mpro by CB dock. Results Quercetin, rutin, casticin, chrysoplenetin, apigenin, artemetin, artesunate, sopolin and sito-gluside were found as hit compounds. Further, ADMET screening was conducted which represented Chrysoplenetin as a lead compound. Azithromycin was used as a standard drug. The interactions were studied by PyMol and visualized in LigPlot. Furthermore, the RMSD graph shows fluctuations at various points at the start of simulation in Top1 (Azithromycin) complex system due to structural changes in the helix-coil-helix and beta-turn-beta changes at specific points resulting in increased RMSD with a time frame of 50 ns. But this change remains stable after the extension of simulation time intervals till 100 ns. On other side, the Top2 complex system remains highly stable throughout the time scale. No such structural dynamics were observed bu the ligand attached to the active site residues binds strongly. Conclusion This study facilitates researchers to develop and discover more effective and specific therapeutic agents against SARS-CoV-2 and other viral infections. Finally, chrysoplenetin was identified as a more potent drug candidate to act against the viral main protease, which in the future can be helpful. Supplementary Information The online version contains supplementary material available at 10.1186/s12879-024-09387-w.


Background
SARS-CoV-2 first emerged in China and then transmitted to the rest of the world.It first emerged in bats and was later transmitted to humans with an intermediatory source that is still not known [1].The virus belongs to the β-group and is a positive-sense RNA-enveloped virus.This zoonotic virus can easily adapt and become more virulent with time [2].The diameter ranges from 65-125 nm and has four main structural proteins, which include spike glycoproteins, membrane glycoproteins, small envelope glycoproteins and nucleocapsid protein with several other proteins [3].The life cycle of the virus depends upon two main polypeptides pp1a and pp1ab which further process 15 non-structural proteins with the help of papain-like protease and the main protease M pro [4].This main protease is responsible for cleavage at 11 sites in the replicase protease.For this reason, M pro is selected as a potential drug target site [5].M pro a 33.8 KDa CoV enzyme plays a major role by digesting the replicase polyproteins of the virus at 11 conserved sites.This makes M pro an efficient drug target site against the active constituents present in Artemisia annua [6].This protease is considered a cysteine protease as it has a cysteine histidine catalytic dyad and cleaves peptide bonds at Glm-Ser/ Ala/Gly [7].In SARS-CoV-2 14 different proteolytic sites of PLpro and 3CLpro were determined.At the N-terminal, PLpro cleaved three sites at 181-182, 818-819 and 2763-2764 and at the C-terminal it cleaves 11 different sites producing 15 non-structural proteins.These contain Nsp3 with multiple domains with a unique domain of SARS-a proteolytic enzyme and a deubiquitylation enzyme.Nsp5 is the 3CLpro, Nsp12 is RdRp (RNA dependent-RNA polymerase) and Nsp13 is a helicase [8].
The virus-infected individuals showed variations in symptoms and the rate of infection.Human coronavirus caused upper respiratory tract infections [9].An initial study showed that almost 91% of affected individuals showed high fever, 77% had a cough, 44% individuals felt fatigued, other symptoms like salivation were shown by 28%, headache by 8%, 5% had hemoptysis and 3% faced diarrhea [10].For the treatment of the symptoms shown, four kinds of vaccines have been developed, which include the whole virus vaccine, second one is the type of recombinant protein subunit vaccine which specifically targets the spike protein.The replication-incompetent vaccine and the nucleic (mRNA) based vaccine [11].Many drugs were also repurposed, azithromycin has been commonly used in Pakistan and other countries.Azithromycin is used during respiratory, urinary, dermal and other bacterial infections.It is also used during chronic inflammatory disorders that include posttransplant bronchitis, diffuse pan bronchitis and rosacea [12].Many medicinal plants such as Artemisia annua belonging to the family Asteraceae were also exploited.This plant is rich in around 600 active metabolites and has shown anti-fungal, anti-asthmatic, hepatoprotective and antioxidant properties [13].In Madagascar, a drink was prepared with the infusion of Artemisia annua and other plants to cure Covid-19 [14].
In addition, the development of antiviral medications that impede the activity of the SARS-CoV-2 Mpro is viable and has potential for practical use.Phytochemicals produced Artemisia annua aid in combating illnesses caused by fungus, bacteria, and plant viruses [15].The rationale for selecting this plant is based on the phytochemicals they contain generally have beneficial effects on health [16].This include bioactive nutrients that have been shown to have positive effects on human health by reducing the risk of major chronic illnesses.Phytochemicals have been shown to have potential efficacy in treating many illnesses, as indicated by preclinical, clinical, and epidemiological studies, owing to their antioxidant and anti-inflammatory properties [17].
Molecular docking has been in use for the past three decades for drug designing through computer assistance.Docking is preferred while performing virtual screening for the analysis of the functions of the compounds.Results can easily be classified through docking and it can give a detailed analysis of how the ligand interacts with the protein, which can optimize the lead compounds for drug development [18].Different docking programs use one or more search programs for prediction of possibilities of receptor-ligand complexes.For this purpose, molecular docking has become a key tool for drug discovery and molecular modeling applications.The result gives a score of the interaction, making it more reliable for predicting the ligand pose and, through that pose binding site of the ligand can easily be determined [19].

Retrieving protein's structure
For the study, the SARS CoV-23 CLpro or M pro was selected as a potential drug target site.The crystal structure of the main protease was downloaded under the PDB ID 6lu7 (https:// www.rcsb.org/ struc ture/ 6lu7).

Cleaning of protein's structure
M pro is a linear chain that consists of 1-306 amino acids referring to its A chain.Extra constituents were present as side chain C and the Nitrogen and water components were removed through Pymol [20].

Determining the physicochemical properties of M pro
The physicochemical properties of the protein which are molecular weight, isoelectric point, amount of negative and positive residues, extinction coefficient, instability index, aliphatic index and GRAVY were determined using ProtParam, which is a tool of ExPAsy [21].

Ligand selection and preparation
The reported antiviral ligands of Artemisia annua were selected as potential hits.The 3D structures for the 25 selected ligands were downloaded from the PubChem database (https:// pubch em.ncbi.nlm.nih.gov/).The data was obtained in SDF format.The 3D structures were represented in Kekule's form.The molecular mechanics 2 (MM2) energy of the ligands was minimized by the use of Chem3D ultra [22].The reported compounds compactness was removed with addition of hydrogen atoms and making them flexible against the target protein.This was followed by drug likeness and lead likeness by applying Lipinski rule of five and other filters using SwissAdme [23,24].

Molecular docking
A blind docking software CB-Dock [25] was used for the docking of proteins and ligands.It finds the docking sites automatically and gives the calculation of the size, center and sites of bonding in 5 different poses of interaction.A grid box of x, y and z coordinated were adjusted i.e. x = -27.77,y = 13.56,z = 56.09with size of x = 51.4,y = 62.45 and z = 53.84.The best pose is the one with the minimum vina score in KJ/mol.

Visualization of docking result
For visualization of the docked results, Pymol was used which provides a plugin that can access results easily and makes their visualization clear.

Analysis of docking result
For the 2D generation of protein-ligand complex LigPlot has been used which provides an analysis of hydrophobic and hydrogen bond interactions.
The druglikeness was studied using (SwissADME server http:// www.swiss adme.ch/) Lipinski rule of 5 (octanol-water partition coefficient log P value of drug like compound be limited to 5, molecular weight should be > 500, hydrogen bond acceptance number be 10 and hydrogen bond donor number should be 5) [24].

Molecular dynamics simulations
Molecular Docking Studies can predict ligand binding states in static conditions.Docking provides a static depiction of how a chemical interacts with the active site of a protein [27].To compute atom motions over time, MD simulations employ Newton's classical equation of motion.Amber16 software was applied for the complex system to check the carbon alpha atom structural deviation throughout the time scale [28].They can predict ligand binding status in physiological environments [29,30].Protein Preparation was done using chimera with 750 steepest and conjugate gradients with a total of 1500 steps to perform complicated optimization and minimization on the receptor-ligand combination.Both the systems were prepared against the Top1 and Top2 complex system using the System Builder tool.TIP3P (Transferable Intermolecular Interaction Potential 3 Points) of AMBER16 tool was chosen as a solvent model with an orthorhombic box.To neutralize the models, counter ions were introduced.0.15 M sodium chloride (NaCl) was added to mimic physiological circumstances.The NPT ensemble with a temperature of 300 K and a pressure of 1 atm was utilized throughout the simulation.Before running the simulation, the models were relaxed.The trajectories were collected every 200 ps, and the system was prepared by introducing minimization, heating, equilibrium and pressure states.Final production of the simulation has been run via Amber 16 and RMSD (root mean square deviation) of the protein and ligand over time was used to test the stability of the simulations.The total simulations time given for both the systems was applied for 100 ns.Followed by trajectories analysis with RMSD, RMSF, SASA, Hydrogen bonds analysis, PCA and Binding free energies calculations.

Refining of protein
The 3D structure was obtained from PDB (Protein Data Bank) under the name 6LU7 with the DOI number https:// doi.org/ 10. 2210/ pdb6L U7/ pdb [31].The structure obtained was attached with an N3 inhibitor, through PyMol the inhibitor and the extra side chain C was removed as shown in Fig. 1.Domains I and II had an anti-parallel β barrel structure, whereas Domain III has a globular structure consisting of 5 anti-parallel α helices.Domain III was connected by Domain II by a loop region consisting of 185-200 residues [6].

Physicochemical properties of protein
By using ProtParam a tool of ExPASy the physiochemical properties were studied.Table 1 shows the collective weight of negative and positive amino acid residues as 33,796.64.The pI value indicates acidic nature of M pro .The low value of GRAVY indicates good interactions with water molecules and values for instability index and aliphatic indexes show the stability of the protein.

Identification of functional domains of protein
Functional domains are the active part of the protein that are used for interaction with other proteins and substances.The InterPro job ID for finding the functional domains of 6LU7 is https:// www.ebi.ac.uk: 443/ inter pro// result/ Inter ProSc an/ iprsc an5-R2021 0417-071019-0353-62313 319-p2m/.Two promoters A and B combine to form a 306 amino acid long polypeptide consisting of three domains.Domain I is 8-101 residues, Domain II is 102-184 residues and Domain III are 201-303 residues.There is a cleft between domain I and II which acts as a binding site [6].

Drug-likeliness and toxicity prediction
Drug likeness of the compounds was done by Lipinski Rule of Five which indicates that the molecular weight of a protein should be ≤ 500, log P ≤ 5, H-bond donors ≤ 5, and H-bond acceptors ≤ 10.The rules are followed by orally administered drugs.A compound following three of the rules is considered a drug [24,36].Supplementary Table S1 shows that Rutin does not follow Lipinski rule whereas Sitogluside disobeys two of the rules.

Toxicity prediction
PkCSM [37] an online tool was used to check the toxicity of the ligands and that of the standard drug.Table 2 shows the results of toxicity prediction.The table shows that both artemisinin and dihydroartemisinin are AMES toxic which means that they can be mutagenic and can later be carcinogenic.Rutin and beta-sitosterol are hERG II inhibitors that can lead to the potassium channel inhibition leading to QT syndrome.Myrtenol and artemisinic acid are sensitive to skin.The values of T.pyriformis toxicity show that β-pinene, artemisinic acid and scopoletin

Oral rat acute toxicity
Oral rat chronic toxicity are toxic.Sitogluside and beta-sitosterol are minimally toxic.

Molecular docking
Molecular docking was performed using M pro as receptor protein and the ligands selected above.The protein in PDB format and the ligands in SDF format were docked.The CB dock server was applied and then check the input files by converting them into pdbqt format files using OpenBabel and MGL Tools [38,39].Then CB dock predicts the cavities of the receptor and calculates the centers and sizes of the top five cavities.Among the five best conformations the best one was selected based on a highaffinity score of the interaction between the protein and the ligand [40].Ligands showing the best binding score between the selected ligands and the protein M pro are shown in Table 3.
Ligands like quercetin, rutin, casticin, chrysoplenetin, apigenin, artemetin, artesunate, scopolin and sitogluside have shown good docking results.Rutin shows a binding score of -8.9 kJ/mol with a log P value of -1.6871.Apigenin shows a binding score of -7.8 kJ/mol with a log P value of 2.5768.chrysoplenetin shows a binding score of -7.7 kJ/mol and logP vale is 2.9056.Quercetin, Artemetin, casticin and sitogluside show vina score as -7.6 kJ/ mol with log P values as 1.988, 3.2086, 2.9056 and 5.849 respectively.Scopolin and artesunate show a score of -7.5 kJ/mol with log P values as -1.0197 and 2.6024.Ligands like quercetin, rutin and apigenin had already been reported to be docked against M pro by using Auto dock wizard as reported by Oluwaseun Taofeek in 2020.Quercetin showed a score of -7.2 which is less than the docking score by CB-dock.Rutin showed a score of -7.7 kJ/mol and apigenin gave a binding score of -6.8 kJ/ mol which is less than that shown by CB-dock [41].

Interaction of ligands and protein
The deducted docked results were analyzed using LigPlot and PyMol.The interaction between the ligands and the receptor protein was predicted through LigPlot + [42].The graphical system of LigPlot automatically generates 2D pictures of interactions from its 3d coordinates.The 2D diagrams of the interaction of the ligands with the best docking score and the standard drug with the protein are shown in Fig. 2a-J.Whereas Table 4 shows the hydrogen and hydrophobic interactions.

ADME properties of ligands
pkCSM is the second tool used for the assessment of ADME properties [18].

Distribution
Distribution properties (Table S4) show that quercetin, casticin, scopolin and rutin cannot cross the blood brain barrier.Quercetin, rutin, casticin, scopolin, artesunate and chrysoplenetin are not CNS permeable.Azithromycin shows a low VDss value which means the drug would not be properly distributed.The distribution parameters of quercetin, rutin and apigenin have already been studied by Oluwaseun Taofeek [23] and that of artemetin, scopolin, artesunate and casticin have been reported by Zarina Khurshid [40].

Excretion
Table 6 indicates that all the ligands and standard drugs are not renal OCT2 substrates which means that they would not be cleared out from the body.Pkcsm excretion properties of scopolin, artesunate, artemetin and casticin have been reported previously [40].Quercetin, rutin, and apigenin have been recorded by Oluwaseun Taofeek [41].

Lead compound identification
After the first knockout by Lipinski's rule and the second by the study of pharmacokinetics rutin was knocked out.Other ligands based on docking score and properties were taken to the next level.After that chrysoplenetin was selected as the lead compound depending upon its pharmacokinetics and pharmacodynamics.

Comparison of standard drug with lead compound
The lead compound chrysoplenetin was compared with the standard drug azithromycin and their physiochemical and pharmacokinetic properties were compared for the assessment of bioavailability, safety, efficiency, and drug-likeness.

Lipinski's rule comparison
The comparison shows that azithromycin breaks two of Lipinski's rules that are of molecular weight and H-bond acceptor as the molecular weight exceeds the limit of 500, and the H-bond acceptance value exceeds 10.The lead compound however follows all the given Lipinski's rules (Table S4).

ADMET properties comparison
Toxicity comparison The value of toxicity parameters of Azithromycin shows that this drug can be toxic to liver, but other parameters are in the range of positive values.Azithromycin cannot cause any sensitivity to the skin and is also not an inhibitor of hERG I and hERG II.The dose value of 1.927 is also tolerable.With that a no to AMES toxicity indicates that it is not carcinogenic.
The toxicity comparison is based on 9 models (Table 7).Model I of AMES toxicity shows that none of these are mutagenic, Model II shows that chrysoplenetin has a low value of dose.The third model shows that both are not hERGI and hERGII inhibitors.The fourth and fifth model of oral rat acute and chronic toxicity value give the value of a low dose that could result in an adverse effect.The hepatotoxicity model shows that azithromycin

Absorption properties comparison
The parameter of absorption is based on 6 models (Table 8).CaCO 2 solubility suggests that chrysoplenetin is absorbed more than Table 5 Metabolism properties of the ligands and standard drug azithromycin.Intestinal absorption of azithromycin is also low.Chrysoplenetin is a P-glycoprotein II inhibitor whereas azithromycin is a P-glycoprotein I inhibitor.

Metabolic properties comparison Metabolic properties indicate that azithromycin is a CYP3A4 substrate
whereas chrysoplenetin is a CYP3A4 substrate and an inhibitor to CYP1A2, CYP2C19 and CYP3A4 (Table 9).

Distribution properties comparison and PAINS alert
Table 10 shows that azithromycin has poor distribution to the brain, and it will not be able to pass the central nervous system.Further it has been investigated for PAINS alert as well to check whether the compounds interact with human proteins inadvertently or not.Upon investigating both compounds 0 PAINS alert were inferred showing no interactions with human protein targets.

Excretion properties comparison
The excretion properties of both the compounds are in the safe range.Chrysoplenetin has more total clearance than azithromycin (Table 11).

Physicochemical properties comparison
This screening tells that azithromycin has 38 carbon atoms, 72 hydrogen atoms, 2 nitrogen atoms and 12 oxygen atoms whereas chrysoplenetin has 19 carbon atoms, 18 hydrogen atoms and 8 oxygen atoms.The number of    The molecular weight of azithromycin also exceeds the rule limit of 500 g/mol.Chrysoplenetin has 5 rotatable bonds while azithromycin has 7 rotatable bonds (Table 12).

Docking score comparison
Both the standard drug and lead compound were docked against M pro .Table 13 shows the docking results which indicate that the binding score of azithromycin is -6.8 and that of chrysoplenetin is -7.7.The result indicates that chrysoplenetin can block or bind with M pro more efficiently than azithromycin.

Docking analysis comparison
The docking results are analyzed by Discovery Studio [43] based on the number of hydrogen bonds, number of hydrophobic interactions, steric interactions and number of interacting amino acids.Figure 3 shows the interaction of azithromycin and chrysoplenetin with the receptor.Azithromycin has formed only one hydrogen bond and ten hydrophobic interactions whereas chrysoplenetin forms seven hydrogen and nine hydrophobic interactions.Table 14 gives the details of this interaction.The binding site of the protease with previously reported inhibitors has been reinvestigated during the interaction of chrysoplenetin.It has been inferred that chrysoplenetin binds to the same residues sites where previously reported inhibitors has been found active [44][45][46].

Molecular dynamics simulations
Protein conformational dynamics are the most important aspect of its function.The functional information of a protein molecule is contained in its structure.The structure must be thoroughly examined to comprehend its functional variability [47].MD simulation using AMBER was utilized in this work to investigate the conformational component of protein-ligand interactions.RMSD depicts the backbone analysis and Cα atom dynamics of a docked protein during a 100 ns time for both complexes.A high fluctuation at 20-30 ns was observed with a maximum value was observed in Top1 (Azithromycin) and in Top2 (Chrysoplenetin), this has been depicted among 15-55 ns and the stability for the rest of the time scale has been observed for the rest of the simulation time interval.Herein, Fig. 4 (A) shows that the average RMSD value for docked protein was 4.69 Å, with a maximum peak of 7.87 Å.The RMSD graph does not support any dramatic domain alterations within the protein-ligand complex's structural framework.Whereas the average RMSD value for the Top2 complex observed was 1.71 Å with a maximum RMSD of 3.87 Å. Trajectory analysis was utilized to identify the protein substructures that generated the RMSF trend.This is followed by the RMSF value against both complex, where the average RMSF value recorded for Top1 is 1.7 Å with a maximum value of 5.2 Å and minimum value of 0.7 Å.The RMSD graph revealed substantial structural changes at 15 ns, 25 ns, 45 ns, and 65 ns, after which the protein stabilized.Figure 4 depicts the structural changes that occurred in the docked protein complex at various periods.RMSF quantifies the flexibility and variation of Cα residue structures across time.The average RMSF for the Top2 complex of docked protein for 100 ns was 1.61 Å, with a high of 4.2 Å, and significant changes at residues 76,103, 203 to 279, as well as at the end of the graph for residues 518 and 599.These were largely protein-loop areas where no structural deviation has been observed but shows fluctuations at RMSF level.

SASA analysis
We determined the solvent accessible surface area (SASA) of residues that were linked to significant Drug-Target interaction or catalytic activity, owing to intriguing findings from RMSD and RMSF analysis.When comparing the Top1 to Top2 complex system, the SASA profile (Fig. 5) clearly shows the reduction in the SASA of the critical residues.The enzymatic activities of targeted  proteins are inhibited in both systems because of the limited accessibility to critical residues inside complex systems, hence reducing the likelihood of complex interactions.According to the SASA analysis, there must have been some conformational changes in the protein surface, or the amino acid residue moved from the accessible area to the buried region because of complex formation.Stability of the key residues has been monitored throughout the simulation time intervals at the time scale of 100 ns.The average value recorded for Top1 complex system was 325.56 Å 2 and 191.74 Å 2 respectively offering a possible dynamics and stability of the system.

Hydrogen bond analysis
Hydrogen bonds play an important role during simulation reflecting the stability of the system.This was observed by applying the amber16 tools by giving all the frames of simulation time intervals.It has been inferred that Top1 complex system shows hydrogen bonds among the atom of the residues that include OD1 of Asn142 with O9 of ligand atom.This bond was continuously interacting with the residue till 100 ns time intervals with some minor breakage.This was followed by other hydrogen bonds i.e.Asn142-OD1-LIG-H10 and Glu-O-LIG-H111.Thus, making the Top1 in a stable position inside the active site.On other side the Top2 ligand have a strong hydrogen bonds interaction that includes Gly43 interacting the O3 and O4 atom of the inhibitor molecule followed by Cys145 AND Ser 144 of the target residues.

Binding free energies analysis
The total binding free energy and complex stability were mostly determined by the energy terms i.e. van der Waals (Evdw), electrostatic (Eele), and solvation (Esolvation), among other energy factors that contributed to the drug-target binding energy.Because end point freeenergy modelling techniques, such as MM/PBSA, are known to provide more accurate results than docking binding energy, these were used to investigate the binding free energies of both complexes.Table 15 provides an overview of the binding free energy of both compound complexes.After analysis, the average binding free energy of the Top1 complexes was found to be -45.5898kcal/mol in MMGBSA and -42.875 kcal/mol during MMPBSA approach.Figure 6 also provide the other binding energies factor which infers the complex's electrostatic (-10.8859kcal/mol), van der Waals (vdW) (-50.56 kcal/ mol), polar surface (-7.1783 kcal/mol) nonpolar surface (-5.678 kcal/mol).Whereas, the Top2 complex system depicting the total binding free of (-44.2875kcal/mol) for MMGBSA and (-43.376kcal/mol), with electrostatic energy of (-11.0774kcal/mol), van der Waals (vdW) (-57.1607kcal/mol), polar surface energy (-8.8117 kcal/ mol) and nonpolar surface energy (-5.6832 kcal/mol).Herein, for the Top1 and Top2 complex system the calculations of free energy showed that vdW energy plays a significant role in binding free energy, thus indicating the Top2 complex system shows a low binding energies with high affinity during the time intervals of 100 ns, thus helps the complex become transiently stable.

PCA analysis
To deduce the mechanical characteristics, such as structural movements and variations, of both complexes Principal Component Analysis (PCA) analysis was performed.Using vectorial representations, a collection of eigenvectors representing the motion of each individual component was obtained from the MD trajectories as shown in Fig. 7. Results evaluated shows the mobility of protein residues in both the system and it has been analyzed that both the drug bind to the active residues shows a slight deviation and alteration during the simulation Fig. 4 A Top1 depicts the RMSD and (B) RMSF plot of the C-alpha atoms of ligand-bound proteins throughout time interval.The figures suggest that the protein in the complex with Tocofersolan attained stability at 37 ns till the end of the simulations.Whereas this has been validated in the RMSF graph which attains stability of the system with few minor fluctuations at some loop region residual sites especially from 200-300 residual sites.This is followed by the B-Top2 figure, depicting the stable RMSD with a minor deviation at 20 ns showing a max RMSD of 3.8 Å. D depicting the RMSF of Top2 complex system with fluctuation at surface residues at the time scale of 100 ns Fig. 5 Depicting the dynamics of the Solvent surface residues of both complex system during the time scale of 100 ns time intervals.Clusters residues in the Top1 and Top2 system moving towards the lower state energies with drug molecules moving deep inside the active cavity of the target protein.Herein, Top2 complex system shows more deviation on x-axis as compared to Top1 which depicts the residues are moving towards high lower state level making the system more stable.

Discussion
With the emergence of the human SARS Coronavirus 2, many questions related to the evolution and introduction of the virus in the human race were raised.The biological community was indulged in finding answers to the reservoirs of the virus, its spread and its effect on the human race [48].It was found that upon transcription the beta coronavirus produces 800KD polypeptide, which is cleaved by papain-like protease and 3-chymotrypsinlike protease to generate various non-structural proteins that are involved in viral replication [49].The main protease protein having an essential role in the replication of viruses served as an amazing drug target site [5].Many drugs were used against M pro and few vaccines were developed for the inhibition of virus replication.Azithromycin has been commonly used in Pakistan and other countries.It has shown vital results during respiratory, chronic inflammatory disorders and bronchiolitis [12] for this reason it is used as a control.Medicinal plants have shown therapeutic properties, Artemisia annua is a herb used against malaria and other fevers.In Madagascar, a drink was prepared with an infusion of Artemisia annua to cure Covid-19 [14].For this purpose, the plant was Fig. 6 Top1 and Top2 complex depicting the hot spot residues involved in hydrogen bonds during the simulation time intervals after 100 ns time scale exploited, and 25 active constituents were selected to be docked against the main protease.CB-dock a blind docking tool was used.Docking focuses on cavity binding so that ratio of accuracy is higher [50].For performing docking the 3D structure of a protein in pdb format and the 3D structure of the ligand in sdf format was uploaded.The pose with minimum vina scores in KJ/mol was selected [51].The docking results were visualized in PyMol and the 2D representation was generated in LigPlot.After studying the ADMET properties of the ligands and passing them from the Lipinski Rule, 9 out of 25 ligands were selected which were quercetin, rutin, casticin, chrysoplenetin, apigenin, artemetin, artesunate, scopolin and sitogluside.After their analysis, chrysoplenetin was selected as the lead compound and was compared with the standard drug azithromycin.
Predictions of ADMET using in silico methods are essential for drug development, but they are subject to constraints including dependence on high-quality data, the application of the model, comprehension of intricate biological processes, variability between individuals, and difficulties in precisely predicting toxicity.The aforementioned constraints require enhancements in predicting methodologies and the incorporation of other scientific methodologies [52].Hrein, Azithromycin was shown to break the molecular weight and H-bond acceptor rule of Lipinski [24] whereas chrysoplenetin follows all the rules.Azithromycin was proved to be hepatotoxic when toxicity comparison was done by pkcsm.In the T.pyriformis (model 8) toxicity test, azithromycin was proved to be somewhat toxic.The intestinal absorption of azithromycin is also less than chrysoplenetin.Furthermore, Azithromycin showed a binding score of -6.8 kcal/mol forming only one hydrogen bond and ten hydrophobic bonds whereas chrysoplenetin showed a score of -7.7 kcal/mol forming seven hydrogen bonds and nine hydrophobic bonds.According to MD simulations, the drug receptor complex became stable in the physiochemical environment as its shape altered over time.Despite minor alterations inside chain and loop mobility, the inhibitor remained stable.The structural stability of the docked complex following simulation studies implies that the chosen ligand might be a viable lead chemical.Results were quite interesting as the protein in complex with chrysoplenetin attained high stability throughout the simulations time interval.Where a little change has been recorded in the RMSF.Binding free energies, SASA analysis, Hydrogen bond analysis and PCA analysis has shown a high ground stability of chrysoplenetin in complex with a target protein during the simulation time interval of 100 ns.All these studies showed that chrysoplenetin is a better drug candidate against the reported protein structure.Herein, Molecular dynamics (MD) simulations provide a comprehensive depiction of the fluctuations and conformational changes seen in biological systems by replicating the actual motions of atoms and molecules over a period of time.These parameters, including stability, binding affinity, and conformational alterations, play a crucial role in establishing the biological significance of the complex system.Such knowledge provided is of great use in furthering our comprehension

Conclusions
Covid-19 has created massive problems for the human race.M pro was revealed to be an active drug target.Some plants were exploited for their efficiency against M pro .25 ligands from the plant Artemisia annua showing antiviral properties were selected.Based on the inhibition effect involving hydrogen bonding and hydrophobic interaction, chrysoplenetin was selected as a lead compound which was then compared with the standard drug azithromycin.The inhibition process is explained by how the inhibitor binds and how the target protein acts as a catalyst.Likewise, the structure dynamics of the docked protein reveal useful information that can be used to improve the drug's effectiveness against the comparison between these two shows that chrysoplenetin can be better and more effective against M pro .This research contributes to the global effort to combat SARS-CoV-2 and provides a foundation for addressing future viral threats by emphasizing the potential of chrysoplenetin as a therapeutic agent.The recognition of chrysoplenetin as a highly effective inhibitor presents opportunities for additional comprehensive investigations, such as clinical trials, to ascertain its safety and efficacy in human subjects.Furthermore, the approaches and insights acquired have the potential to expedite the identification of innovative inhibitors targeting new viral proteases, thereby enhancing our readiness for prospective pandemics.

Fig. 2 A
Fig. 2 A Interaction of quercetin with M pro , B Interaction of rutin with M pro , C Interaction of apigenin with M pro , D Interaction of chrysoplenetin with M pro , E Interaction of artemetin with M pro , F Interaction of casticin with M pro , G Interaction of sitogluside at cavity 1 with M pro , H Interaction of sitogluside at cavity 4 with M pro (I) Interaction of scopolin with M pro and (J) 2 J Interaction of artesunate with M pro

Fig. 7
Fig. 7 PCA analysis.Top1 and Top2 complex systems show the movement of residues in comparison at the end of the simulation time intervals

Table 1
Physical properties of M pro * MW Molecular Weight, * pI Isoelectric Point, * NR negatively charged residues, * PR positively charged residues

Table 2
Toxicity prediction

Table 3
Molecular docking results of the selected compounds against the target protein * HBD Hydrogen bond donor, * HBA Hydrogen bond acceptor the result of absorption properties of ligands.Quercetin, rutin, sitogluside, scopolin and artesunate show low CaCO 2 solubility.Rutin shows poor intestinal absorption.All ligands act as P-glycoprotein substrates.Artemetin and sitogluside act as P-glycoprotein I inhibitors and artemetin, sitogluside, casticin and chrysoplenetin act as P-glycoprotein II inhibitors.Azithromycin shows a low CaCO 2 solubility and water solubility.It has a lower value of skin permeability and is a P-glycoprotein substrate and P-glycoprotein I inhibitor.The Pkcsm absorption properties of artemetin, casticin, scopolin, and artesunate have already been reported by Zarina Khurshid in 202.The absorption parameters of quercetin, rutin and apigenin have been studied by Oluwaseun Taofeek in 2020

Table 4
Hydrogen and hydrophobic interactions is hepatotoxic.Model seven indicates that neither of the two are skin sensitive.Toxicity model 8 and 9 indicate that azithromycin is somewhat toxic.

Table 6
Excretion properties of the ligands and standard drug

Table 9
Metabolic properties comparison

Table 10
Distribution properties comparison

Table 11
Excretion properties comparison

Table 12
Physiochemical properties comparison

Table 13
Docking score comparison

Table 14
Docking analysis comparison

Table 15
Binding free energies calculation of both Top1 and Top2 after simulation time intervals of 100 ns