AI4AXP AI4AXP Help About Us

alpha_helixInput Amino Acid Sequences in FASTA format alpha_helix

There are two methods used to submit sequence(s) for AXP (Anti-Microbial Peptide (AMP)/Anti-Viral Peptide (AVP)/ Anti-Fungal Peptide (AFP)/Anti-Cancer Peptide (ACP)/ Anti-COVID Peptide (ACVP) with hemolysis) prediction (Figure.1):

1.     Paste FASTA-format text in the area circled with the red line

2.Upload the FASTA file from the local disk folder via the button marked with the blue line.

•We ONLY accept FASTA-format text

ai4axp_webpage_input_new

Figure 1. GUI of AI4AXP.

alpha_helixLeave an Email address for system notification (Optional) alpha_helix

Usually, the time used for computation will not be too long. The user may wait until the result is available or leave a valid email address; the system will email the submitter when the result has come out.

1.     Provide an email address and click the checkbox, 'Terms of Use' explains that we only use provided email address for notification use. Email addresses will not be stored in our backend for another use.

2.     Press the Submit button, and please wait for the prediction came out.  

ai4axp_webpage_check

Figure 2. key-in a validated email (optional) if needs, then read the "Terms of Use" and  tick the checkbox.




alpha_helixHow to Read the results alpha_helix

ai4axp_webpage_output_table

Figure 3. The prediction results of the demo FASTA file for each submitted sequence indicate their potential functions.

The figure above shows the prediction results of the demo FASTA file. The column "Peptide" lists the names of input peptides. The column "Score" shows prediction scores indicating how much probability a peptide contains AXP activity. The column "Prediction results" shows whether the peptide is an AXP or not. Here, the threshold is around 0.5. "YES" means the prediction score of the peptide is more than 0.5. Otherwise, "No" means the prediction score of the peptide is less than 0.5.

Meanwhile, for those submitted sequences with the length > 50 a.a., AI4AXP will cut the sequence(s) into several ones by a sliding window (windows size =49 aa and a move =25 aa).

ai4axp_webpage_output_piechart




Figure 4. Overall the result of demo submission in piecharts as a global view.

The pie charts above show the visualization of the results in a global view (Figure 4). There are a total of 20 peptides in the demo FASTA file, and the prediction result shows the number of submitted peptides which may possess  the functions of AMPs/ AVPs/ ACPs/AFPs/ ACVP with the ability of non-hemolytic activity. Submitted long peptides (>50 aa) and unrecognized sequences are visualized and counted in separated piecharts. However, it must be emphasized that most AVPs composed of essential amino acids with a length shorter than 50 amino acids.

ai4axp_download_area

Figure 5. The download area for each submission.

Users can download their prediction results in the "Download area." Click "Result" to download the prediction results represented in the CSV file. Here are also "Submission in fasta file," "Unrecognized fasta file," and "Logfile" provided as references.

Table 1. The basic information for each prediction model with their source codes and references. 

Prediction

encoding Methods

Model

Github

Reference

AMP

PC6

CNN+LSTM

github-logo-300x300

mSystems, 2021

ACP

PC6

CNN

github-logo-300x300

Pharmaceuticals, 2022

AVP

PC6

CNN

github-logo-300x300

Bioinformatics Advances, 2022

AFP

PC6+Doc2Vec

Ensemble (RF+SVM+CNN) 

github-logo-300x300

Submitted

ACVP

PC6

CNN

-

This study

Hemolysis

PC6

CNN

-

This study

alpha_helix PC6 protein-encoding method alpha_helix

https://axp.iis.sinica.edu.tw/AI4AVP/help/avp_model_figure2.png

Figure A1. Encoding the amino acids sequence according six physicochemical properties (PC6). 

The idea of the PC6 encoding method is to use physicochemical properties as word embeddings. Each amino acid character in sequence would be replaced with a vector composed of six physicochemical property values (PC6). The six features of PC6 include hydrophobicity (H1), the volume of side chains (V), polarity (Pl), and pH at the isoelectric point (pI), the dissociation constant for the -COOH group (pKa), and the net charge index of the side chain (NCI) (detail in mSystems, 2021). 

alpha_helix Deep Learning Models used in AI4AMP alpha_helix

https://journals.asm.org/cms/10.1128/mSystems.00299-21/asset/c5d82c8c-cfdd-4c3d-a925-18334516fdea/assets/images/medium/msystems.00299-21-f006.gif

FIG A2 (A) PC6 protein-encoding method. Each input sequence will be transformed into a 200 ¡Ñ 6 matrix, respectively. (B) Deep neural network model. The PC6 encoded data matrix will pass through one convolution layer, one LSTM layer, and one dense layer (mSystems, 2021).

alpha_helix Deep Learning Models used in AI4ACP alpha_helix

Figure5_model arch

Figure A3.  Model architecture in this study. After PC6 encoding, protein sequences will go through every layer in this model (Pharmaceuticals, 2022).

 

alpha_helix Deep Learning Models used in AI4AVP alpha_helix

avp_model_figure1_0

Figure A4. The structure of models for GAN and Deep CNN used for AVP prediction (Bioinformatics Advances, 2022). 

The model used for AI4AVP was proposed in our study, 'Developing an Antiviral Peptides Predictor with Generative Adversarial Network Data Augmentation' (BioRxiv, 2021).' We used a generative adversarial network (GAN) to generate a positive training dataset to allow the deep CNN classifier to train on balanced datasets. Then, we encoded peptides using the PC6 protein-encoding method and trained the AI4AVP model based on three CNN blocks.

 

alpha_helixDeep Learning Models used in AI4AFPalpha_helix

afp_model

Figure A5. The ensemble model including two encoding and three ML/AI models for Anti-Fungal Peptide prediction.


Copyright © 2024 Institute of Information Science, Academia Sinica, TAIWAN.

All Rights reserved.