Investigation
of Missense Mutations in KCNJ2 Gene: A
Computational Approach
Kanika Verma,
V. Shanthi, K. Ramanathan*
Industrial Biotechnology Division, School of Bio
Sciences and Technology, VIT University, Vellore- 632014, Tamil Nadu, India.
*Corresponding Author E-mail: kramanathan@vit.ac.in
ABSTRACT:
Though gene are already
known to be responsible for ATS, but the knowledge of missense mutation that
disease gene have still to be under covered. The present study has focused aims
to address this issue particularly in KCNJ2 (Potassium Inwardly-Rectifying
Channel, Subfamily J, Member 2) gene aid of computational approach. Initially
64 missense mutation of KCNJ2 gene retrieved from dbSNP database. The
structural and functional impact of these mutations were studied by different
genomic algorithms such as SIFT, PolyPhen2.0, SNPs and GO,PANTHER, PhD-SNP,
I-Mutant 2.0, MUpro, iStable, Align GVGD, mCSM, nsSNP analyzer programs,
respectively. Subsequently, molecular docking study was also employed to
understand the effect of these mutations in the function of that protein. These
data suggest that 5 mutations namely V295A, R312C, R260P, C154F and D71V were
found to be more deleterious mutation in KCNJ2 gene. We sincerely hope that the
results could be of immense importance in understanding the genetic basis of
Anderson-Tawil syndrome.
KEYWORDS: KCNJ2 gene;
Flecainide, Molecular docking; Genomic algorithms.
INTRODUCTION:
Changes in the single base in human genome are known as single nucleotide
polymorphism (SNP) and it is most frequent type of genetic variation1, 2.
Upto December 10, 2013 a total of 284103258 SNPs in humans
have been identified and deposited in the NCBI dbSNP. When SNPs occur in a
coding region and cause the modification in the amino acid in the protein is
called nonsynonymous single
nucleotide polymorphisms (nsSNPs) 3. These nsSNPs occurring in a protein
coding regions affects the protein function and causes various genetic diseases4.
Andersen–Tawil syndrome
(ATS) is a rare inherited ion channel dysfunction often characterized by
periodic paralysis, prolongation of QT interval with ventricular arrhythmias
and characteristic physical features include clinodactyly, low set ears,
micrognathia, a broad forehead cleft plate and hypertelorism5, 6.
Reports have suggested that ATS is caused mainly due to mutations in KCNJ2 gene
which encodes an inward rectifying K+ channel protein, Kir2.1.6. The
encoded protein has a greater tendency to allow potassium to flow into a cell
rather than out of a cell, and participates in establishing action potential
waveform and excitability of neuronal and muscle tissues. The concentration of
extracellular potassium regulates their voltage dependence thus when external
potassium is raised, it leads to more positive voltage ranges for channel
opening. The inward rectification is principally owing to the blockage of
outward current by internal magnesium. The Kir2.1 channels are
predominantly expressed in the brain, heart and skeletal muscles7.
These channels are important regulators of cellular excitability and resting
membrane potential of both cardiac and skeletal muscles8. Structurally,
alpha subunit is the pore-forming subunit of KIR channels. It contains one pore
domain between two membrane spanning regions. A tetramer is formed by four
alpha subunits, with the pore domain of each subunit contributing to the
structure of the central pore. In this study we have tried to find out the
effect of flecainide (a sodium channel blocker), on Kir2.1. Many clinical
trials of this drug have been reported to be effective in controlling cardiac
events 9, 10. Mutant
sodium channels that are in closed inactivated state can offer a way to devise
a new strategy in developing mutation specific treatment 10, 11. Therefore in this study we
are looking at the binding efficiency of flecainide with mutant Kir2.1 channel
to suggest mutation specific treatment for ATS. Hence, the present study has focused to
understand the deleterious effect of genetic variations involved in the KCNJ2
with the help of bioinformatics. We have
incorporated different computational algorithms to eliminate false positive
results and to enhance the accuracy of the predictions.
MATERIAL AND
METHODS:
Sequence
data sets and polymorphism identification
The
SNPs and their associated protein sequence for KCNJ2 gene were obtained from
dbSNP (http://www. ncbi.nlm.nih.gov/SNP/) for our computational analysis.
Analysis
of functional consequences of coding nsSNPs by sequence homology based method
(SIFT) and Damaged nsSNP found by the PolyPhen-2 server:
SIFT
(Sorting Intolerant from Tolerant) is a sequence homology based tool, which
presumes that important amino acids will be conserved in the protein family12.
Hence, changes at well-conserved positions tend to be predicted as deleterious.
The underlying principle of this program is that SIFT takes a query sequence
and uses multiple alignment information to predict tolerated and deleterious
substitutions for every position of the query sequence. SIFT is a multistep
procedure that, given a protein sequence, (a) searches for similar sequences,
(b) chooses closely related sequences that may share similar function, (c)
obtains the multiple alignment of these chosen sequences, and (d) calculates
normalized probabilities for all possible substitutions at each position from
the alignment. Substitutions at each position with normalized probabilities
less than a chosen cutoff are predicted to be deleterious, and those greater
than or equal to the cutoff are predicted to be tolerated13. The
cutoff value in SIFT program is a tolerance index of >0.05. Higher the
tolerance index, less functional impact a particular amino acid substitution is
likely to have. PolyPhen input is the amino acid sequence of a protein or the
SWALL database ID or accession number together with sequence position and two
amino acid variants characterizing the polymorphism. Sequence-based
characterization of the substitution site, profile analysis of homologous
sequences and mapping of the substitution site for a known protein's
3-dimensional structures are the parameters taken into account by PolyPhen
server to calculate the score. It calculates position-specific independent
counts (PSIC) scores for each of the two variants, and then computes the PSIC
score difference between them. Higher the PSIC score difference, higher is the
functional impact a particular amino acid substitution is likely to have14.
Analysis by SNPs and GO, PANTHER and PhD-SNP:
SNPs
and GO collects in unique framework information derived from protein sequence,
3D structure, protein sequence profile, and protein function. The enormous
number of human SNPs available in the data bases poses the question of relating
protein variations to diseases. We propose a new server that uses different
pieces of information, including that is derived from the Gene Ontology
annotation to predict if a given variation can be classified disease-related or
neutral. SNPs and GO is an SVM-based classifier consisting of a single SVM
that takes in an input protein sequence, profile and functional information.
The server also implements PHD-SNP method that takes in input different subsets
of SNPs and GO's input features15. PANTHER algorithm is based on a
library of Hidden Markov Models (HMMs) obtained from the multiple sequence
alignments of different protein families. Thus it performs the evolutionary
analysis of coding nsSNPs. PANTHER estimates the likelihood of a particular
nsSNP causing a functional impact on the protein. The software computes the
predicted free energy change value or sign (∆∆G) which is
calculated from the unfolding Gibbs free energy value of the mutated protein
minus unfolding Gibbs free energy value of the native protein (kcal/mol). A
positive ∆∆G value indicates that the mutated protein possesses
high stability and vice versa16-18. PHD-SNP is based on SVM-based
classifier. In the new version the authors have developed a predictor based on
a single SVM trained and tested on protein sequence and profile information.
PHD-SNP is optimized to predict if a given single point protein mutation can be
classified as disease-related or as neutral polymorphism. The output consists
of a table listing the number of the mutated position in the protein sequence,
the wild-type residue, the new residue and if the related mutation is predicted
as disease-related (Disease) or as neutral polymorphism (Neutral) 15, 16,
19.
Analysis by I-Mutant 2.0, iStable and
Mupro:
Mutant
2.0 is a support vector machine (SVM)-based tool for the automatic prediction
of protein stability changes upon single point mutations. I-Mutant 2.0
predictions are performed starting either from the protein structure or, more
importantly, from the protein sequence. This program was trained and tested on
a data set derived from ProTherm, which is presently the most comprehensive
available database of thermodynamic experimental data on free-energy changes of
protein stability upon mutation under different conditions.
Table 1: List of variants that were predicted to be
functionally significant by SIFT and Polyphen 2.0 algorithm.
|
S.No. |
SNPID |
Nucleotide change |
Amino acid Change |
Score |
Prediction |
Score |
|
1 |
rs375727662 |
C/G |
G33R |
0 |
Benign |
0.154 |
|
2 |
rs375646186 |
A/C |
K120T |
0.73 |
Benign |
0 |
|
3 |
rs375605948 |
G/T |
A70S |
0.36 |
Benign |
0 |
|
4 |
rs375330016 |
A/G |
E349K |
0.01 |
Probably damaging |
0.989 |
|
5 |
rs373799322 |
C/T |
P415L |
0.16 |
Benign |
0.051 |
|
6 |
rs370571312 |
A/G |
G206D |
0.05 |
Probably
damaging |
0.982 |
|
7 |
rs368406938 |
G/T |
C356F |
0 |
Probably
damaging |
1 |
|
8 |
rs367560052 |
A/G |
N318S |
1 |
Benign |
0 |
|
9 |
rs202149686 |
C/T |
I406T |
0.42 |
Benign |
0 |
|
10 |
rs202067116 |
C/T |
R325C |
0 |
Probably
damaging |
0.999 |
|
11 |
rs201162707 |
C/T |
V295A |
0 |
Probably
damaging |
1 |
|
12 |
rs199473659 |
C/T |
P351S |
0.04 |
Probably
damaging |
0.965 |
|
13 |
rs199473658 |
A/G |
M307I |
0.01 |
Probably
damaging |
0.993 |
|
14 |
rs199473657 |
G/T |
V227F |
0 |
Probably
damaging |
1 |
|
15 |
rs199473656 |
C/T |
L217P |
0 |
Probably
damaging |
1 |
|
16 |
rs199473655 |
C/T |
T192I |
0 |
Probably
damaging |
1 |
|
17 |
rs199473654 |
A/G |
G146S |
0 |
Probably
damaging |
1 |
|
18 |
rs199473653 |
A/G |
R82Q |
0 |
Probably
damaging |
1 |
|
19 |
rs199473652 |
A/G |
T74A |
0 |
Possibly
damaging |
0.951 |
|
20 |
rs199473651 |
G/T |
Y68D |
0 |
Probably
damaging |
1 |
|
21 |
rs199473650 |
G/T |
C54F |
0 |
Probably
damaging |
1 |
|
22 |
rs199473390 |
G/T |
R422L |
0.01 |
Benign |
0.425 |
|
23 |
rs199473389 |
C/T |
R312C |
0 |
Probably
damaging |
1 |
|
24 |
rs199473388 |
C/T |
T309I |
0.03 |
Probably
damaging |
0.997 |
|
25 |
rs199473387 |
A/C/G |
T305P |
0 |
Probably
damaging |
1 |
|
26 |
rs199473386 |
A/G |
E303K |
0 |
Probably
damaging |
0.997 |
|
27 |
rs199473385 |
C/G |
R260P |
0 |
Probably
damaging |
1 |
|
28 |
rs199473384 |
A/G |
R218Q |
0 |
Probably
damaging |
1 |
|
29 |
rs199473383 |
A/G |
G215D |
0 |
Probably
damaging |
1 |
|
30 |
rs199473382 |
A/G |
T192A |
0 |
Probably
damaging |
0.993 |
|
31 |
rs199473381 |
G/T |
R189I |
0 |
Probably
damaging |
1 |
|
32 |
rs199473380 |
G/T |
C154F |
0 |
Probably
damaging |
1 |
|
33 |
rs199473379 |
A/C/G |
G146D |
0 |
Probably
damaging |
1 |
|
34 |
rs199473378 |
A/G |
G144S |
0 |
Probably
damaging |
1 |
|
35 |
rs199473377 |
A/C/G |
G144D |
0 |
Probably
damaging |
1 |
|
36 |
rs199473376 |
C/T |
S136F |
0.1 |
Probably
damaging |
1 |
|
37 |
rs199473375 |
G/T |
V123G |
0 |
Probably
damaging |
0.999 |
|
38 |
rs199473374 |
C/T |
C101R |
0.3 |
Possibly
damaging |
0.614 |
|
39 |
rs199473373 |
C/T |
R82W |
0 |
Probably
damaging |
1 |
|
40 |
rs199473372 |
G/T |
D78Y |
0 |
Probably
damaging |
1 |
|
41 |
rs199473371 |
A/G |
D78G |
0 |
Probably
damaging |
1 |
|
42 |
rs199473370 |
A/G |
T75A |
0 |
Probably
damaging |
0.999 |
|
43 |
rs199473369 |
A/G |
D71N |
0 |
Probably
damaging |
1 |
|
44 |
rs199473368 |
A/G |
R67Q |
0 |
Probably
damaging |
1 |
|
45 |
rs151168735 |
C/T |
T130M |
0.39 |
Benign |
0.386 |
|
46 |
rs150326473 |
A/G |
I171V |
0.34 |
Benign |
0.448 |
|
47 |
rs147750704 |
A/G |
V93I |
0.47 |
Benign |
0.004 |
|
48 |
rs144022753 |
C/T |
T400M |
0.13 |
Benign |
0.025 |
|
49 |
rs141069645 |
A/G |
N410S |
0.76 |
Benign |
0 |
|
50 |
rs141035459 |
A/G |
G206S |
0.01 |
Possibly
damaging |
0.852 |
|
51 |
rs140147979 |
A/G |
V167M |
0.07 |
Possibly
damaging |
0.797 |
|
52 |
rs140090605 |
A/G |
R8H |
0.15 |
Benign |
0.192 |
|
53 |
rs140053197 |
A/G |
D389N |
0.21 |
Possibly
damaging |
0.6 |
|
54 |
rs111909178 |
A/T |
N410Y |
0.1 |
Benign |
0 |
|
55 |
rs104894585 |
C/G/T |
T75R |
0 |
Probably
damaging |
1 |
|
56 |
rs104894584 |
A/G |
D172N |
0.18 |
Probably
damaging |
1 |
|
57 |
rs104894583 |
A/C |
N216H |
0.01 |
Probably
damaging |
1 |
|
58 |
rs104894582 |
A/G |
V302M |
0 |
Probably
damaging |
1` |
|
59 |
rs104894581 |
C/T |
P186L |
0 |
Probably
damaging |
1 |
|
60 |
rs104894580 |
C/T |
R67W |
0 |
Probably
damaging |
1 |
|
61 |
rs104894579 |
A/C/G/T |
G300D |
0 |
Probably
damaging |
1 |
|
62 |
rs104894578 |
C/T |
R218W |
0 |
Probably
damaging |
1 |
|
63 |
rs104894575 |
A/T |
D71V |
0 |
Probably
damaging |
1 |
|
64 |
rs79650811 |
G/T |
F98V |
0.67 |
Benign |
0.001 |
Note: Letters in bold
indicates deleterious amino acid variants prediction by SIFT and Polyphen 2.0
program.
The output file shows the
predicted free-energy change value or sign (∆∆G), which is
calculated from the unfolding Gibbs free energy value of the mutated protein
minus the unfolding Gibbs free energy value of the native type (kcal/mol).
Positive ∆∆G values mean that the mutated protein posses high
stability and vice versa20. iStable 21 was the next
predictor which we used to predict protein stability changes which consists of
grid computing architecture which uses protein sequence as input and integrates
results from different element predictors. MUpro21 is another tool
used to predict stability changes due to single base mutations upon the input
of sequence information and rarely protein structure. It predicts if a
stability change will lead to destabilization or not without giving the actual
DDG value.
Analysis by Align-GVGD, mCSM and nsSNP Analyzer:
Align-GVGD22 is an extension of Grantham Difference, which
we used to predict the transactivation activity of each missense mutation. It
is predicted by combining GV and GD scores which considers composition,
polarity and volume of amino acids. These three parameters are taken as the
three axes and when all amino acids are plotted, it forms a cloud of points at a given position in multiple
sequence alignment . These points are enclosed in a box with maximum and
minimum values of the parameters as its coordinates. GV is the Euclidian length
of the main diagonal of the box which indicates the biochemical variation at a
particular position and GD is calculated by plotting a mutation the graph and
measuring the Euclidian distance between that mutation and a closest point on
the box. GD=0, if the substitution lies in the box otherwise it is greater than
0. Another tool is Mcsm23 which we used to predict stability changes
and it depends on graph based signatures which takes into account not only the
protein stability but also the protein-protein and protein-nucleic acid
interactions. Finally we used nsSNP Analyzer24 which predicts nsSNP’s phenotypic effect. nsSNPAnalyzer uses a machine learning method called
Random Forest to classify the nsSNPs. Random Forest is a type of classifier
consisting of an entity of tree-structured classifiers24. It takes a protein sequence and its nsSNPs as inputs and predicts if
the nsSNP is disease associated or functionally neutral.
Modeling and Protein docking:
SWISS-MODEL
is a fully automated protein structure homology-modeling server, accessible via
the ExPASy web server. The purpose of this server is to make Protein Modeling
easy. The amino acid sequence of a protein is submitted to construct a 3D
model. Template selection, alignment and model building are done completely
automated by the server25. Interaction free energies are very
important for analyzing binding tendency in proteins. We used the Docking
Server26 program which calculates the binding free energy between
protein complexes. Docking server calculates the parameters needed at different
steps for the docking, i.e. accurate ligand geometry optimization, energy
minimization, charge calculation, docking calculation and protein-ligand
complex representation. In order to calculate the binding affinity between
KCNJ2 protein and Flecainide, we used SWISSMODEL to model the KCNJ2 protein
structure and SWISSPDB viewer for performing mutation. The two dimensional
structure of Flecainide drug is obtained from the PubChem in the Smiles format.
Then the structure was submitted to corina to get the three dimensional
structure of drug molecule. Finally the 3D structures of protein and flecainide
were submitted to Docking Server to get them docked and then we got their free
binding energy.
RESULTS AND
DISCUSSION:
Screening of Deleterious Mutations by
SIFT and PolyPhen-2.0 Server:
In
disease associated gene studies, we need to distinguish between large number of
neutral SNPs and SNPs of functional importance. This task could be easily
achieved with the development computational techniques particularly with
different genomic algorithms. In the present investigation, the initial
screening was done using SIFT which predicted the tolerance indices of 64 SNP
variants. As the tolerance level decreases, the functional importance of the
amino acid substitution increases and vice versa. The results are shown in
Table 1.
Among
the 64 SNP variants, 47 variants were found to be deleterious having tolerance
indices ≤ 0.05. Subsequently, the structural alterations of these
variants were determined using PolyPhen-2.0 program. Protein sequence with
mutational position and residues were submitted to the server and it predicted
the level of damage for the structural alteration. Probably damaging stands the
nearest chances to find mutations make the protein non-functional, possibly
damaging are less likely prone in the category and benign come near the safe
zone. In Table 1, 64 variants are listed, out of which 44 variants are
predicted to be damaging by PolyPhen-2.0 server.
Prediction of Disease Probability of the Mutations by
SNPs and GO, PANTHER and PhD-SNP:
Furthermore,
SNPs and GO, PANTHER and PhD-SNP program were used to predict the disease and
probability score of the nsSNPs and the results are shown in (Table 2).
Table 2: Prediction of functionally significant nsSNPs
by SNPs and GO, PhD-SNP and PANTHER algorithm.
|
Mutation |
SNPs and GO |
PhD-SNP |
PANTHER |
||||||
|
|
Prediction |
Score |
RI |
Prediction |
Score |
RI |
Prediction |
Score |
RI |
|
E349K |
Disease |
4 |
0.719 |
Disease |
1 |
0.539 |
Disease |
7 |
0.873 |
|
G206D |
Disease |
6 |
0.818 |
Disease |
9 |
0.93 |
Disease |
5 |
0.765 |
|
C356F |
Disease |
6 |
0.803 |
Disease |
8 |
0.881 |
Disease |
8 |
0.877 |
|
R325C |
Disease |
6 |
0.815 |
Disease |
8 |
0.883 |
Disease |
10 |
0.99 |
|
V295A |
Disease |
3 |
0.668 |
Disease |
5 |
0.727 |
Disease |
6 |
0.806 |
|
P351S |
Disease |
3 |
0.639 |
Disease |
5 |
0.725 |
Disease |
3 |
0.639 |
|
V227F |
Disease |
1 |
0.574 |
Disease |
5 |
0.735 |
Disease |
4 |
0.722 |
|
L217P |
Disease |
7 |
0.828 |
Disease |
8 |
0.923 |
Disease |
9 |
0.972 |
|
T192I |
Disease |
5 |
0.739 |
Disease |
7 |
0.829 |
Disease |
10 |
0.989 |
|
G146S |
Disease |
7 |
0.848 |
Disease |
8 |
0.904 |
Disease |
10 |
0.989 |
|
R82Q |
Disease |
5 |
0.725 |
Disease |
8 |
0.891 |
Disease |
3 |
0.669 |
|
Y68D |
Disease |
8 |
0.894 |
Disease |
9 |
0.944 |
Disease |
9 |
0.953 |
|
C54F |
Disease |
7 |
0.856 |
Disease |
8 |
0.919 |
Disease |
10 |
0.99 |
|
R312C |
Disease |
5 |
0.768 |
Disease |
8 |
0.877 |
Disease |
10 |
0.983 |
|
T305P |
Disease |
5 |
0.755 |
Disease |
7 |
0.867 |
Disease |
9 |
0.944 |
|
E303K |
Disease |
8 |
0.879 |
Disease |
8 |
0.919 |
Disease |
10 |
0.977 |
|
R260P |
Disease |
6 |
0.813 |
Disease |
6 |
0.808 |
Disease |
2 |
0.598 |
|
R218Q |
Disease |
8 |
0.879 |
Disease |
9 |
0.94 |
Disease |
10 |
0.983 |
|
G215D |
Disease |
6 |
0.821 |
Disease |
8 |
0.901 |
Disease |
10 |
0.983 |
|
T192A |
Disease |
3 |
0.655 |
Disease |
2 |
0.617 |
Disease |
9 |
0.949 |
|
R189I |
Disease |
2 |
0.576 |
Disease |
5 |
0.736 |
Disease |
10 |
0.994 |
|
C154F |
Disease |
9 |
0.931 |
Disease |
10 |
0.982 |
Disease |
10 |
0.996 |
|
G146D |
Disease |
7 |
0.856 |
Disease |
9 |
0.982 |
Disease |
10 |
0.993 |
|
G144D |
Disease |
5 |
0.773 |
Disease |
9 |
0.935 |
Disease |
1 |
0.544 |
|
S136F |
Disease |
4 |
0.692 |
Disease |
8 |
0.882 |
Disease |
6 |
0.78 |
|
V123G |
Disease |
2 |
0.618 |
Disease |
7 |
0.831 |
Disease |
3 |
0.649 |
|
R82W |
Disease |
7 |
0.861 |
Disease |
9 |
0.961 |
Disease |
9 |
0.936 |
|
D78Y |
Disease |
8 |
0.922 |
Disease |
9 |
0.97 |
Disease |
10 |
0.98 |
|
D78G |
Disease |
9 |
0.927 |
Disease |
9 |
0.953 |
Disease |
8 |
0.923 |
|
T75A |
Disease |
8 |
0.903 |
Disease |
8 |
0.886 |
Disease |
9 |
0.955 |
|
D71N |
Disease |
7 |
0.869 |
Disease |
9 |
0.869 |
Disease |
10 |
0.982 |
|
R67Q |
Disease |
6 |
0.79 |
Disease |
6 |
0.814 |
Disease |
9 |
0.93 |
|
T75R |
Disease |
8 |
0.919 |
Disease |
9 |
0.934 |
Disease |
10 |
0.989 |
|
N216H |
Disease |
3 |
0.638 |
Disease |
6 |
0.778 |
Disease |
6 |
0.825 |
|
R67W |
Disease |
6 |
0.79 |
Disease |
8 |
0.904 |
Disease |
10 |
0.988 |
|
G300D |
Disease |
6 |
0.871 |
Disease |
8 |
0.898 |
Disease |
10 |
0.982 |
|
R218W |
Disease |
7 |
0.871 |
Disease |
9 |
0.962 |
Disease |
10 |
0.996 |
|
D71V |
Disease |
9 |
0.929 |
Disease |
9 |
0.969 |
Disease |
10 |
0.995 |
|
|
|
|
Fig.1. KCNJ2
protein-Flecainide docked complex structures
Table 3: Prioritization
of nsSNPs by I Mutant 2.0, MUpro and iStable
algorithm.
|
Complex Name |
Prediction |
ΔΔG (kcal/mol) |
Prediction |
Conf. Score |
Prediction |
Conf. Score |
|
E349K |
Decrease |
-1.69 |
Decrease |
-0.85 |
Decrease |
0.85 |
|
G146S |
Decrease |
-1.47 |
Decrease |
-0.22 |
Decrease |
0.78 |
|
G206D |
Decrease |
-1.15 |
Decrease |
-0.39 |
Decrease |
0.81 |
|
C356F |
Decrease |
-0.1 |
Decrease |
-0.06 |
Decrease |
0.7 |
|
V295A |
Decrease |
-0.37 |
Decrease |
-0.1 |
Decrease |
0.86 |
|
P351S |
Decrease |
-2.2 |
Decrease |
-1 |
Decrease |
0.81 |
|
L217P |
Decrease |
-0.34 |
Decrease |
-1 |
Decrease |
0.9 |
|
T192I |
Decrease |
-1.54 |
Decrease |
-0.8 |
Decrease |
0.62 |
|
Y68D |
Decrease |
-0.6 |
Decrease |
-0.09 |
Decrease |
0.87 |
|
E303K |
Decrease |
-0.05 |
Decrease |
-1 |
Decrease |
0.74 |
|
R312C |
Decrease |
-0.12 |
Decrease |
-0.14 |
Decrease |
0.6 |
|
R260P |
Decrease |
-2.66 |
Decrease |
-0.8 |
Decrease |
0.72 |
|
R218Q |
Decrease |
-1.26 |
Decrease |
-1 |
Decrease |
0.74 |
|
T192A |
Decrease |
-2.15 |
Decrease |
-0.97 |
Decrease |
0.69 |
|
R189I |
Decrease |
-0.59 |
Decrease |
-0.22 |
Decrease |
0.73 |
|
C154F |
Decrease |
-0.17 |
Decrease |
-1 |
Decrease |
0.79 |
|
G144D |
Decrease |
-1.29 |
Decrease |
-0.81 |
Decrease |
0.73 |
|
V123G |
Decrease |
-3.62 |
Decrease |
-1 |
Decrease |
0.83 |
|
D78Y |
Decrease |
-0.97 |
Decrease |
-0.41 |
Decrease |
0.74 |
|
D78G |
Decrease |
-0.28 |
Decrease |
-0.82 |
Decrease |
0.79 |
|
T75A |
Decrease |
-1.06 |
Decrease |
-1 |
Decrease |
0.82 |
|
D71N |
Decrease |
-1.13 |
Decrease |
-1 |
Decrease |
0.83 |
|
R67Q |
Decrease |
-0.9 |
Decrease |
-0.43 |
Decrease |
0.73 |
|
T75R |
Decrease |
-0.05 |
Decrease |
-0.23 |
Decrease |
0.79 |
|
N216H |
Decrease |
-1.73 |
Decrease |
-1 |
Decrease |
0.67 |
|
G300D |
Decrease |
-1.04 |
Decrease |
-0.71 |
Decrease |
0.74 |
|
R218W |
Decrease |
-0.61 |
Decrease |
-1 |
Decrease |
0.7 |
|
D71V |
Decrease |
-1.3 |
Decrease |
-0.09 |
Decrease |
0.84 |
Table 4: Prioritization of nsSNPs by Align GVGD, mCSM and nsSNP algorithm.
|
Complex
Name |
GV |
GD |
Prediction |
Score |
DDG
(kcal/mol) |
Stability |
nsSNP
analyzer |
|
G206D |
0 |
93.77 |
65 |
79.1 |
-0.903 |
Destabilizing |
Disease |
|
C356F |
0 |
204.39 |
65 |
44.3 |
-1.293 |
Destabilizing |
Disease |
|
V295A |
0 |
65.28 |
65 |
0.1 |
-2.296 |
Highly
destabilizing |
Disease |
|
L217P |
0 |
97.78 |
65 |
5.6 |
-1.105 |
Destabilizing |
Disease |
|
Y68D |
0 |
159.94 |
65 |
25.6 |
-1.807 |
Destabilizing |
Disease |
|
R312C |
0 |
179.53 |
65 |
11.7 |
-1.617 |
Destabilizing |
Disease |
|
R260P |
0 |
102.71 |
65 |
19.2 |
-0.353 |
Destabilizing |
Disease |
|
R189I |
0 |
97.59 |
65 |
0.3 |
-0.48 |
Destabilizing |
Disease |
|
C154F |
0 |
204.39 |
65 |
9.2 |
-1.282 |
Destabilizing |
Disease |
|
G144D |
0 |
93.77 |
65 |
0 |
-2.32 |
Highly
destabilizing |
Disease |
|
V123G |
0 |
109.55 |
65 |
5.1 |
-2.683 |
Highly
destabilizing |
Disease |
|
D78G |
0 |
93.77 |
65 |
43.4 |
-0.624 |
Destabilizing |
Disease |
|
T75R |
0 |
70.97 |
65 |
50.2 |
-0.686 |
Destabilizing |
Disease |
|
N216H |
0 |
68.35 |
65 |
3.3 |
-1.254 |
Destabilizing |
Disease |
|
G300D |
0 |
93.77 |
65 |
0.9 |
-2.91 |
Highly
destabilizing |
Disease |
|
R218W |
0 |
101.29 |
65 |
5.9 |
-0.656 |
Destabilizing |
Disease |
|
D71V |
0 |
152.22 |
65 |
0.9 |
-0.824 |
Destabilizing |
Disease |
Table 5: Docking analysis of flecainide with native and mutant types KCNJ2 protein
|
Complex Name |
Free energy of
binding (kcal/mol) |
Number of H- bonds |
|
Native KCNJ2-Flecainide |
-4.33 |
12 |
|
G206D- Flecainide |
-5.05 |
11 |
|
C356F- Flecainide |
-4.66 |
10 |
|
V295A- Flecainide |
-4.09 |
9 |
|
L217P- Flecainide |
-4.92 |
10 |
|
Y68D- Flecainide |
-4.71 |
7 |
|
R312C- Flecainide |
-4.32 |
10 |
|
R260P- Flecainide |
-3.99 |
5 |
|
R189I- Flecainide |
-5.43 |
9 |
|
C154F- Flecainide |
-4.23 |
9 |
|
G144D- Flecainide |
-4.45 |
10 |
|
V123G- Flecainide |
-4.4 |
9 |
|
D78G- Flecainide |
-4.68 |
8 |
|
T75R- Flecainide |
-4.72 |
9 |
|
N216H- Flecainide |
-4.65 |
8 |
|
G300D- Flecainide |
-4.57 |
9 |
|
R218W- Flecainide |
-4.98 |
6 |
|
D71V- Flecainide |
-3.57 |
9 |
Fig. 2. Details of
intermolecular hydrogen bonding network in (A) KCNJ2 protein-flecainide and (B)
mutant (D71V) KCNJ2 protein-flecainide complex structures.
The disease probability
score was predicted to be higher for the E349K, G206D, C356F, R325C, V295A,
P351S, V227F, L217P, T192I, G146S, R82Q, Y68D, C54F, R312C, T305P, E303K,
R260P, R218Q, G215D, T192A, R189I, C154F, G146D, G144D, S136F, V123G, R82W,
D78Y, D78G, T75A D71N, R67Q, T75R, N216H, R67W, G300D, R218W, D71V.All the 38
variants showed a reliability index values between 0.539 to 0.996.
Screening of nsSNPs using I-Mutant 2.0,
iStable and MUPRO:
All
the 64 SNPs were submitted individually to I-Mutant 2.0 program which predicted
the DDG value of each variant. The negative DDG value means that the point
mutation is less stable which makes the variants significant in our study.
Among the 64 SNPs, 28 were found to be less stable and thus functionally
significant. Further, we used two more programs namely iStable and MUPRO for
the prediction of stability changes which confirms 28 variants from the result of I-Mutant 2.0 to be less stable. The results
are listed in Table 3.
Prediction of functionally significant
mutation using Align-GVGD, mCSM and nsSNP Analyzer
The
28 SNPs identified to be less stable were then screened further using
Align-GVGD and based on GVGD values
predicted class 65 which is most likely significant were found to be 17 SNP
variants listed in Table 4. Then again stability was predicted using mCSM as
‘Destabilizing’ and ‘Highly Destabilizing’. Furthermore, the 17 variants were
finally confirmed as functionally significant by nsSNP. Analyzer as it was
predicted using this program that all the 17 SNP variants listed by Align-GVGD
and mCSM are disease causing SNPs. The screening efficiency was confirmed with
the presence of experimental proof for mutations R218W6, D71V 6,
R312C7 and N216H7, which tells that these mutations lead
to loss of function in Kir 2.1 channel.
Protein Docking:
Protein docking was carried
out for the drug and the 17 SNP variants to compare the binding efficiency
between native and mutant complexes. The free energy of the native complex
is-4.33 kcal/mol whereas for the mutant complexes the free binding energy
ranges from -3.57 to -5.43 kcal/mol. Minimum binding efficiency is shown by
D71V mutation which suggest that the 3D conformation of flecainide does not fit
effectively into the 3D space of the binding residues of the mutant as compared
to other mutants. The number of hydrogen bond interaction in the native complex
is 12 and for the mutant complexes the number of hydrogen bond interaction
ranges from 5 to 11. The free energy and number of H bond interactions of the
17 variants are shown in Table 5. The docked complex structure is shown in
Fig.1. Further intermolecular interactions were analysed and the results are
shown in Fig. 2.
CONCLUSION:
KCNJ2 gene belongs to a
family which produces potassium channels. These channels transports positive
charge ions of potassium out of cell and also play a key role in a cell’s
ability to generate and transmit electrical signals. In particular, a mutation
in the KCNJ2 gene causes ATS disease. Hence, in the present investigation we
made an attempt to detect the deleterious mutations in KCNJ2 gene by
computational analysis. A total of 64
missense mutations were retrieved for our analysis, SIFT predicted 71.87% of
SNPs as functionally significant, PolyPhen-2.0 predicted 56.25% of SNPs as
damaging. In addition, SNPs and Go, PANTHER and PhD-SNP predicted the
disease probability of the 38 nsSNP as high. I-Mutant 2.0, MUpro, iStable predicted 43.75% of SNPs to affect the stability, Align GVGD,
mCSM, nsSNP predicted 17 mutations to be highly deleterious. Moreover, docking
study was employed with the aid of flecainide molecule to improve the
prediction accuracy. Finally, the data obtained from different algorithms were
combined to eliminate the false positive in the computational prediction
methods. The results suggest that V295A, R312C,
R260P, C154F and D71V have dramatic impact in the KCNJ2 protein
structure and its function. We certainly believe that this approach of screening deleterious
mutations helps in predicting early prognosis and the response required for a
particular therapeutic approach.
ACKNOWLEDGMENTS:
The authors express deep
sense of gratitude to the management of Vellore Institute of Technology for all
the support, assistance and constant encouragements to carry out this work.
REFERENCES:
1.
Cargill M, Altshuler D, Ireland J, Sklar P, Ardlie K, Patil N, Shaw N,
Lane CR, Lim EP, Kalyanaraman N, Nemesh J, Ziaugra L, Friedland L, Rolfe A,
Warrington J, Lipshutz R, Daley GQ, Lander ES. Characterization of
single-nucleotide polymorphisms in coding regions of human genes. Nat Genet.
22(3); 1999: 231-238.
2.
Nadeau JH. Single nucleotide polymorphisms: tackling complexity.
Nature. 420(6915); 2002:517-518.
3.
Ramensky V, Bork P, Sunyaev S. Human non-synonymous SNPs: server and
survey. Nucleic Acids Res. 30(17); 2002:3894-900.
4.
Rennell D, Bouvier SE, Hardy LW, Poteete AR. Systematic mutation of
bacteriophage T4 lysozyme. J Mol Biol. 222(1); 1991:67-88.
5.
Choi BO, Kim J, Suh BC, Yu JS, Sunwoo IN, Kim SJ, Kim GH, Chung KW.
Mutations of KCNJ2 gene associated with Andersen-Tawil syndrome in Korean
families. J Hum Genet. 52(3); 2007:280-3.
6.
Tristani-Firouzi M, Jensen JL, Donaldson MR, Sansone V, Meola G, Hahn
A, Bendahhou S, Kwiecinski H, Fidzianska A, Plaster N, Fu YH, Ptacek LJ, Tawil
R. Functional and clinical characterization of KCNJ2 mutations associated with
LQT7 (Andersen syndrome). J Clin Invest. 110(3); 2002:381-8.
7.
Limberg MM, Zumhagen S, Netter MF, Coffey AJ, Grace A, Rogers J,
Böckelmann D, Rinné S, Stallmeyer B, Decher N, Schulze-Bahr E. Non
dominant-negative KCNJ2 gene mutations leading to Andersen-Tawil syndrome with
an isolated cardiac phenotype. Basic Res Cardiol. 108(3); 2013:353.
8.
Schulze-Bahr E. Short QT syndrome or Andersen syndrome: Yin and Yang of
Kir2.1 channel dysfunction. Circ Res. 96(7); 2005:703-704.
9.
Haruna Y, Kobori A, Makiyama T, Yoshida H, Akao M, Doi T, Tsuji K, Ono
S, Nishio Y, Shimizu W, Inoue T, Murakami T, Tsuboi N, Yamanouchi H, Ushinohama
H, Nakamura Y, Yoshinaga M, Horigome H, Aizawa Y, Kita T, Horie M.
Genotype-phenotype correlations of KCNJ2 mutations in Japanese patients with
Andersen-Tawil syndrome.Hum Mutat. 28(2); 2007:208.
10.
Jurkat-Rott K, Lehmann-Horn F. Muscle channelopathies and critical
points in functional and genetic studies. J Clin Invest. 115(8);
2005:2000-2009.
11.
Mohammadi B, Jurkat-Rott K, Alekov A, Dengler R, Bufler J, Lehmann-Horn
F. Preferred mexiletine block of human sodium channels with IVS4 mutations and
its pH-dependence. Pharmacogenet Genomics. 15(4); 2005:235-244.
12.
Kumar P, Henikoff S, Ng PC. Predicting the effects of coding
non-synonymous variants on protein function using the SIFT algorithm. Nat
Protoc. 4(7); 2009: 1073-1081.
13.
Ng PC, Henikoff S. SIFT: Predicting amino acid changes that affect
protein function. Nucleic Acids Res. 31(13); 2003:3812-3814.
14.
Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P,
Kondrashov AS, Sunyaev SR. A method and server for predicting damaging missense
mutations. Nat Methods. 7(4); 2010:248-249.
15.
Calabrese R, Capriotti E, Fariselli P, Martelli PL, Casadio R.
Functional annotations improve the predictive score of human disease-related
mutations in proteins. Hum Mutat. 30(8); 2009:1237-1244.
16.
Capriotti E, Altman RB, Bromberg Y. Collective judgment predicts
disease-associated single nucleotide variants. BMC Genomics. 14 Suppl 3:S2.
2013.
17.
Thomas PD, Campbell MJ, Kejariwal A, Mi H, Karlak B, Daverman R, Diemer
K, Muruganujan A, Narechania A. PANTHER: a library of protein families and
subfamilies indexed by function. Genome Res. 13(9); 2003:2129-2141.
18.
Thomas PD, Kejariwal A, Campbell MJ, Mi H, Diemer K, Guo N, Ladunga I,
Ulitsky-Lazareva B, Muruganujan A, Rabkin S, Vandergriff JA, Doremieux O.
PANTHER: a browsable database of gene products organized by biological
function, using curated protein family and subfamily classification. Nucleic
Acids Res. 31(1); 2003:334-341.
19.
Capriotti E, Calabrese R, Casadio R. Predicting the insurgence of human
genetic diseases associated to single point protein mutations with support
vector machines and evolutionary information. Bioinformatics. 22(22);
2006:2729-34.
20.
Capriotti E, Fariselli P, Casadio R. I-Mutant2.0: predicting stability
changes upon mutation from the protein sequence or structure. Nucleic Acids
Res. 2005:33(Web Server issue):W306-10.
21.
Chen CW, Lin J, Chu YW. iStable: off-the-shelf predictor integration for
predicting protein stability changes. BMC Bioinformatics. 2013:14 Suppl 2:S5.
22.
Mathe E, Olivier M, Kato S, Ishioka C, Hainaut P, Tavtigian SV.
Computational approaches for predicting the biological effect of p53 missense
mutations: a comparison of three sequence analysis based methods. Nucleic Acids
Res. 34(5); 2006:1317-1325.
23.
Pires DE, Ascher DB, Blundell TL. mCSM: predicting the effects of
mutations in proteins using graph-based signatures. Bioinformatics. 30(3);
2014:335-42.
24.
Bao L, Zhou M, Cui Y. nsSNPAnalyzer: identifying disease-associated
nonsynonymous single nucleotide polymorphisms. Nucleic Acids Res. 2005:33(Web
Server issue):W480-2.
25.
Schwede T, Kopp J, Guex N, Peitsch MC. SWISS-MODEL: An automated
protein homology-modeling server. Nucleic Acids Res. 31(13); 2003:3381-3385.
26.
Bikadi Z, Hazai E. Application of the PM6 semi-empirical method to
modeling proteins enhances docking accuracy of AutoDock. J Cheminform. 1:15;
2009.
Received on 10.09.2015 Modified on 25.09.2015
Accepted on 28.09.2015 © RJPT All right reserved
Research J. Pharm. and Tech. 8(11): Nov., 2015; Page
1540-1547
DOI: 10.5958/0974-360X.2015.00275.9