Amino acid dipeptide frequency for Nipah virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
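As a rough illustration of the counting described above, the following Python sketch tallies overlapping dipeptides in a set of protein sequences. It is not the code used by the database: the function name `dipeptide_frequencies`, the one-letter alphabet, and the choice to count pairs only within a protein (never across the boundary between two proteins) are assumptions made here, and the toy sequences are not Nipah virus proteins.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X (Xaa) stands for anything else


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping pairs in `proteins`.

    Hypothetical helper: pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs: positions (0,1), (1,2), (2,3), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    # Report all 441 ordered pairs, including those never observed.
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input (U is non-standard and becomes X).
freqs = dipeptide_frequencies(["MKKLLA", "MKU"])
print(len(freqs))    # number of ordered pairs reported
print(freqs["MK"])   # fraction of all pairs that are Met-Lys
```

Multiplying the returned fractions by 1000 gives values in the per mille units used in the table.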

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
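The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and comparing each value directly against 2.5‰ (1/400) is an assumption: the page does not state the exact rule used to colour the cells. The two example values are taken from the table.

```python
EXPECTED_PER_MILLE = 1000 / 400  # uniform expectation over 400 dipeptides, in per mille


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def representation(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a table value relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# LeuSer and AlaCys values as listed in the table.
for name, value in [("LeuSer", 9.677), ("AlaCys", 0.509)]:
    print(name, per_mille_to_fraction(value), representation(value))
```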

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.226 ± 1.579 | 0.509 ± 0.242 | 3.226 ± 0.535 | 4.075 ± 0.824 | 1.358 ± 0.45 | 3.735 ± 0.801 | 0.509 ± 0.36 | 2.716 ± 1.168 | 3.735 ± 1.343 | 5.263 ± 1.33 | 1.019 ± 0.302 | 1.698 ± 0.571 | 1.358 ± 0.51 | 2.716 ± 0.529 | 1.358 ± 0.535 | 3.396 ± 0.654 | 2.207 ± 1.125 | 5.433 ± 1.911 | 1.019 ± 0.437 | 1.698 ± 0.512 | 0.0 ± 0.0
Cys | 0.34 ± 0.21 | 0.34 ± 0.24 | 0.34 ± 0.24 | 0.509 ± 0.225 | 1.188 ± 0.385 | 1.019 ± 0.483 | 0.17 ± 0.12 | 1.528 ± 0.59 | 0.34 ± 0.312 | 1.528 ± 0.446 | 0.0 ± 0.0 | 0.849 ± 0.475 | 1.358 ± 0.901 | 1.698 ± 0.431 | 0.679 ± 0.473 | 2.037 ± 0.35 | 1.358 ± 0.689 | 0.849 ± 0.6 | 0.509 ± 0.241 | 0.679 ± 0.279 | 0.0 ± 0.0
Asp | 2.207 ± 0.675 | 0.509 ± 0.36 | 4.075 ± 1.23 | 4.754 ± 1.034 | 3.056 ± 0.609 | 3.226 ± 1.184 | 1.019 ± 0.523 | 3.735 ± 0.46 | 4.924 ± 0.376 | 5.942 ± 1.245 | 0.509 ± 0.204 | 4.754 ± 0.658 | 4.075 ± 0.58 | 2.547 ± 0.607 | 3.056 ± 0.82 | 4.754 ± 0.717 | 2.547 ± 0.932 | 4.075 ± 0.743 | 0.849 ± 0.297 | 2.207 ± 0.606 | 0.0 ± 0.0
Glu | 3.056 ± 0.876 | 3.226 ± 0.862 | 5.263 ± 1.773 | 5.093 ± 1.928 | 3.226 ± 0.8 | 4.075 ± 1.021 | 1.528 ± 0.459 | 6.621 ± 0.571 | 2.547 ± 0.895 | 5.093 ± 0.973 | 1.019 ± 0.485 | 3.905 ± 1.145 | 2.547 ± 0.717 | 2.547 ± 0.673 | 3.735 ± 0.826 | 3.905 ± 0.835 | 3.905 ± 0.707 | 2.716 ± 0.666 | 0.509 ± 0.274 | 1.188 ± 0.664 | 0.0 ± 0.0
Phe | 3.056 ± 0.979 | 0.679 ± 0.367 | 1.188 ± 0.512 | 2.207 ± 0.713 | 1.528 ± 0.443 | 1.188 ± 0.476 | 0.509 ± 0.242 | 2.207 ± 0.511 | 2.377 ± 0.999 | 3.226 ± 1.029 | 1.019 ± 0.477 | 2.207 ± 0.79 | 1.358 ± 0.425 | 0.679 ± 0.26 | 1.698 ± 0.406 | 1.528 ± 0.614 | 2.037 ± 0.567 | 2.207 ± 0.543 | 0.34 ± 0.24 | 0.679 ± 0.421 | 0.0 ± 0.0
Gly | 3.905 ± 1.061 | 0.17 ± 0.12 | 3.226 ± 0.729 | 2.716 ± 0.767 | 2.377 ± 0.474 | 3.905 ± 1.325 | 0.679 ± 0.267 | 4.075 ± 0.629 | 4.584 ± 1.297 | 6.452 ± 0.733 | 1.868 ± 0.492 | 2.547 ± 0.47 | 2.377 ± 0.652 | 2.037 ± 0.427 | 4.075 ± 0.811 | 5.942 ± 1.639 | 2.547 ± 0.895 | 3.905 ± 1.377 | 0.679 ± 0.301 | 2.377 ± 0.333 | 0.0 ± 0.0
His | 0.679 ± 0.217 | 0.509 ± 0.241 | 1.698 ± 0.466 | 0.679 ± 0.341 | 0.0 ± 0.0 | 1.019 ± 0.31 | 1.188 ± 0.661 | 1.019 ± 0.377 | 0.509 ± 0.36 | 3.056 ± 0.751 | 0.679 ± 0.48 | 0.679 ± 0.48 | 0.849 ± 0.37 | 0.679 ± 0.31 | 0.509 ± 0.242 | 0.849 ± 0.6 | 0.509 ± 0.36 | 0.679 ± 0.271 | 0.509 ± 0.349 | 1.358 ± 0.191 | 0.0 ± 0.0
Ile | 4.584 ± 0.71 | 1.528 ± 0.409 | 5.433 ± 0.812 | 3.735 ± 0.676 | 2.037 ± 0.823 | 5.603 ± 1.158 | 1.358 ± 0.66 | 6.452 ± 1.281 | 6.791 ± 0.638 | 5.772 ± 1.653 | 1.868 ± 0.55 | 3.905 ± 0.412 | 2.886 ± 0.758 | 4.075 ± 1.313 | 4.075 ± 0.645 | 8.829 ± 2.151 | 4.414 ± 1.173 | 3.565 ± 0.618 | 0.509 ± 0.25 | 3.056 ± 1.061 | 0.0 ± 0.0
Lys | 2.886 ± 0.798 | 0.849 ± 0.413 | 5.433 ± 1.575 | 4.584 ± 1.587 | 1.868 ± 0.667 | 3.565 ± 0.567 | 0.849 ± 0.453 | 4.924 ± 2.108 | 3.735 ± 0.69 | 5.093 ± 0.55 | 1.188 ± 0.288 | 3.565 ± 0.656 | 1.698 ± 0.631 | 1.528 ± 0.62 | 3.396 ± 0.72 | 6.112 ± 1.113 | 4.584 ± 0.62 | 4.414 ± 1.588 | 0.34 ± 0.24 | 2.547 ± 0.745 | 0.0 ± 0.0
Leu | 4.244 ± 0.774 | 1.188 ± 0.4 | 6.282 ± 0.798 | 7.131 ± 1.077 | 3.396 ± 1.084 | 4.584 ± 0.716 | 1.528 ± 0.64 | 7.301 ± 1.273 | 5.603 ± 1.408 | 6.961 ± 1.469 | 1.868 ± 0.41 | 5.942 ± 0.991 | 3.226 ± 0.658 | 3.396 ± 0.67 | 4.924 ± 1.175 | 9.677 ± 1.285 | 3.565 ± 1.179 | 5.772 ± 0.582 | 0.509 ± 0.225 | 2.716 ± 0.523 | 0.0 ± 0.0
Met | 1.019 ± 0.346 | 0.0 ± 0.0 | 1.188 ± 0.51 | 1.698 ± 0.529 | 0.34 ± 0.2 | 1.188 ± 0.582 | 0.509 ± 0.36 | 2.377 ± 0.968 | 1.019 ± 0.317 | 3.056 ± 0.566 | 1.019 ± 0.383 | 1.358 ± 0.612 | 1.019 ± 0.609 | 0.17 ± 0.12 | 1.358 ± 0.337 | 1.868 ± 0.523 | 1.358 ± 0.547 | 1.358 ± 0.526 | 0.17 ± 0.12 | 1.188 ± 0.441 | 0.0 ± 0.0
Asn | 1.698 ± 0.573 | 1.868 ± 0.447 | 2.886 ± 0.508 | 2.716 ± 0.897 | 0.849 ± 0.391 | 3.226 ± 0.33 | 1.019 ± 0.397 | 5.603 ± 1.722 | 2.207 ± 0.818 | 6.961 ± 0.963 | 1.358 ± 0.523 | 3.905 ± 0.546 | 3.735 ± 0.709 | 2.377 ± 0.666 | 1.868 ± 0.733 | 4.075 ± 0.528 | 4.924 ± 0.871 | 4.244 ± 0.958 | 1.019 ± 0.514 | 1.528 ± 0.653 | 0.0 ± 0.0
Pro | 2.207 ± 0.936 | 0.17 ± 0.12 | 1.698 ± 0.374 | 4.244 ± 0.9 | 1.528 ± 0.319 | 2.886 ± 0.813 | 0.34 ± 0.183 | 3.396 ± 0.411 | 3.056 ± 0.976 | 2.716 ± 0.835 | 1.698 ± 0.778 | 2.886 ± 0.876 | 2.886 ± 0.548 | 1.868 ± 0.53 | 2.377 ± 0.561 | 3.905 ± 0.772 | 2.377 ± 0.914 | 3.565 ± 0.622 | 0.509 ± 0.388 | 2.207 ± 0.505 | 0.0 ± 0.0
Gln | 2.716 ± 0.451 | 1.358 ± 0.721 | 1.528 ± 0.718 | 2.207 ± 0.564 | 1.188 ± 0.501 | 2.377 ± 0.494 | 0.34 ± 0.21 | 1.528 ± 0.752 | 4.075 ± 0.968 | 3.056 ± 0.689 | 0.17 ± 0.207 | 1.528 ± 0.319 | 2.207 ± 0.603 | 2.716 ± 0.666 | 1.698 ± 0.272 | 3.905 ± 0.711 | 2.886 ± 1.145 | 1.698 ± 0.469 | 0.17 ± 0.174 | 0.34 ± 0.21 | 0.0 ± 0.0
Arg | 2.377 ± 0.766 | 0.34 ± 0.239 | 3.396 ± 0.978 | 4.414 ± 0.544 | 0.849 ± 0.466 | 2.716 ± 0.569 | 1.019 ± 0.38 | 3.056 ± 0.888 | 1.698 ± 0.52 | 5.772 ± 0.779 | 0.849 ± 0.37 | 3.735 ± 0.715 | 2.207 ± 0.645 | 1.528 ± 0.481 | 4.075 ± 0.935 | 5.093 ± 0.781 | 2.547 ± 0.7 | 2.716 ± 0.538 | 0.34 ± 0.348 | 1.358 ± 0.606 | 0.0 ± 0.0
Ser | 3.565 ± 0.921 | 1.019 ± 0.458 | 7.131 ± 0.899 | 4.414 ± 0.856 | 2.716 ± 0.665 | 5.433 ± 0.504 | 1.698 ± 0.596 | 9.508 ± 1.53 | 5.942 ± 0.75 | 6.282 ± 1.18 | 3.396 ± 0.866 | 5.433 ± 0.982 | 4.754 ± 1.405 | 3.226 ± 1.066 | 4.584 ± 0.669 | 7.301 ± 1.968 | 5.433 ± 0.6 | 4.075 ± 1.028 | 0.509 ± 0.302 | 2.207 ± 0.295 | 0.0 ± 0.0
Thr | 4.075 ± 1.235 | 1.019 ± 0.552 | 4.075 ± 1.474 | 4.584 ± 0.731 | 0.679 ± 0.282 | 3.396 ± 0.682 | 0.509 ± 0.261 | 4.924 ± 0.357 | 3.565 ± 0.718 | 3.905 ± 0.785 | 1.358 ± 0.622 | 2.377 ± 0.836 | 2.547 ± 0.393 | 1.019 ± 0.451 | 2.886 ± 0.43 | 6.282 ± 1.276 | 2.716 ± 0.501 | 2.886 ± 1.138 | 0.679 ± 0.348 | 1.358 ± 0.191 | 0.0 ± 0.0
Val | 1.698 ± 0.576 | 0.849 ± 0.34 | 2.886 ± 0.981 | 3.226 ± 0.873 | 1.868 ± 1.188 | 4.075 ± 0.955 | 1.019 ± 0.339 | 6.282 ± 0.745 | 4.414 ± 0.572 | 6.112 ± 1.885 | 1.188 ± 0.474 | 4.075 ± 1.066 | 3.735 ± 0.566 | 1.698 ± 0.806 | 2.207 ± 1.018 | 5.433 ± 0.55 | 2.377 ± 0.434 | 2.377 ± 0.951 | 0.34 ± 0.21 | 2.377 ± 0.466 | 0.0 ± 0.0
Trp | 1.188 ± 0.304 | 0.509 ± 0.241 | 0.509 ± 0.241 | 1.188 ± 0.408 | 0.34 ± 0.24 | 0.17 ± 0.12 | 0.0 ± 0.0 | 1.019 ± 0.302 | 0.34 ± 0.2 | 0.509 ± 0.245 | 0.17 ± 0.12 | 0.34 ± 0.2 | 0.17 ± 0.12 | 0.0 ± 0.0 | 0.509 ± 0.257 | 1.698 ± 0.563 | 0.34 ± 0.21 | 0.17 ± 0.213 | 0.17 ± 0.12 | 0.509 ± 0.36 | 0.0 ± 0.0
Tyr | 1.528 ± 0.465 | 0.679 ± 0.279 | 1.358 ± 0.628 | 1.528 ± 0.857 | 1.528 ± 0.492 | 3.056 ± 0.907 | 1.868 ± 0.643 | 2.207 ± 1.101 | 1.698 ± 0.799 | 3.056 ± 0.982 | 0.849 ± 0.412 | 2.377 ± 1.053 | 1.528 ± 0.443 | 1.188 ± 0.325 | 1.019 ± 0.441 | 2.037 ± 0.624 | 2.207 ± 0.53 | 1.698 ± 0.696 | 0.17 ± 0.226 | 1.698 ± 0.483 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 8 proteins (5891 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
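Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only an illustration under stated assumptions. The page does not say which statistic follows the ± sign, so the sample standard deviation is assumed here, and the names `pair_fraction` and `bootstrap_error`, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_fraction(proteins, pair):
    """Fraction of all overlapping dipeptides in `proteins` equal to `pair`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of `pair` frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_fraction(sample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


toy = ["MKKLLA", "MKLSA", "AAKKL"]  # toy sequences, not Nipah virus proteins
print(bootstrap_error(toy, "KK"))
```

With only 8 proteins in this proteome, each resample draws from a very small pool, which may help explain why some of the errors in the table are large relative to the values themselves.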

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski