Amino acid dipeptide frequency for Maprik virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
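As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to pool all proteins and divide by the total number of dipeptides are assumptions made for this example; the exact normalization used by the database is not stated on this page.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Hypothetical helper: residues outside the 20 standard ones are mapped
    to 'X' (Xaa), which gives 21 x 21 = 441 possible dipeptides.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not the Maprik virus proteome.
toy = ["MKLLA", "LLK"]
freqs = dipeptide_per_mille(toy)
print(len(freqs))            # number of dipeptide keys, including Xaa pairs
print(round(freqs["LL"], 3))  # LL occurs in 2 of the 6 dipeptides
```

Pooling before normalizing means long proteins contribute more pairs than short ones; averaging per-protein frequencies would be an equally plausible alternative.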

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
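The unit conversion and the comparison against the uniform expectation can be sketched in a few lines of Python. The names `per_mille_to_fraction` and `classify` are hypothetical, and the simple "above or below 2.5‰" rule is an assumption; the page does not say whether the colouring also takes the error estimate into account. The three example values are taken from the table.

```python
# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values copied from the table (per mille).
examples = {"LeuLeu": 11.679, "AlaAla": 2.855, "TrpTrp": 0.0}
labels = {name: classify(v) for name, v in examples.items()}
print(labels)
print(per_mille_to_fraction(examples["LeuLeu"]))
```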

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency in ‰ (value ± error). Rows: first residue; columns: second residue.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.855 ± 1.158 | 0.519 ± 0.475 | 2.336 ± 1.281 | 3.374 ± 0.968 | 2.076 ± 0.442 | 1.038 ± 1.427 | 0.519 ± 0.128 | 4.153 ± 0.111 | 4.672 ± 1.546 | 4.931 ± 0.841 | 1.298 ± 0.246 | 4.412 ± 0.875 | 1.817 ± 0.557 | 1.817 ± 1.404 | 2.595 ± 0.605 | 1.038 ± 0.294 | 1.817 ± 0.448 | 2.595 ± 0.702 | 0.519 ± 0.316 | 1.557 ± 0.325 | 0.0 ± 0.0
Cys | 1.298 ± 0.466 | 0.519 ± 0.475 | 0.26 ± 0.237 | 1.038 ± 0.95 | 1.557 ± 1.053 | 2.336 ± 1.764 | 0.519 ± 0.475 | 1.557 ± 0.694 | 2.076 ± 1.162 | 2.855 ± 0.84 | 0.26 ± 0.158 | 2.336 ± 1.042 | 1.298 ± 0.817 | 1.298 ± 0.246 | 0.519 ± 0.128 | 1.298 ± 0.817 | 2.595 ± 2.374 | 0.26 ± 0.237 | 0.0 ± 0.0 | 0.779 ± 0.347 | 0.0 ± 0.0
Asp | 1.817 ± 0.357 | 1.038 ± 0.581 | 4.672 ± 1.332 | 3.114 ± 0.847 | 6.229 ± 0.803 | 2.336 ± 1.423 | 0.519 ± 0.128 | 5.969 ± 0.479 | 3.893 ± 1.033 | 7.267 ± 1.116 | 1.038 ± 0.632 | 3.374 ± 0.584 | 3.114 ± 1.415 | 1.298 ± 0.443 | 1.298 ± 0.443 | 3.374 ± 0.243 | 2.595 ± 0.605 | 2.595 ± 0.34 | 0.26 ± 0.237 | 3.893 ± 1.272 | 0.0 ± 0.0
Glu | 3.634 ± 0.952 | 1.038 ± 0.95 | 3.893 ± 1.662 | 3.634 ± 0.373 | 2.595 ± 0.886 | 1.817 ± 0.927 | 2.076 ± 0.442 | 6.748 ± 1.967 | 4.412 ± 0.875 | 6.229 ± 1.2 | 2.076 ± 0.728 | 3.374 ± 0.966 | 2.076 ± 0.588 | 2.855 ± 1.458 | 3.114 ± 0.882 | 3.634 ± 1.178 | 3.374 ± 0.766 | 3.634 ± 0.119 | 0.26 ± 0.158 | 2.336 ± 0.537 | 0.0 ± 0.0
Phe | 1.557 ± 0.515 | 0.779 ± 0.162 | 3.893 ± 0.509 | 2.595 ± 0.886 | 2.336 ± 0.735 | 1.557 ± 0.591 | 0.519 ± 0.128 | 4.153 ± 0.927 | 5.45 ± 0.918 | 4.931 ± 0.817 | 1.817 ± 1.293 | 2.595 ± 0.886 | 0.519 ± 0.316 | 0.519 ± 0.128 | 3.114 ± 0.221 | 3.634 ± 0.373 | 4.412 ± 0.875 | 2.855 ± 0.546 | 0.26 ± 0.158 | 1.817 ± 1.272 | 0.0 ± 0.0
Gly | 0.519 ± 0.767 | 2.595 ± 1.274 | 3.374 ± 0.725 | 3.114 ± 0.6 | 1.038 ± 0.294 | 1.298 ± 0.817 | 1.038 ± 0.718 | 3.374 ± 1.513 | 2.076 ± 0.397 | 4.672 ± 1.152 | 0.519 ± 0.128 | 3.374 ± 0.958 | 1.038 ± 0.294 | 0.779 ± 0.696 | 1.298 ± 0.246 | 2.855 ± 0.901 | 2.336 ± 1.397 | 2.076 ± 1.273 | 0.779 ± 0.347 | 1.038 ± 2.215 | 0.0 ± 0.0
His | 1.817 ± 0.448 | 0.26 ± 0.237 | 0.26 ± 0.158 | 1.298 ± 0.817 | 1.298 ± 0.443 | 2.076 ± 1.214 | 0.519 ± 0.316 | 0.519 ± 0.128 | 1.557 ± 0.384 | 2.855 ± 0.546 | 0.26 ± 0.237 | 1.557 ± 0.325 | 0.26 ± 0.237 | 0.779 ± 0.712 | 0.519 ± 0.767 | 1.038 ± 0.294 | 1.557 ± 1.053 | 1.038 ± 0.256 | 0.0 ± 0.0 | 1.298 ± 0.246 | 0.0 ± 0.0
Ile | 5.45 ± 1.049 | 1.557 ± 0.694 | 5.71 ± 2.079 | 7.527 ± 0.626 | 3.634 ± 0.119 | 2.855 ± 0.546 | 2.595 ± 0.398 | 5.71 ± 1.408 | 6.229 ± 1.501 | 9.862 ± 2.008 | 2.855 ± 0.323 | 6.488 ± 0.458 | 4.672 ± 1.152 | 2.336 ± 0.487 | 2.076 ± 0.442 | 8.046 ± 2.186 | 5.71 ± 1.529 | 3.374 ± 0.958 | 1.298 ± 0.771 | 4.412 ± 1.108 | 0.0 ± 0.0
Lys | 3.634 ± 1.784 | 1.298 ± 0.817 | 3.634 ± 0.929 | 8.046 ± 1.589 | 3.893 ± 0.509 | 3.893 ± 0.139 | 2.336 ± 0.477 | 7.527 ± 1.04 | 5.969 ± 1.557 | 6.488 ± 0.959 | 2.595 ± 0.606 | 4.153 ± 0.822 | 1.557 ± 0.384 | 3.893 ± 1.869 | 3.374 ± 0.64 | 5.969 ± 0.997 | 4.931 ± 1.103 | 4.931 ± 0.735 | 0.519 ± 0.128 | 2.336 ± 0.516 | 0.0 ± 0.0
Leu | 3.893 ± 0.646 | 2.336 ± 1.397 | 8.046 ± 2.499 | 4.931 ± 1.089 | 4.931 ± 1.089 | 3.114 ± 0.768 | 2.595 ± 0.933 | 9.862 ± 2.408 | 9.343 ± 0.414 | 11.679 ± 2.269 | 2.336 ± 0.477 | 4.931 ± 1.089 | 4.672 ± 1.332 | 3.114 ± 0.65 | 3.634 ± 0.426 | 6.229 ± 2.843 | 5.71 ± 1.207 | 5.191 ± 1.269 | 0.0 ± 0.0 | 4.153 ± 1.176 | 0.0 ± 0.0
Met | 1.557 ± 0.591 | 0.519 ± 0.128 | 1.557 ± 0.591 | 0.779 ± 0.474 | 0.26 ± 0.757 | 0.519 ± 0.475 | 1.038 ± 0.256 | 2.595 ± 0.606 | 1.817 ± 0.357 | 3.634 ± 0.896 | 2.076 ± 0.909 | 1.557 ± 0.949 | 1.038 ± 0.256 | 1.038 ± 1.42 | 3.374 ± 0.984 | 1.557 ± 2.13 | 0.779 ± 0.162 | 1.038 ± 0.256 | 0.26 ± 0.158 | 1.557 ± 0.325 | 0.0 ± 0.0
Asn | 3.114 ± 1.975 | 2.076 ± 1.527 | 3.374 ± 0.958 | 2.855 ± 0.554 | 4.153 ± 1.024 | 2.855 ± 1.648 | 1.557 ± 0.325 | 4.672 ± 0.953 | 3.893 ± 0.738 | 5.969 ± 0.997 | 2.076 ± 0.588 | 4.931 ± 1.553 | 3.374 ± 0.243 | 2.595 ± 1.542 | 3.374 ± 0.958 | 2.336 ± 1.066 | 3.374 ± 0.958 | 1.817 ± 0.357 | 1.038 ± 0.256 | 3.893 ± 0.509 | 0.0 ± 0.0
Pro | 1.557 ± 0.325 | 0.779 ± 0.712 | 2.076 ± 0.588 | 3.374 ± 0.984 | 1.557 ± 0.949 | 2.336 ± 1.281 | 0.519 ± 0.128 | 3.374 ± 0.584 | 1.817 ± 0.357 | 2.855 ± 0.901 | 0.779 ± 0.474 | 2.076 ± 1.016 | 0.779 ± 0.162 | 0.519 ± 0.128 | 0.26 ± 0.158 | 2.595 ± 1.321 | 1.557 ± 0.694 | 2.336 ± 1.397 | 0.26 ± 0.757 | 1.298 ± 0.791 | 0.0 ± 0.0
Gln | 2.595 ± 2.734 | 0.519 ± 0.475 | 2.336 ± 0.833 | 0.779 ± 0.162 | 1.557 ± 1.315 | 1.557 ± 0.384 | 0.519 ± 0.475 | 3.634 ± 1.505 | 3.893 ± 1.733 | 1.557 ± 0.591 | 1.557 ± 0.515 | 2.336 ± 0.537 | 0.26 ± 0.158 | 1.038 ± 2.215 | 3.114 ± 1.221 | 1.817 ± 0.357 | 2.595 ± 0.606 | 1.557 ± 0.597 | 0.26 ± 0.158 | 1.557 ± 0.591 | 0.0 ± 0.0
Arg | 2.076 ± 0.728 | 1.298 ± 0.466 | 2.595 ± 0.886 | 3.634 ± 1.178 | 1.557 ± 0.694 | 1.038 ± 0.256 | 0.779 ± 0.474 | 4.412 ± 0.806 | 2.336 ± 0.516 | 5.71 ± 0.56 | 0.519 ± 0.245 | 2.336 ± 0.487 | 0.519 ± 0.475 | 1.298 ± 1.363 | 2.076 ± 0.588 | 2.595 ± 0.606 | 2.076 ± 0.909 | 1.557 ± 0.597 | 0.519 ± 0.128 | 2.595 ± 1.155 | 0.0 ± 0.0
Ser | 1.817 ± 0.357 | 2.336 ± 1.042 | 3.634 ± 0.119 | 3.893 ± 0.875 | 2.336 ± 1.188 | 2.595 ± 1.209 | 1.038 ± 0.607 | 7.527 ± 1.694 | 6.488 ± 1.091 | 5.969 ± 1.373 | 2.595 ± 0.34 | 3.374 ± 0.584 | 1.557 ± 0.325 | 1.817 ± 0.448 | 3.114 ± 1.599 | 3.634 ± 0.373 | 4.412 ± 0.052 | 3.114 ± 0.392 | 0.0 ± 0.0 | 2.595 ± 1.059 | 0.0 ± 0.0
Thr | 2.595 ± 0.606 | 2.076 ± 1.162 | 2.855 ± 0.719 | 3.114 ± 0.882 | 2.855 ± 0.28 | 2.076 ± 1.527 | 0.779 ± 0.347 | 6.748 ± 1.734 | 6.229 ± 1.155 | 3.374 ± 0.966 | 1.298 ± 0.771 | 3.893 ± 1.399 | 1.298 ± 0.443 | 1.298 ± 0.443 | 1.557 ± 0.384 | 4.412 ± 1.054 | 2.595 ± 0.34 | 3.634 ± 0.801 | 1.038 ± 0.718 | 4.153 ± 0.833 | 0.0 ± 0.0
Val | 2.336 ± 0.477 | 2.595 ± 1.274 | 2.855 ± 0.901 | 2.336 ± 0.361 | 3.374 ± 0.243 | 2.076 ± 0.512 | 0.519 ± 0.128 | 4.412 ± 1.875 | 3.634 ± 0.119 | 3.374 ± 0.766 | 1.038 ± 0.294 | 3.114 ± 1.181 | 0.26 ± 0.158 | 4.412 ± 0.725 | 1.557 ± 0.325 | 4.153 ± 1.421 | 2.076 ± 0.442 | 2.336 ± 0.859 | 0.26 ± 0.757 | 2.076 ± 0.512 | 0.0 ± 0.0
Trp | 0.26 ± 0.158 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.779 ± 0.673 | 0.519 ± 0.128 | 0.519 ± 0.475 | 0.0 ± 0.0 | 0.519 ± 0.128 | 0.519 ± 0.767 | 1.557 ± 0.384 | 0.26 ± 0.757 | 0.519 ± 0.316 | 0.0 ± 0.0 | 0.26 ± 0.158 | 0.26 ± 0.158 | 1.298 ± 0.791 | 0.519 ± 0.71 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 1.557 ± 0.384 | 0.519 ± 0.128 | 2.336 ± 1.174 | 2.595 ± 0.492 | 1.557 ± 0.851 | 1.298 ± 0.817 | 0.779 ± 0.347 | 4.931 ± 1.621 | 4.672 ± 0.723 | 4.931 ± 1.437 | 1.038 ± 0.607 | 2.595 ± 0.398 | 2.595 ± 1.209 | 2.076 ± 0.442 | 1.557 ± 0.597 | 2.336 ± 1.217 | 2.855 ± 1.04 | 2.855 ± 0.546 | 0.26 ± 0.158 | 0.779 ± 0.162 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 3 proteins (3854 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
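To make the note concrete, the following Python sketch shows one way a protein-level bootstrap could work: whole proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread of the replicates is reported. The function names, the toy sequences, the fixed seed, and the use of the population standard deviation as the error are all assumptions for illustration; the page only states that the bootstrap used 100 replicates at the protein level.

```python
import random
import statistics


def dipeptide_per_mille(sequences, dipeptide):
    """Per mille frequency of one dipeptide, pooled over the given sequences."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread of the statistic over n_boot resamples of whole proteins.

    Assumption: the reported error is the standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_boot):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.pstdev(replicates)


# Toy sequences in one-letter code, not the Maprik virus proteome.
proteins = ["MKLLALLK", "MLLGS", "MKTAYIAKQR"]
estimate = dipeptide_per_mille(proteins, "LL")
error = bootstrap_error(proteins, "LL")
print(f"LL: {estimate:.3f} ± {error:.3f} per mille")
```

With only three proteins, as in this proteome, each resample often omits one protein entirely, which is one reason the errors can be large relative to the values.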

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski