Amino acid dipeptide frequency for Apis mellifera associated microvirus 20

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
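As an illustration, dipeptide frequencies can be computed by counting overlapping pairs of adjacent residues. The sketch below is not the database's own pipeline; it assumes one-letter sequences as input, that pairs are not counted across protein boundaries, and that any non-standard letter maps to Xaa. The function names are hypothetical.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
STANDARD = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def three_letter(residue):
    """Map a one-letter code to its three-letter code (Xaa if non-standard)."""
    return STANDARD.get(residue.upper(), "Xaa")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: overlapping adjacent pairs are counted within each
    protein separately, then pooled.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[three_letter(first) + three_letter(second)] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

For example, `dipeptide_permille(["AAC", "AB"])` counts three pairs (AlaAla, AlaCys, AlaXaa), each at one third of the total.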

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
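The uniform expectation and the over/under comparison can be sketched as follows. The page does not state the exact rule used for colouring, so the simple comparison against 1/400 below is an assumption, and the function names are hypothetical.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation in per mille: 1000 / (alphabet_size squared)."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: "over" corresponds to red and "under" to blue in the table.
    """
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"
```

With 20 amino acids the expectation is 2.5‰, and with Xaa included (21 symbols) it is about 2.27‰. Under this rule AlaAla (7.582‰) would be classified as overrepresented and AlaCys (0.758‰) as underrepresented.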

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions.
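A minimal sketch of the unit conversion, with hypothetical helper names:

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0
```

For instance, the AlaAla value of 7.582‰ corresponds to a fraction of 0.007582, or about 0.76%.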

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.582AlaAla: 7.582 ± 3.649
0.758AlaCys: 0.758 ± 0.577
6.065AlaAsp: 6.065 ± 0.762
5.307AlaGlu: 5.307 ± 1.502
3.791AlaPhe: 3.791 ± 1.239
6.065AlaGly: 6.065 ± 2.847
3.033AlaHis: 3.033 ± 2.762
4.549AlaIle: 4.549 ± 2.077
3.791AlaLys: 3.791 ± 1.365
3.791AlaLeu: 3.791 ± 1.931
3.791AlaMet: 3.791 ± 1.66
5.307AlaAsn: 5.307 ± 1.466
2.274AlaPro: 2.274 ± 1.615
2.274AlaGln: 2.274 ± 0.655
6.065AlaArg: 6.065 ± 2.073
5.307AlaSer: 5.307 ± 2.415
7.582AlaThr: 7.582 ± 3.649
6.065AlaVal: 6.065 ± 2.856
0.0AlaTrp: 0.0 ± 0.0
2.274AlaTyr: 2.274 ± 0.915
0.0AlaXaa: 0.0 ± 0.0
Cys
0.758CysAla: 0.758 ± 0.577
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.758CysPhe: 0.758 ± 0.659
0.758CysGly: 0.758 ± 0.659
0.758CysHis: 0.758 ± 0.659
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.516CysLeu: 1.516 ± 0.551
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.758CysArg: 0.758 ± 0.659
0.0CysSer: 0.0 ± 0.0
0.758CysThr: 0.758 ± 0.659
1.516CysVal: 1.516 ± 1.319
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.791AspAla: 3.791 ± 0.474
0.0AspCys: 0.0 ± 0.0
0.758AspAsp: 0.758 ± 1.02
3.791AspGlu: 3.791 ± 1.012
6.065AspPhe: 6.065 ± 2.897
3.791AspGly: 3.791 ± 1.567
0.0AspHis: 0.0 ± 0.0
1.516AspIle: 1.516 ± 1.154
3.791AspLys: 3.791 ± 1.743
6.823AspLeu: 6.823 ± 1.889
1.516AspMet: 1.516 ± 0.766
3.791AspAsn: 3.791 ± 1.351
0.758AspPro: 0.758 ± 0.94
3.033AspGln: 3.033 ± 1.008
1.516AspArg: 1.516 ± 0.551
3.791AspSer: 3.791 ± 1.012
1.516AspThr: 1.516 ± 1.154
1.516AspVal: 1.516 ± 1.067
0.758AspTrp: 0.758 ± 0.577
1.516AspTyr: 1.516 ± 1.154
0.0AspXaa: 0.0 ± 0.0
Glu
6.823GluAla: 6.823 ± 2.323
0.0GluCys: 0.0 ± 0.0
0.758GluAsp: 0.758 ± 1.02
2.274GluGlu: 2.274 ± 1.615
3.033GluPhe: 3.033 ± 1.008
1.516GluGly: 1.516 ± 1.88
2.274GluHis: 2.274 ± 0.883
8.34GluIle: 8.34 ± 2.023
5.307GluLys: 5.307 ± 2.308
3.791GluLeu: 3.791 ± 0.474
1.516GluMet: 1.516 ± 0.853
5.307GluAsn: 5.307 ± 1.9
2.274GluPro: 2.274 ± 1.486
1.516GluGln: 1.516 ± 0.766
6.065GluArg: 6.065 ± 1.26
3.033GluSer: 3.033 ± 2.061
1.516GluThr: 1.516 ± 1.014
2.274GluVal: 2.274 ± 0.978
0.0GluTrp: 0.0 ± 0.0
3.033GluTyr: 3.033 ± 1.688
0.0GluXaa: 0.0 ± 0.0
Phe
2.274PheAla: 2.274 ± 1.297
0.0PheCys: 0.0 ± 0.0
4.549PheAsp: 4.549 ± 3.043
1.516PheGlu: 1.516 ± 0.853
3.033PhePhe: 3.033 ± 2.309
3.033PheGly: 3.033 ± 1.428
1.516PheHis: 1.516 ± 1.091
2.274PheIle: 2.274 ± 0.915
3.033PheLys: 3.033 ± 1.594
1.516PheLeu: 1.516 ± 1.154
3.791PheMet: 3.791 ± 1.491
4.549PheAsn: 4.549 ± 1.652
2.274PhePro: 2.274 ± 1.574
2.274PheGln: 2.274 ± 1.539
3.791PheArg: 3.791 ± 2.886
3.791PheSer: 3.791 ± 1.396
1.516PheThr: 1.516 ± 1.014
1.516PheVal: 1.516 ± 1.014
0.758PheTrp: 0.758 ± 0.577
2.274PheTyr: 2.274 ± 1.297
0.0PheXaa: 0.0 ± 0.0
Gly
3.791GlyAla: 3.791 ± 3.456
0.0GlyCys: 0.0 ± 0.0
4.549GlyAsp: 4.549 ± 1.831
3.791GlyGlu: 3.791 ± 1.656
2.274GlyPhe: 2.274 ± 0.915
6.823GlyGly: 6.823 ± 2.934
1.516GlyHis: 1.516 ± 1.319
2.274GlyIle: 2.274 ± 1.927
4.549GlyLys: 4.549 ± 2.088
6.065GlyLeu: 6.065 ± 1.869
0.0GlyMet: 0.0 ± 0.0
3.033GlyAsn: 3.033 ± 1.412
1.516GlyPro: 1.516 ± 1.154
1.516GlyGln: 1.516 ± 0.766
0.758GlyArg: 0.758 ± 0.737
3.791GlySer: 3.791 ± 1.396
6.065GlyThr: 6.065 ± 2.897
3.033GlyVal: 3.033 ± 1.558
0.0GlyTrp: 0.0 ± 0.0
4.549GlyTyr: 4.549 ± 1.652
0.0GlyXaa: 0.0 ± 0.0
His
4.549HisAla: 4.549 ± 1.72
0.0HisCys: 0.0 ± 0.0
1.516HisAsp: 1.516 ± 0.551
2.274HisGlu: 2.274 ± 1.539
0.758HisPhe: 0.758 ± 0.577
2.274HisGly: 2.274 ± 1.069
0.758HisHis: 0.758 ± 0.737
0.758HisIle: 0.758 ± 0.577
1.516HisLys: 1.516 ± 1.091
1.516HisLeu: 1.516 ± 0.853
1.516HisMet: 1.516 ± 1.545
0.0HisAsn: 0.0 ± 0.0
1.516HisPro: 1.516 ± 0.551
1.516HisGln: 1.516 ± 1.091
0.758HisArg: 0.758 ± 0.659
0.758HisSer: 0.758 ± 0.659
0.758HisThr: 0.758 ± 0.659
1.516HisVal: 1.516 ± 1.88
0.758HisTrp: 0.758 ± 0.659
0.758HisTyr: 0.758 ± 0.94
0.0HisXaa: 0.0 ± 0.0
Ile
3.033IleAla: 3.033 ± 2.528
0.0IleCys: 0.0 ± 0.0
1.516IleAsp: 1.516 ± 1.154
3.791IleGlu: 3.791 ± 1.261
2.274IlePhe: 2.274 ± 1.069
5.307IleGly: 5.307 ± 1.6
0.758IleHis: 0.758 ± 0.659
4.549IleIle: 4.549 ± 1.586
4.549IleLys: 4.549 ± 1.643
2.274IleLeu: 2.274 ± 0.655
2.274IleMet: 2.274 ± 1.731
4.549IleAsn: 4.549 ± 1.784
4.549IlePro: 4.549 ± 1.095
3.033IleGln: 3.033 ± 0.578
1.516IleArg: 1.516 ± 0.551
2.274IleSer: 2.274 ± 1.069
3.791IleThr: 3.791 ± 1.656
2.274IleVal: 2.274 ± 1.871
0.758IleTrp: 0.758 ± 0.577
1.516IleTyr: 1.516 ± 1.154
0.0IleXaa: 0.0 ± 0.0
Lys
5.307LysAla: 5.307 ± 3.215
0.758LysCys: 0.758 ± 0.659
3.791LysAsp: 3.791 ± 1.507
5.307LysGlu: 5.307 ± 2.585
1.516LysPhe: 1.516 ± 1.545
1.516LysGly: 1.516 ± 1.091
0.758LysHis: 0.758 ± 0.659
8.34LysIle: 8.34 ± 2.504
6.823LysLys: 6.823 ± 4.616
6.065LysLeu: 6.065 ± 2.496
2.274LysMet: 2.274 ± 1.402
2.274LysAsn: 2.274 ± 1.14
2.274LysPro: 2.274 ± 2.345
3.033LysGln: 3.033 ± 1.524
3.791LysArg: 3.791 ± 1.567
5.307LysSer: 5.307 ± 2.682
6.065LysThr: 6.065 ± 3.335
1.516LysVal: 1.516 ± 0.766
1.516LysTrp: 1.516 ± 1.319
1.516LysTyr: 1.516 ± 0.953
0.0LysXaa: 0.0 ± 0.0
Leu
9.098LeuAla: 9.098 ± 5.075
0.758LeuCys: 0.758 ± 0.659
5.307LeuAsp: 5.307 ± 1.779
4.549LeuGlu: 4.549 ± 1.308
3.033LeuPhe: 3.033 ± 0.985
5.307LeuGly: 5.307 ± 2.484
2.274LeuHis: 2.274 ± 1.574
3.791LeuIle: 3.791 ± 1.44
4.549LeuLys: 4.549 ± 1.796
6.065LeuLeu: 6.065 ± 1.698
2.274LeuMet: 2.274 ± 1.161
3.033LeuAsn: 3.033 ± 0.957
6.065LeuPro: 6.065 ± 3.037
3.791LeuGln: 3.791 ± 1.396
4.549LeuArg: 4.549 ± 0.635
4.549LeuSer: 4.549 ± 0.83
2.274LeuThr: 2.274 ± 1.53
1.516LeuVal: 1.516 ± 1.014
0.758LeuTrp: 0.758 ± 0.659
0.758LeuTyr: 0.758 ± 0.94
0.0LeuXaa: 0.0 ± 0.0
Met
3.791MetAla: 3.791 ± 3.456
0.758MetCys: 0.758 ± 0.659
2.274MetAsp: 2.274 ± 1.256
3.033MetGlu: 3.033 ± 1.9
2.274MetPhe: 2.274 ± 2.345
2.274MetGly: 2.274 ± 1.731
1.516MetHis: 1.516 ± 1.067
0.0MetIle: 0.0 ± 0.0
1.516MetLys: 1.516 ± 1.091
2.274MetLeu: 2.274 ± 1.322
1.516MetMet: 1.516 ± 1.88
0.758MetAsn: 0.758 ± 0.94
0.758MetPro: 0.758 ± 0.577
2.274MetGln: 2.274 ± 0.833
2.274MetArg: 2.274 ± 0.793
0.758MetSer: 0.758 ± 0.94
1.516MetThr: 1.516 ± 0.853
1.516MetVal: 1.516 ± 1.014
1.516MetTrp: 1.516 ± 1.154
0.758MetTyr: 0.758 ± 0.577
0.0MetXaa: 0.0 ± 0.0
Asn
3.033AsnAla: 3.033 ± 1.101
0.758AsnCys: 0.758 ± 0.659
1.516AsnAsp: 1.516 ± 0.853
3.033AsnGlu: 3.033 ± 2.182
2.274AsnPhe: 2.274 ± 1.297
0.758AsnGly: 0.758 ± 0.577
1.516AsnHis: 1.516 ± 1.319
3.033AsnIle: 3.033 ± 1.76
5.307AsnLys: 5.307 ± 3.319
5.307AsnLeu: 5.307 ± 1.779
1.516AsnMet: 1.516 ± 0.766
1.516AsnAsn: 1.516 ± 1.473
3.791AsnPro: 3.791 ± 1.4
5.307AsnGln: 5.307 ± 2.592
6.823AsnArg: 6.823 ± 2.986
2.274AsnSer: 2.274 ± 0.978
6.823AsnThr: 6.823 ± 3.957
1.516AsnVal: 1.516 ± 1.154
1.516AsnTrp: 1.516 ± 0.551
0.758AsnTyr: 0.758 ± 0.577
0.0AsnXaa: 0.0 ± 0.0
Pro
2.274ProAla: 2.274 ± 0.833
0.758ProCys: 0.758 ± 0.659
1.516ProAsp: 1.516 ± 1.154
5.307ProGlu: 5.307 ± 2.765
0.758ProPhe: 0.758 ± 0.94
2.274ProGly: 2.274 ± 1.615
1.516ProHis: 1.516 ± 0.551
6.065ProIle: 6.065 ± 1.554
3.791ProLys: 3.791 ± 3.749
4.549ProLeu: 4.549 ± 2.462
2.274ProMet: 2.274 ± 1.871
1.516ProAsn: 1.516 ± 0.551
0.758ProPro: 0.758 ± 0.737
2.274ProGln: 2.274 ± 1.486
1.516ProArg: 1.516 ± 1.319
3.791ProSer: 3.791 ± 1.396
4.549ProThr: 4.549 ± 2.559
2.274ProVal: 2.274 ± 1.731
0.758ProTrp: 0.758 ± 1.02
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
4.549GlnAla: 4.549 ± 1.311
0.0GlnCys: 0.0 ± 0.0
2.274GlnAsp: 2.274 ± 1.539
4.549GlnGlu: 4.549 ± 2.088
1.516GlnPhe: 1.516 ± 1.209
2.274GlnGly: 2.274 ± 0.833
0.758GlnHis: 0.758 ± 0.94
0.0GlnIle: 0.0 ± 0.0
6.065GlnLys: 6.065 ± 2.629
4.549GlnLeu: 4.549 ± 1.467
1.516GlnMet: 1.516 ± 0.766
3.033GlnAsn: 3.033 ± 1.373
3.033GlnPro: 3.033 ± 1.576
3.033GlnGln: 3.033 ± 1.428
3.791GlnArg: 3.791 ± 1.922
2.274GlnSer: 2.274 ± 1.297
3.033GlnThr: 3.033 ± 0.684
0.758GlnVal: 0.758 ± 0.737
1.516GlnTrp: 1.516 ± 0.551
1.516GlnTyr: 1.516 ± 1.319
0.0GlnXaa: 0.0 ± 0.0
Arg
3.791ArgAla: 3.791 ± 1.976
1.516ArgCys: 1.516 ± 1.319
1.516ArgAsp: 1.516 ± 0.551
3.791ArgGlu: 3.791 ± 1.421
3.033ArgPhe: 3.033 ± 1.008
5.307ArgGly: 5.307 ± 2.328
0.758ArgHis: 0.758 ± 0.577
0.758ArgIle: 0.758 ± 0.659
1.516ArgLys: 1.516 ± 1.319
3.791ArgLeu: 3.791 ± 1.396
3.033ArgMet: 3.033 ± 1.532
3.791ArgAsn: 3.791 ± 1.012
3.033ArgPro: 3.033 ± 1.008
1.516ArgGln: 1.516 ± 1.091
1.516ArgArg: 1.516 ± 1.319
6.823ArgSer: 6.823 ± 2.417
3.033ArgThr: 3.033 ± 1.183
1.516ArgVal: 1.516 ± 1.473
0.758ArgTrp: 0.758 ± 0.577
3.791ArgTyr: 3.791 ± 1.567
0.0ArgXaa: 0.0 ± 0.0
Ser
4.549SerAla: 4.549 ± 2.537
1.516SerCys: 1.516 ± 0.551
4.549SerAsp: 4.549 ± 1.58
1.516SerGlu: 1.516 ± 0.853
3.791SerPhe: 3.791 ± 2.886
0.0SerGly: 0.0 ± 0.0
0.758SerHis: 0.758 ± 0.577
2.274SerIle: 2.274 ± 1.731
7.582SerLys: 7.582 ± 2.866
2.274SerLeu: 2.274 ± 1.069
1.516SerMet: 1.516 ± 1.839
3.791SerAsn: 3.791 ± 1.567
3.791SerPro: 3.791 ± 1.705
3.033SerGln: 3.033 ± 0.684
2.274SerArg: 2.274 ± 1.464
5.307SerSer: 5.307 ± 1.832
8.34SerThr: 8.34 ± 1.525
5.307SerVal: 5.307 ± 2.012
0.758SerTrp: 0.758 ± 0.659
0.758SerTyr: 0.758 ± 1.02
0.0SerXaa: 0.0 ± 0.0
Thr
7.582ThrAla: 7.582 ± 1.897
0.0ThrCys: 0.0 ± 0.0
2.274ThrAsp: 2.274 ± 0.978
2.274ThrGlu: 2.274 ± 0.793
3.791ThrPhe: 3.791 ± 1.567
3.791ThrGly: 3.791 ± 2.069
2.274ThrHis: 2.274 ± 0.915
3.791ThrIle: 3.791 ± 1.343
3.033ThrLys: 3.033 ± 1.576
4.549ThrLeu: 4.549 ± 1.467
2.274ThrMet: 2.274 ± 1.14
5.307ThrAsn: 5.307 ± 3.003
4.549ThrPro: 4.549 ± 1.922
4.549ThrGln: 4.549 ± 1.949
2.274ThrArg: 2.274 ± 0.915
3.791ThrSer: 3.791 ± 1.149
3.791ThrThr: 3.791 ± 2.069
4.549ThrVal: 4.549 ± 1.472
0.0ThrTrp: 0.0 ± 0.0
2.274ThrTyr: 2.274 ± 0.883
0.0ThrXaa: 0.0 ± 0.0
Val
4.549ValAla: 4.549 ± 2.298
0.0ValCys: 0.0 ± 0.0
3.033ValAsp: 3.033 ± 0.957
3.033ValGlu: 3.033 ± 1.008
2.274ValPhe: 2.274 ± 1.297
4.549ValGly: 4.549 ± 0.635
0.758ValHis: 0.758 ± 0.659
0.758ValIle: 0.758 ± 0.577
2.274ValLys: 2.274 ± 2.345
2.274ValLeu: 2.274 ± 1.731
0.0ValMet: 0.0 ± 0.0
3.791ValAsn: 3.791 ± 1.569
3.791ValPro: 3.791 ± 2.886
1.516ValGln: 1.516 ± 1.209
1.516ValArg: 1.516 ± 0.953
5.307ValSer: 5.307 ± 1.374
3.033ValThr: 3.033 ± 1.101
1.516ValVal: 1.516 ± 0.551
0.0ValTrp: 0.0 ± 0.0
0.758ValTyr: 0.758 ± 0.737
0.0ValXaa: 0.0 ± 0.0
Trp
1.516TrpAla: 1.516 ± 0.551
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.758TrpGlu: 0.758 ± 0.577
2.274TrpPhe: 2.274 ± 1.297
0.758TrpGly: 0.758 ± 0.659
1.516TrpHis: 1.516 ± 0.551
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.516TrpLeu: 1.516 ± 1.319
0.0TrpMet: 0.0 ± 0.0
1.516TrpAsn: 1.516 ± 1.154
0.758TrpPro: 0.758 ± 0.577
1.516TrpGln: 1.516 ± 1.319
0.0TrpArg: 0.0 ± 0.0
0.758TrpSer: 0.758 ± 0.659
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.033TyrAla: 3.033 ± 1.101
0.0TyrCys: 0.0 ± 0.0
2.274TyrAsp: 2.274 ± 1.297
0.758TyrGlu: 0.758 ± 0.659
0.758TyrPhe: 0.758 ± 0.577
1.516TyrGly: 1.516 ± 0.551
0.758TyrHis: 0.758 ± 0.659
1.516TyrIle: 1.516 ± 1.014
0.758TyrLys: 0.758 ± 0.577
3.791TyrLeu: 3.791 ± 1.438
0.0TyrMet: 0.0 ± 0.0
1.516TyrAsn: 1.516 ± 1.121
0.758TyrPro: 0.758 ± 0.94
3.033TyrGln: 3.033 ± 0.578
3.033TyrArg: 3.033 ± 1.183
0.758TyrSer: 0.758 ± 0.659
0.758TyrThr: 0.758 ± 0.577
3.033TyrVal: 3.033 ± 1.101
0.758TyrTrp: 0.758 ± 0.659
0.758TyrTyr: 0.758 ± 0.659
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1320 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
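Protein-level bootstrapping can be sketched as below. The page does not describe the exact procedure, so this assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies. The function names and the one-letter `pair` argument are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of a one-letter pair such as "AA"."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the pair frequency over bootstrap resamples.

    Assumption: each resample draws len(proteins) whole proteins
    with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)
```

With only 5 proteins in this proteome, each resample contains few distinct proteins, which is consistent with the relatively large errors in the table.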

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski