Amino acid dipeptide frequency for Apis mellifera associated microvirus 9

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 amino acid dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
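The comparison against the uniform expectation can be sketched in Python. This is a minimal illustration, not the database's own code: the function names, the pooling of all proteins into one count, and the use of overlapping adjacent pairs are assumptions, and the two sequences are toy input rather than the real proteome.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille
EXPECTED_PERMILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > EXPECTED_PERMILLE:
        return "over"
    if freq < EXPECTED_PERMILLE:
        return "under"
    return "expected"


freqs = dipeptide_permille(["MAAV", "AAK"])  # toy sequences (hypothetical)
print(freqs["AA"], classify(freqs["AA"]))
```

In the table, "over" would correspond to a red cell and "under" to a blue one.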

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
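As a small illustration of the unit conversion, the sketch below reads one table entry of the form "AlaAla: 8.882 ± 3.955" and converts both numbers from per mille to plain fractions. The names CELL and parse_cell are hypothetical helpers introduced here, and the entry layout is taken from the table on this page.

```python
import re

# Matches e.g. "AlaAla: 8.882 ± 3.955"; a leading repeated value is ignored.
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")


def parse_cell(text):
    """Return (first, second, frequency, error), with numbers as fractions."""
    match = CELL.search(text)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {text!r}")
    first, second, mean, err = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(mean) * 1e-3, float(err) * 1e-3


first, second, freq, err = parse_cell("8.882AlaAla: 8.882 ± 3.955")
print(first, second, freq, err)
```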

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.882AlaAla: 8.882 ± 3.955
0.74AlaCys: 0.74 ± 0.646
7.402AlaAsp: 7.402 ± 1.54
2.221AlaGlu: 2.221 ± 0.555
1.48AlaPhe: 1.48 ± 0.725
3.701AlaGly: 3.701 ± 1.569
5.922AlaHis: 5.922 ± 1.436
1.48AlaIle: 1.48 ± 0.513
2.961AlaLys: 2.961 ± 2.422
5.922AlaLeu: 5.922 ± 2.258
1.48AlaMet: 1.48 ± 1.285
2.961AlaAsn: 2.961 ± 1.065
2.961AlaPro: 2.961 ± 1.45
4.441AlaGln: 4.441 ± 2.095
6.662AlaArg: 6.662 ± 1.431
7.402AlaSer: 7.402 ± 3.174
8.142AlaThr: 8.142 ± 3.204
8.882AlaVal: 8.882 ± 1.127
1.48AlaTrp: 1.48 ± 0.513
3.701AlaTyr: 3.701 ± 1.196
0.0AlaXaa: 0.0 ± 0.0
Cys
0.74CysAla: 0.74 ± 0.646
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.48CysPhe: 1.48 ± 0.513
2.221CysGly: 2.221 ± 1.037
0.0CysHis: 0.0 ± 0.0
0.74CysIle: 0.74 ± 0.646
0.0CysLys: 0.0 ± 0.0
1.48CysLeu: 1.48 ± 0.513
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.74CysArg: 0.74 ± 0.646
0.74CysSer: 0.74 ± 0.535
0.74CysThr: 0.74 ± 0.646
1.48CysVal: 1.48 ± 1.293
0.0CysTrp: 0.0 ± 0.0
0.74CysTyr: 0.74 ± 0.646
0.0CysXaa: 0.0 ± 0.0
Asp
7.402AspAla: 7.402 ± 1.747
0.74AspCys: 0.74 ± 0.535
5.181AspAsp: 5.181 ± 1.535
2.961AspGlu: 2.961 ± 2.14
4.441AspPhe: 4.441 ± 0.957
5.922AspGly: 5.922 ± 3.572
2.221AspHis: 2.221 ± 1.442
2.221AspIle: 2.221 ± 0.689
2.221AspLys: 2.221 ± 1.37
5.181AspLeu: 5.181 ± 2.047
0.74AspMet: 0.74 ± 0.682
4.441AspAsn: 4.441 ± 1.664
2.221AspPro: 2.221 ± 0.825
0.74AspGln: 0.74 ± 0.646
1.48AspArg: 1.48 ± 0.513
3.701AspSer: 3.701 ± 1.405
4.441AspThr: 4.441 ± 1.659
7.402AspVal: 7.402 ± 3.24
0.0AspTrp: 0.0 ± 0.0
3.701AspTyr: 3.701 ± 2.674
0.0AspXaa: 0.0 ± 0.0
Glu
3.701GluAla: 3.701 ± 1.271
0.74GluCys: 0.74 ± 0.646
0.74GluAsp: 0.74 ± 0.646
0.0GluGlu: 0.0 ± 0.0
2.961GluPhe: 2.961 ± 1.45
0.74GluGly: 0.74 ± 0.682
0.74GluHis: 0.74 ± 0.535
2.961GluIle: 2.961 ± 0.817
2.221GluLys: 2.221 ± 1.037
2.221GluLeu: 2.221 ± 1.404
1.48GluMet: 1.48 ± 0.84
2.961GluAsn: 2.961 ± 0.817
2.221GluPro: 2.221 ± 1.302
0.0GluGln: 0.0 ± 0.0
3.701GluArg: 3.701 ± 2.542
4.441GluSer: 4.441 ± 1.403
0.74GluThr: 0.74 ± 0.682
2.961GluVal: 2.961 ± 1.293
0.74GluTrp: 0.74 ± 0.988
2.221GluTyr: 2.221 ± 0.689
0.0GluXaa: 0.0 ± 0.0
Phe
6.662PheAla: 6.662 ± 1.965
0.0PheCys: 0.0 ± 0.0
5.181PheAsp: 5.181 ± 1.777
2.221PheGlu: 2.221 ± 1.37
2.961PhePhe: 2.961 ± 1.443
4.441PheGly: 4.441 ± 1.013
0.0PheHis: 0.0 ± 0.0
1.48PheIle: 1.48 ± 0.513
0.0PheLys: 0.0 ± 0.0
2.221PheLeu: 2.221 ± 0.821
0.74PheMet: 0.74 ± 0.535
2.961PheAsn: 2.961 ± 2.14
2.221PhePro: 2.221 ± 1.037
1.48PheGln: 1.48 ± 0.739
3.701PheArg: 3.701 ± 1.196
7.402PheSer: 7.402 ± 1.661
2.961PheThr: 2.961 ± 0.817
1.48PheVal: 1.48 ± 1.07
1.48PheTrp: 1.48 ± 0.916
2.961PheTyr: 2.961 ± 1.548
0.0PheXaa: 0.0 ± 0.0
Gly
5.922GlyAla: 5.922 ± 1.629
0.0GlyCys: 0.0 ± 0.0
6.662GlyAsp: 6.662 ± 1.592
4.441GlyGlu: 4.441 ± 1.992
3.701GlyPhe: 3.701 ± 0.98
5.922GlyGly: 5.922 ± 2.755
1.48GlyHis: 1.48 ± 0.513
2.961GlyIle: 2.961 ± 0.59
5.922GlyLys: 5.922 ± 1.991
8.142GlyLeu: 8.142 ± 2.223
0.0GlyMet: 0.0 ± 0.0
2.961GlyAsn: 2.961 ± 0.638
2.961GlyPro: 2.961 ± 1.537
0.74GlyGln: 0.74 ± 0.682
6.662GlyArg: 6.662 ± 2.935
5.922GlySer: 5.922 ± 3.376
5.922GlyThr: 5.922 ± 1.051
2.961GlyVal: 2.961 ± 2.14
0.0GlyTrp: 0.0 ± 0.0
2.221GlyTyr: 2.221 ± 1.605
0.0GlyXaa: 0.0 ± 0.0
His
2.221HisAla: 2.221 ± 0.825
0.0HisCys: 0.0 ± 0.0
0.74HisAsp: 0.74 ± 0.682
0.74HisGlu: 0.74 ± 0.682
2.961HisPhe: 2.961 ± 1.271
2.221HisGly: 2.221 ± 1.037
0.0HisHis: 0.0 ± 0.0
1.48HisIle: 1.48 ± 1.285
0.0HisLys: 0.0 ± 0.0
3.701HisLeu: 3.701 ± 1.196
0.0HisMet: 0.0 ± 0.0
2.221HisAsn: 2.221 ± 0.825
2.961HisPro: 2.961 ± 1.873
1.48HisGln: 1.48 ± 0.916
0.0HisArg: 0.0 ± 0.0
0.74HisSer: 0.74 ± 0.646
0.74HisThr: 0.74 ± 0.646
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
2.221HisTyr: 2.221 ± 1.939
0.0HisXaa: 0.0 ± 0.0
Ile
0.74IleAla: 0.74 ± 0.682
0.74IleCys: 0.74 ± 0.646
2.961IleAsp: 2.961 ± 1.479
2.221IleGlu: 2.221 ± 1.491
0.0IlePhe: 0.0 ± 0.0
2.961IleGly: 2.961 ± 1.479
0.0IleHis: 0.0 ± 0.0
2.221IleIle: 2.221 ± 1.037
1.48IleLys: 1.48 ± 0.513
2.221IleLeu: 2.221 ± 1.605
0.74IleMet: 0.74 ± 0.535
2.221IleAsn: 2.221 ± 0.977
2.221IlePro: 2.221 ± 1.605
2.221IleGln: 2.221 ± 1.076
3.701IleArg: 3.701 ± 0.471
2.961IleSer: 2.961 ± 0.993
3.701IleThr: 3.701 ± 1.136
2.961IleVal: 2.961 ± 0.817
0.74IleTrp: 0.74 ± 0.535
1.48IleTyr: 1.48 ± 0.513
0.0IleXaa: 0.0 ± 0.0
Lys
4.441LysAla: 4.441 ± 2.72
0.0LysCys: 0.0 ± 0.0
0.74LysAsp: 0.74 ± 0.646
2.961LysGlu: 2.961 ± 0.638
2.221LysPhe: 2.221 ± 1.378
3.701LysGly: 3.701 ± 1.208
0.0LysHis: 0.0 ± 0.0
3.701LysIle: 3.701 ± 1.882
3.701LysLys: 3.701 ± 3.232
0.74LysLeu: 0.74 ± 0.535
1.48LysMet: 1.48 ± 1.07
4.441LysAsn: 4.441 ± 2.274
2.961LysPro: 2.961 ± 1.126
1.48LysGln: 1.48 ± 0.995
2.221LysArg: 2.221 ± 0.825
2.221LysSer: 2.221 ± 0.555
2.221LysThr: 2.221 ± 1.076
0.74LysVal: 0.74 ± 0.646
1.48LysTrp: 1.48 ± 0.513
1.48LysTyr: 1.48 ± 0.513
0.0LysXaa: 0.0 ± 0.0
Leu
5.922LeuAla: 5.922 ± 1.936
1.48LeuCys: 1.48 ± 0.513
3.701LeuAsp: 3.701 ± 1.261
1.48LeuGlu: 1.48 ± 0.725
2.221LeuPhe: 2.221 ± 1.014
5.922LeuGly: 5.922 ± 1.044
0.74LeuHis: 0.74 ± 0.988
2.221LeuIle: 2.221 ± 1.605
3.701LeuLys: 3.701 ± 0.988
9.623LeuLeu: 9.623 ± 2.057
1.48LeuMet: 1.48 ± 0.612
5.922LeuAsn: 5.922 ± 2.246
5.181LeuPro: 5.181 ± 1.881
2.221LeuGln: 2.221 ± 0.825
6.662LeuArg: 6.662 ± 1.159
5.181LeuSer: 5.181 ± 3.744
3.701LeuThr: 3.701 ± 0.711
3.701LeuVal: 3.701 ± 1.421
0.74LeuTrp: 0.74 ± 0.646
5.181LeuTyr: 5.181 ± 2.286
0.0LeuXaa: 0.0 ± 0.0
Met
1.48MetAla: 1.48 ± 1.07
0.0MetCys: 0.0 ± 0.0
2.221MetAsp: 2.221 ± 1.014
0.0MetGlu: 0.0 ± 0.0
0.74MetPhe: 0.74 ± 0.646
1.48MetGly: 1.48 ± 0.739
0.74MetHis: 0.74 ± 0.535
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.48MetLeu: 1.48 ± 0.739
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
2.221MetPro: 2.221 ± 0.825
0.74MetGln: 0.74 ± 0.799
1.48MetArg: 1.48 ± 0.937
5.181MetSer: 5.181 ± 2.416
1.48MetThr: 1.48 ± 0.513
0.0MetVal: 0.0 ± 0.0
0.74MetTrp: 0.74 ± 0.535
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
6.662AsnAla: 6.662 ± 2.637
1.48AsnCys: 1.48 ± 1.293
2.961AsnAsp: 2.961 ± 1.537
1.48AsnGlu: 1.48 ± 1.293
4.441AsnPhe: 4.441 ± 1.465
1.48AsnGly: 1.48 ± 0.513
0.0AsnHis: 0.0 ± 0.0
0.0AsnIle: 0.0 ± 0.0
2.221AsnLys: 2.221 ± 1.041
5.922AsnLeu: 5.922 ± 2.77
1.48AsnMet: 1.48 ± 0.739
2.961AsnAsn: 2.961 ± 0.638
5.922AsnPro: 5.922 ± 1.225
0.74AsnGln: 0.74 ± 0.535
2.961AsnArg: 2.961 ± 1.537
2.221AsnSer: 2.221 ± 1.593
2.221AsnThr: 2.221 ± 1.014
2.221AsnVal: 2.221 ± 1.076
0.0AsnTrp: 0.0 ± 0.0
0.74AsnTyr: 0.74 ± 0.535
0.0AsnXaa: 0.0 ± 0.0
Pro
3.701ProAla: 3.701 ± 1.925
0.0ProCys: 0.0 ± 0.0
2.961ProAsp: 2.961 ± 1.271
5.181ProGlu: 5.181 ± 1.406
2.961ProPhe: 2.961 ± 1.126
5.922ProGly: 5.922 ± 2.77
1.48ProHis: 1.48 ± 0.513
4.441ProIle: 4.441 ± 2.175
1.48ProLys: 1.48 ± 0.725
5.181ProLeu: 5.181 ± 0.729
0.0ProMet: 0.0 ± 0.0
2.221ProAsn: 2.221 ± 0.689
3.701ProPro: 3.701 ± 2.129
6.662ProGln: 6.662 ± 3.539
1.48ProArg: 1.48 ± 0.725
5.181ProSer: 5.181 ± 1.147
0.74ProThr: 0.74 ± 0.682
4.441ProVal: 4.441 ± 1.155
0.0ProTrp: 0.0 ± 0.0
2.221ProTyr: 2.221 ± 1.037
0.0ProXaa: 0.0 ± 0.0
Gln
2.221GlnAla: 2.221 ± 1.37
0.0GlnCys: 0.0 ± 0.0
4.441GlnAsp: 4.441 ± 1.782
3.701GlnGlu: 3.701 ± 1.327
1.48GlnPhe: 1.48 ± 1.07
2.221GlnGly: 2.221 ± 1.605
0.0GlnHis: 0.0 ± 0.0
2.221GlnIle: 2.221 ± 1.162
2.221GlnLys: 2.221 ± 0.977
2.221GlnLeu: 2.221 ± 1.442
2.961GlnMet: 2.961 ± 0.964
0.74GlnAsn: 0.74 ± 0.799
0.0GlnPro: 0.0 ± 0.0
0.0GlnGln: 0.0 ± 0.0
3.701GlnArg: 3.701 ± 1.049
3.701GlnSer: 3.701 ± 1.569
2.221GlnThr: 2.221 ± 1.014
1.48GlnVal: 1.48 ± 0.739
0.0GlnTrp: 0.0 ± 0.0
0.74GlnTyr: 0.74 ± 0.535
0.0GlnXaa: 0.0 ± 0.0
Arg
7.402ArgAla: 7.402 ± 0.737
1.48ArgCys: 1.48 ± 0.513
2.961ArgAsp: 2.961 ± 1.122
2.961ArgGlu: 2.961 ± 1.658
4.441ArgPhe: 4.441 ± 1.33
2.961ArgGly: 2.961 ± 0.817
1.48ArgHis: 1.48 ± 1.187
1.48ArgIle: 1.48 ± 0.513
0.74ArgLys: 0.74 ± 0.646
7.402ArgLeu: 7.402 ± 1.348
2.221ArgMet: 2.221 ± 0.601
0.0ArgAsn: 0.0 ± 0.0
3.701ArgPro: 3.701 ± 1.504
2.961ArgGln: 2.961 ± 0.59
3.701ArgArg: 3.701 ± 1.258
5.181ArgSer: 5.181 ± 1.496
5.922ArgThr: 5.922 ± 1.834
4.441ArgVal: 4.441 ± 1.543
0.0ArgTrp: 0.0 ± 0.0
2.221ArgTyr: 2.221 ± 0.825
0.0ArgXaa: 0.0 ± 0.0
Ser
10.363SerAla: 10.363 ± 6.656
2.961SerCys: 2.961 ± 1.651
4.441SerAsp: 4.441 ± 2.302
1.48SerGlu: 1.48 ± 0.513
2.221SerPhe: 2.221 ± 1.083
4.441SerGly: 4.441 ± 2.845
0.74SerHis: 0.74 ± 0.646
5.181SerIle: 5.181 ± 2.406
5.181SerLys: 5.181 ± 2.803
5.922SerLeu: 5.922 ± 1.923
2.221SerMet: 2.221 ± 0.555
4.441SerAsn: 4.441 ± 2.221
4.441SerPro: 4.441 ± 2.153
3.701SerGln: 3.701 ± 1.066
3.701SerArg: 3.701 ± 0.471
2.961SerSer: 2.961 ± 1.276
5.922SerThr: 5.922 ± 0.565
4.441SerVal: 4.441 ± 1.176
0.0SerTrp: 0.0 ± 0.0
5.181SerTyr: 5.181 ± 2.189
0.0SerXaa: 0.0 ± 0.0
Thr
3.701ThrAla: 3.701 ± 0.471
0.0ThrCys: 0.0 ± 0.0
4.441ThrAsp: 4.441 ± 1.155
0.74ThrGlu: 0.74 ± 0.988
4.441ThrPhe: 4.441 ± 1.274
8.142ThrGly: 8.142 ± 1.642
2.221ThrHis: 2.221 ± 0.555
0.74ThrIle: 0.74 ± 0.535
3.701ThrLys: 3.701 ± 1.327
4.441ThrLeu: 4.441 ± 2.523
0.74ThrMet: 0.74 ± 0.646
3.701ThrAsn: 3.701 ± 1.925
6.662ThrPro: 6.662 ± 1.205
1.48ThrGln: 1.48 ± 0.739
6.662ThrArg: 6.662 ± 0.918
4.441ThrSer: 4.441 ± 3.209
2.221ThrThr: 2.221 ± 0.821
2.221ThrVal: 2.221 ± 1.014
0.74ThrTrp: 0.74 ± 0.535
3.701ThrTyr: 3.701 ± 0.98
0.0ThrXaa: 0.0 ± 0.0
Val
2.961ValAla: 2.961 ± 1.794
0.0ValCys: 0.0 ± 0.0
2.961ValAsp: 2.961 ± 1.537
0.74ValGlu: 0.74 ± 0.535
3.701ValPhe: 3.701 ± 0.969
6.662ValGly: 6.662 ± 2.649
2.221ValHis: 2.221 ± 1.037
1.48ValIle: 1.48 ± 1.07
2.221ValLys: 2.221 ± 1.688
2.221ValLeu: 2.221 ± 1.162
0.74ValMet: 0.74 ± 0.646
2.221ValAsn: 2.221 ± 0.821
2.961ValPro: 2.961 ± 2.14
2.221ValGln: 2.221 ± 1.041
2.961ValArg: 2.961 ± 2.035
6.662ValSer: 6.662 ± 1.159
7.402ValThr: 7.402 ± 2.828
2.221ValVal: 2.221 ± 1.076
0.0ValTrp: 0.0 ± 0.0
2.961ValTyr: 2.961 ± 0.993
0.0ValXaa: 0.0 ± 0.0
Trp
2.221TrpAla: 2.221 ± 1.014
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.74TrpGlu: 0.74 ± 0.535
0.74TrpPhe: 0.74 ± 0.646
0.74TrpGly: 0.74 ± 0.646
0.74TrpHis: 0.74 ± 0.535
0.0TrpIle: 0.0 ± 0.0
0.74TrpLys: 0.74 ± 0.646
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.74TrpPro: 0.74 ± 0.535
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
2.221TrpSer: 2.221 ± 1.037
0.0TrpThr: 0.0 ± 0.0
0.74TrpVal: 0.74 ± 0.988
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.48TyrAla: 1.48 ± 0.725
0.74TyrCys: 0.74 ± 0.646
6.662TyrAsp: 6.662 ± 1.959
2.221TyrGlu: 2.221 ± 1.688
2.961TyrPhe: 2.961 ± 1.293
3.701TyrGly: 3.701 ± 1.529
3.701TyrHis: 3.701 ± 0.969
1.48TyrIle: 1.48 ± 1.07
2.221TyrLys: 2.221 ± 1.605
1.48TyrLeu: 1.48 ± 1.07
0.74TyrMet: 0.74 ± 0.535
0.74TyrAsn: 0.74 ± 0.535
3.701TyrPro: 3.701 ± 1.006
2.961TyrGln: 2.961 ± 1.548
1.48TyrArg: 1.48 ± 1.223
1.48TyrSer: 1.48 ± 0.513
3.701TyrThr: 3.701 ± 1.266
0.74TyrVal: 0.74 ± 0.535
1.48TyrTrp: 1.48 ± 0.513
1.48TyrTyr: 1.48 ± 1.293
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1352 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
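One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values is reported as the error. This is an assumption about the procedure, not the database's documented implementation; the function names, the fixed seed, and the use of the sample standard deviation are choices made for this example.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, pooled over the sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["MAAV", "AAK", "MKKL"]  # toy sequences (hypothetical)
print(bootstrap_error(toy_proteome, "AA"))
```

With only a handful of proteins, as here, the resamples differ strongly from one another, which is consistent with the relatively large errors shown in the table.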

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski