Amino acid dipeptide frequency for Apis mellifera associated microvirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
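As a minimal sketch of such a calculation, the Python function below counts adjacent residue pairs in a list of one-letter protein sequences and reports them in per mille. The function name, the mapping of every non-standard letter to Xaa, and the choice to count overlapping pairs within each protein separately are assumptions made for illustration; the exact counting procedure used for this page is not stated here and may differ.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
# "X" (Xaa) is the catch-all for non-standard or unknown residues.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}


def dipeptide_per_mille(proteins):
    """Return {pair name: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones becomes X.
        seq = "".join(c if c in ONE_TO_THREE else "X" for c in seq.upper())
        # Assumption: overlapping adjacent pairs, counted within one protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        ONE_TO_THREE[a] + ONE_TO_THREE[b]:
            (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ONE_TO_THREE, repeat=2)
    }


# Toy input (not the real proteome): pairs are MA, AA, AK and AA, AC.
freqs = dipeptide_per_mille(["MAAK", "AAC"])
print(freqs["AlaAla"])
```

Pairs that never occur are kept with a frequency of 0.0, which matches the layout of the table, where every one of the 441 cells is listed.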

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
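The comparison against the uniform expectation can be sketched as follows. The table values are in per mille, so the expected value of 0.25% corresponds to 2.5‰. The function name and the strict greater-than/less-than comparison without any tolerance are assumptions; the page does not state the exact rule used for colouring.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"      # more frequent than expected (red in the table)
    if value_per_mille < expected:
        return "under"     # underrepresented (blue in the table)
    return "expected"


print(classify(11.722))  # SerAla value from the table
print(classify(0.733))   # AlaCys value from the table
```

If Xaa is counted as a 21st amino acid, the expectation can be passed explicitly as `expected=1000.0 / 441`.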

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
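As a small worked example of the unit conversion, the helpers below (hypothetical names, not part of any library) turn a per-mille value into a fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


# AlaAla from the table: 7.326 per mille is about 0.007326, or about 0.73 %.
fraction = per_mille_to_fraction(7.326)
percent = per_mille_to_percent(7.326)
```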

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.326AlaAla: 7.326 ± 3.512
0.733AlaCys: 0.733 ± 0.7
5.861AlaAsp: 5.861 ± 1.563
2.93AlaGlu: 2.93 ± 1.598
3.663AlaPhe: 3.663 ± 1.242
5.861AlaGly: 5.861 ± 1.382
2.93AlaHis: 2.93 ± 1.184
2.93AlaIle: 2.93 ± 1.813
6.593AlaLys: 6.593 ± 3.912
4.396AlaLeu: 4.396 ± 2.009
3.663AlaMet: 3.663 ± 1.242
7.326AlaAsn: 7.326 ± 3.123
1.465AlaPro: 1.465 ± 0.686
3.663AlaGln: 3.663 ± 1.242
8.059AlaArg: 8.059 ± 1.573
5.861AlaSer: 5.861 ± 2.118
8.791AlaThr: 8.791 ± 2.94
8.791AlaVal: 8.791 ± 1.746
3.663AlaTrp: 3.663 ± 1.202
3.663AlaTyr: 3.663 ± 1.302
0.0AlaXaa: 0.0 ± 0.0
Cys
0.733CysAla: 0.733 ± 0.536
0.0CysCys: 0.0 ± 0.0
3.663CysAsp: 3.663 ± 1.778
0.733CysGlu: 0.733 ± 0.7
0.733CysPhe: 0.733 ± 0.7
0.733CysGly: 0.733 ± 0.7
0.0CysHis: 0.0 ± 0.0
0.733CysIle: 0.733 ± 0.7
0.0CysLys: 0.0 ± 0.0
0.733CysLeu: 0.733 ± 0.536
0.0CysMet: 0.0 ± 0.0
0.733CysAsn: 0.733 ± 0.7
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.733CysThr: 0.733 ± 0.7
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.733CysTyr: 0.733 ± 0.7
0.0CysXaa: 0.0 ± 0.0
Asp
6.593AspAla: 6.593 ± 2.637
0.733AspCys: 0.733 ± 0.777
3.663AspAsp: 3.663 ± 2.143
4.396AspGlu: 4.396 ± 3.294
5.128AspPhe: 5.128 ± 2.993
2.198AspGly: 2.198 ± 1.013
2.198AspHis: 2.198 ± 1.278
1.465AspIle: 1.465 ± 0.8
2.93AspLys: 2.93 ± 2.104
3.663AspLeu: 3.663 ± 1.38
1.465AspMet: 1.465 ± 1.122
2.93AspAsn: 2.93 ± 1.468
3.663AspPro: 3.663 ± 1.778
0.733AspGln: 0.733 ± 0.904
2.93AspArg: 2.93 ± 1.641
6.593AspSer: 6.593 ± 2.275
3.663AspThr: 3.663 ± 2.207
1.465AspVal: 1.465 ± 1.02
0.733AspTrp: 0.733 ± 0.7
2.198AspTyr: 2.198 ± 1.541
0.0AspXaa: 0.0 ± 0.0
Glu
4.396GluAla: 4.396 ± 2.247
0.0GluCys: 0.0 ± 0.0
1.465GluAsp: 1.465 ± 1.804
0.733GluGlu: 0.733 ± 0.777
2.93GluPhe: 2.93 ± 2.087
1.465GluGly: 1.465 ± 0.935
2.198GluHis: 2.198 ± 1.013
2.198GluIle: 2.198 ± 0.965
1.465GluLys: 1.465 ± 1.808
2.93GluLeu: 2.93 ± 2.786
0.733GluMet: 0.733 ± 0.723
2.93GluAsn: 2.93 ± 0.691
0.733GluPro: 0.733 ± 0.7
2.198GluGln: 2.198 ± 1.013
6.593GluArg: 6.593 ± 1.518
0.733GluSer: 0.733 ± 0.777
2.93GluThr: 2.93 ± 1.78
2.198GluVal: 2.198 ± 0.71
0.733GluTrp: 0.733 ± 0.7
2.198GluTyr: 2.198 ± 1.013
0.0GluXaa: 0.0 ± 0.0
Phe
4.396PheAla: 4.396 ± 1.884
0.733PheCys: 0.733 ± 0.777
4.396PheAsp: 4.396 ± 1.416
0.0PheGlu: 0.0 ± 0.0
4.396PhePhe: 4.396 ± 1.972
5.128PheGly: 5.128 ± 1.41
1.465PheHis: 1.465 ± 1.682
2.198PheIle: 2.198 ± 0.778
2.198PheLys: 2.198 ± 0.965
2.93PheLeu: 2.93 ± 1.476
2.93PheMet: 2.93 ± 1.233
0.733PheAsn: 0.733 ± 0.7
2.198PhePro: 2.198 ± 1.278
3.663PheGln: 3.663 ± 1.064
3.663PheArg: 3.663 ± 1.265
2.93PheSer: 2.93 ± 0.691
3.663PheThr: 3.663 ± 1.828
1.465PheVal: 1.465 ± 0.8
0.733PheTrp: 0.733 ± 0.7
1.465PheTyr: 1.465 ± 0.686
0.0PheXaa: 0.0 ± 0.0
Gly
4.396GlyAla: 4.396 ± 1.42
1.465GlyCys: 1.465 ± 1.4
4.396GlyAsp: 4.396 ± 2.238
5.128GlyGlu: 5.128 ± 1.41
2.198GlyPhe: 2.198 ± 0.883
9.524GlyGly: 9.524 ± 2.963
0.0GlyHis: 0.0 ± 0.0
2.93GlyIle: 2.93 ± 2.04
2.93GlyLys: 2.93 ± 1.944
7.326GlyLeu: 7.326 ± 2.616
0.0GlyMet: 0.0 ± 0.0
2.93GlyAsn: 2.93 ± 1.395
1.465GlyPro: 1.465 ± 1.41
3.663GlyGln: 3.663 ± 2.048
2.93GlyArg: 2.93 ± 2.104
5.128GlySer: 5.128 ± 1.826
6.593GlyThr: 6.593 ± 4.093
3.663GlyVal: 3.663 ± 1.242
0.733GlyTrp: 0.733 ± 0.7
3.663GlyTyr: 3.663 ± 1.645
0.0GlyXaa: 0.0 ± 0.0
His
2.93HisAla: 2.93 ± 1.474
0.733HisCys: 0.733 ± 0.536
2.93HisAsp: 2.93 ± 1.245
2.198HisGlu: 2.198 ± 1.457
2.198HisPhe: 2.198 ± 1.013
1.465HisGly: 1.465 ± 1.071
0.0HisHis: 0.0 ± 0.0
1.465HisIle: 1.465 ± 1.44
0.0HisLys: 0.0 ± 0.0
2.198HisLeu: 2.198 ± 1.258
0.0HisMet: 0.0 ± 0.0
1.465HisAsn: 1.465 ± 0.686
2.198HisPro: 2.198 ± 2.829
1.465HisGln: 1.465 ± 0.686
1.465HisArg: 1.465 ± 1.071
0.733HisSer: 0.733 ± 0.904
0.733HisThr: 0.733 ± 0.904
0.0HisVal: 0.0 ± 0.0
0.733HisTrp: 0.733 ± 0.7
2.198HisTyr: 2.198 ± 2.1
0.0HisXaa: 0.0 ± 0.0
Ile
5.861IleAla: 5.861 ± 3.419
0.0IleCys: 0.0 ± 0.0
3.663IleAsp: 3.663 ± 1.302
0.0IleGlu: 0.0 ± 0.0
0.733IlePhe: 0.733 ± 0.7
2.198IleGly: 2.198 ± 1.457
0.733IleHis: 0.733 ± 0.904
1.465IleIle: 1.465 ± 1.02
0.733IleLys: 0.733 ± 0.7
0.0IleLeu: 0.0 ± 0.0
0.733IleMet: 0.733 ± 0.536
5.861IleAsn: 5.861 ± 1.273
3.663IlePro: 3.663 ± 1.801
1.465IleGln: 1.465 ± 0.767
1.465IleArg: 1.465 ± 1.4
2.198IleSer: 2.198 ± 1.013
3.663IleThr: 3.663 ± 1.064
2.93IleVal: 2.93 ± 1.601
0.733IleTrp: 0.733 ± 0.7
2.198IleTyr: 2.198 ± 1.013
0.0IleXaa: 0.0 ± 0.0
Lys
3.663LysAla: 3.663 ± 4.521
0.0LysCys: 0.0 ± 0.0
1.465LysAsp: 1.465 ± 1.804
0.733LysGlu: 0.733 ± 0.777
1.465LysPhe: 1.465 ± 1.071
2.93LysGly: 2.93 ± 0.954
1.465LysHis: 1.465 ± 1.44
1.465LysIle: 1.465 ± 1.4
2.93LysLys: 2.93 ± 1.498
5.861LysLeu: 5.861 ± 3.127
2.93LysMet: 2.93 ± 1.065
0.0LysAsn: 0.0 ± 0.0
2.198LysPro: 2.198 ± 1.278
2.198LysGln: 2.198 ± 1.588
3.663LysArg: 3.663 ± 1.929
2.198LysSer: 2.198 ± 0.778
2.93LysThr: 2.93 ± 2.087
2.93LysVal: 2.93 ± 1.184
0.0LysTrp: 0.0 ± 0.0
1.465LysTyr: 1.465 ± 1.4
0.0LysXaa: 0.0 ± 0.0
Leu
7.326LeuAla: 7.326 ± 1.72
0.733LeuCys: 0.733 ± 1.474
2.93LeuAsp: 2.93 ± 1.329
3.663LeuGlu: 3.663 ± 1.074
3.663LeuPhe: 3.663 ± 1.555
5.861LeuGly: 5.861 ± 1.753
1.465LeuHis: 1.465 ± 0.686
3.663LeuIle: 3.663 ± 2.678
2.93LeuLys: 2.93 ± 2.104
2.93LeuLeu: 2.93 ± 1.561
1.465LeuMet: 1.465 ± 1.808
5.128LeuAsn: 5.128 ± 4.052
6.593LeuPro: 6.593 ± 1.633
5.861LeuGln: 5.861 ± 2.804
2.93LeuArg: 2.93 ± 1.591
5.128LeuSer: 5.128 ± 1.049
2.93LeuThr: 2.93 ± 0.954
3.663LeuVal: 3.663 ± 1.456
2.198LeuTrp: 2.198 ± 1.258
2.93LeuTyr: 2.93 ± 1.372
0.0LeuXaa: 0.0 ± 0.0
Met
5.128MetAla: 5.128 ± 3.053
0.0MetCys: 0.0 ± 0.0
1.465MetAsp: 1.465 ± 0.767
1.465MetGlu: 1.465 ± 1.808
1.465MetPhe: 1.465 ± 1.41
2.198MetGly: 2.198 ± 0.965
0.733MetHis: 0.733 ± 0.7
0.733MetIle: 0.733 ± 1.474
0.733MetLys: 0.733 ± 0.7
2.198MetLeu: 2.198 ± 1.588
0.733MetMet: 0.733 ± 0.7
0.733MetAsn: 0.733 ± 0.777
0.733MetPro: 0.733 ± 0.536
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.465MetSer: 1.465 ± 1.02
0.733MetThr: 0.733 ± 0.536
2.198MetVal: 2.198 ± 1.258
0.733MetTrp: 0.733 ± 0.536
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.396AsnAla: 4.396 ± 2.838
0.733AsnCys: 0.733 ± 0.7
1.465AsnAsp: 1.465 ± 0.8
2.93AsnGlu: 2.93 ± 1.184
0.733AsnPhe: 0.733 ± 0.7
2.198AsnGly: 2.198 ± 1.541
0.733AsnHis: 0.733 ± 0.536
2.93AsnIle: 2.93 ± 1.476
2.198AsnLys: 2.198 ± 0.778
5.128AsnLeu: 5.128 ± 2.025
0.733AsnMet: 0.733 ± 1.189
2.198AsnAsn: 2.198 ± 0.778
4.396AsnPro: 4.396 ± 2.117
2.198AsnGln: 2.198 ± 1.013
2.198AsnArg: 2.198 ± 1.588
2.93AsnSer: 2.93 ± 0.935
5.128AsnThr: 5.128 ± 2.984
0.733AsnVal: 0.733 ± 0.536
1.465AsnTrp: 1.465 ± 1.071
2.93AsnTyr: 2.93 ± 1.601
0.0AsnXaa: 0.0 ± 0.0
Pro
2.198ProAla: 2.198 ± 1.588
0.733ProCys: 0.733 ± 0.7
2.198ProAsp: 2.198 ± 1.013
5.128ProGlu: 5.128 ± 3.151
3.663ProPhe: 3.663 ± 1.287
4.396ProGly: 4.396 ± 1.456
2.198ProHis: 2.198 ± 1.278
5.128ProIle: 5.128 ± 1.644
0.733ProLys: 0.733 ± 0.904
5.861ProLeu: 5.861 ± 2.268
1.465ProMet: 1.465 ± 0.767
2.93ProAsn: 2.93 ± 1.591
2.198ProPro: 2.198 ± 0.778
2.93ProGln: 2.93 ± 2.82
2.198ProArg: 2.198 ± 0.778
2.93ProSer: 2.93 ± 0.691
2.198ProThr: 2.198 ± 1.541
3.663ProVal: 3.663 ± 1.338
0.0ProTrp: 0.0 ± 0.0
2.198ProTyr: 2.198 ± 1.457
0.0ProXaa: 0.0 ± 0.0
Gln
3.663GlnAla: 3.663 ± 2.155
0.733GlnCys: 0.733 ± 0.7
4.396GlnAsp: 4.396 ± 1.563
2.93GlnGlu: 2.93 ± 0.736
1.465GlnPhe: 1.465 ± 1.071
2.93GlnGly: 2.93 ± 0.691
0.733GlnHis: 0.733 ± 0.777
0.733GlnIle: 0.733 ± 0.904
2.93GlnLys: 2.93 ± 2.143
5.128GlnLeu: 5.128 ± 1.423
0.733GlnMet: 0.733 ± 0.904
2.198GlnAsn: 2.198 ± 0.965
2.198GlnPro: 2.198 ± 0.71
2.93GlnGln: 2.93 ± 1.561
2.198GlnArg: 2.198 ± 0.883
2.198GlnSer: 2.198 ± 1.278
3.663GlnThr: 3.663 ± 1.523
3.663GlnVal: 3.663 ± 1.964
0.733GlnTrp: 0.733 ± 0.536
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.861ArgAla: 5.861 ± 3.345
0.733ArgCys: 0.733 ± 0.7
2.93ArgAsp: 2.93 ± 1.29
1.465ArgGlu: 1.465 ± 0.935
2.198ArgPhe: 2.198 ± 1.124
2.198ArgGly: 2.198 ± 1.119
1.465ArgHis: 1.465 ± 0.686
1.465ArgIle: 1.465 ± 1.41
0.733ArgLys: 0.733 ± 0.904
5.128ArgLeu: 5.128 ± 0.832
4.396ArgMet: 4.396 ± 1.917
0.0ArgAsn: 0.0 ± 0.0
4.396ArgPro: 4.396 ± 2.557
2.93ArgGln: 2.93 ± 1.245
2.93ArgArg: 2.93 ± 1.395
5.861ArgSer: 5.861 ± 1.908
2.198ArgThr: 2.198 ± 1.57
2.93ArgVal: 2.93 ± 1.591
0.733ArgTrp: 0.733 ± 0.536
5.861ArgTyr: 5.861 ± 2.745
0.0ArgXaa: 0.0 ± 0.0
Ser
11.722SerAla: 11.722 ± 3.187
0.733SerCys: 0.733 ± 0.536
3.663SerAsp: 3.663 ± 2.308
0.733SerGlu: 0.733 ± 0.777
3.663SerPhe: 3.663 ± 1.694
5.861SerGly: 5.861 ± 1.233
2.93SerHis: 2.93 ± 0.736
3.663SerIle: 3.663 ± 1.409
2.93SerLys: 2.93 ± 1.87
6.593SerLeu: 6.593 ± 1.939
0.0SerMet: 0.0 ± 0.0
0.733SerAsn: 0.733 ± 0.904
4.396SerPro: 4.396 ± 2.074
3.663SerGln: 3.663 ± 1.409
2.93SerArg: 2.93 ± 1.245
9.524SerSer: 9.524 ± 4.095
6.593SerThr: 6.593 ± 2.349
5.128SerVal: 5.128 ± 1.728
0.0SerTrp: 0.0 ± 0.0
1.465SerTyr: 1.465 ± 0.767
0.0SerXaa: 0.0 ± 0.0
Thr
8.791ThrAla: 8.791 ± 3.957
0.0ThrCys: 0.0 ± 0.0
4.396ThrAsp: 4.396 ± 3.459
1.465ThrGlu: 1.465 ± 1.071
3.663ThrPhe: 3.663 ± 1.964
5.861ThrGly: 5.861 ± 1.838
2.198ThrHis: 2.198 ± 1.598
1.465ThrIle: 1.465 ± 1.071
0.733ThrLys: 0.733 ± 0.904
5.128ThrLeu: 5.128 ± 1.795
0.0ThrMet: 0.0 ± 0.0
3.663ThrAsn: 3.663 ± 1.074
5.861ThrPro: 5.861 ± 0.948
1.465ThrGln: 1.465 ± 1.122
5.128ThrArg: 5.128 ± 2.528
7.326ThrSer: 7.326 ± 2.019
3.663ThrThr: 3.663 ± 2.207
4.396ThrVal: 4.396 ± 2.325
0.0ThrTrp: 0.0 ± 0.0
2.198ThrTyr: 2.198 ± 1.013
0.0ThrXaa: 0.0 ± 0.0
Val
5.128ValAla: 5.128 ± 1.636
0.733ValCys: 0.733 ± 0.7
0.733ValAsp: 0.733 ± 0.536
2.198ValGlu: 2.198 ± 1.705
4.396ValPhe: 4.396 ± 1.779
6.593ValGly: 6.593 ± 2.672
0.733ValHis: 0.733 ± 0.7
2.198ValIle: 2.198 ± 1.607
5.128ValLys: 5.128 ± 2.207
1.465ValLeu: 1.465 ± 1.071
0.0ValMet: 0.0 ± 0.0
2.198ValAsn: 2.198 ± 0.71
5.128ValPro: 5.128 ± 1.322
0.733ValGln: 0.733 ± 0.536
2.198ValArg: 2.198 ± 1.541
8.791ValSer: 8.791 ± 2.322
3.663ValThr: 3.663 ± 1.302
2.198ValVal: 2.198 ± 0.883
0.0ValTrp: 0.0 ± 0.0
1.465ValTyr: 1.465 ± 1.071
0.0ValXaa: 0.0 ± 0.0
Trp
1.465TrpAla: 1.465 ± 0.686
0.0TrpCys: 0.0 ± 0.0
1.465TrpAsp: 1.465 ± 1.02
0.733TrpGlu: 0.733 ± 0.7
0.733TrpPhe: 0.733 ± 0.536
0.0TrpGly: 0.0 ± 0.0
2.93TrpHis: 2.93 ± 1.468
0.0TrpIle: 0.0 ± 0.0
0.733TrpLys: 0.733 ± 0.536
0.733TrpLeu: 0.733 ± 0.7
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.733TrpPro: 0.733 ± 0.7
0.733TrpGln: 0.733 ± 0.536
0.733TrpArg: 0.733 ± 0.7
0.733TrpSer: 0.733 ± 0.536
2.198TrpThr: 2.198 ± 1.541
0.0TrpVal: 0.0 ± 0.0
0.733TrpTrp: 0.733 ± 0.904
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.198TyrAla: 2.198 ± 1.278
0.733TyrCys: 0.733 ± 0.536
2.198TyrAsp: 2.198 ± 1.484
2.198TyrGlu: 2.198 ± 1.258
2.93TyrPhe: 2.93 ± 1.372
1.465TyrGly: 1.465 ± 1.02
0.0TyrHis: 0.0 ± 0.0
1.465TyrIle: 1.465 ± 1.4
2.93TyrLys: 2.93 ± 1.372
3.663TyrLeu: 3.663 ± 2.678
0.733TyrMet: 0.733 ± 0.536
3.663TyrAsn: 3.663 ± 1.338
0.733TyrPro: 0.733 ± 0.7
3.663TyrGln: 3.663 ± 1.064
2.198TyrArg: 2.198 ± 1.278
3.663TyrSer: 3.663 ± 1.064
0.733TyrThr: 0.733 ± 0.536
3.663TyrVal: 3.663 ± 1.929
0.0TyrTrp: 0.0 ± 0.0
1.465TyrTyr: 1.465 ± 1.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1366 amino acids)
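The entries above can be read programmatically with a short parser. The sketch below assumes each entry has the form `<shown value><Pair>: <value> ± <error>` as listed in the table. It also converts the per-mille value back to an approximate raw count using a total of 1365 pairs. That total is an inference, not a figure given on this page: the listed values are consistent with integer multiples of 1000/1365 ≈ 0.7326‰, and 1365 is one less than the 1366 amino acids reported.

```python
import re

# Matches entries such as "7.326AlaAla: 7.326 ± 3.512".
ENTRY = re.compile(
    r"^(?P<shown>[\d.]+)"
    r"(?P<pair>[A-Z][a-z]{2}[A-Z][a-z]{2}): "
    r"(?P<value>[\d.]+) ± (?P<error>[\d.]+)$"
)

# Assumption inferred from the values, not stated by the source.
ASSUMED_TOTAL_PAIRS = 1365


def parse_entry(line):
    """Parse one table entry; return None for row labels or other lines."""
    match = ENTRY.match(line.strip())
    if match is None:
        return None
    value = float(match["value"])
    return {
        "pair": match["pair"],
        "value": value,                 # frequency in per mille
        "error": float(match["error"]),  # bootstrap error in per mille
        "count": round(value * ASSUMED_TOTAL_PAIRS / 1000),
    }


entry = parse_entry("11.722SerAla: 11.722 ± 3.187")
print(entry["pair"], entry["count"])
```

Row labels such as `Ala` do not match the pattern and are returned as `None`, so the parser can be applied line by line to the whole table.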

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
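A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the use of the population standard deviation as the error measure, and the fixed random seed are assumptions; the page only states that 100 bootstrap replicates at the protein level were used.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one two-letter pair (e.g. "AA") in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the pair frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the same number.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is reported as a standard deviation.
    return statistics.pstdev(estimates)


# Toy sequences, not the real proteome.
toy = ["MAAKL", "AACGG", "MKLAA"]
err = bootstrap_error(toy, "AA")
```

A pair that never occurs in any protein gets a frequency of 0.0 in every replicate and therefore an error of 0.0, in line with the `0.0 ± 0.0` entries in the table.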

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski