Amino acid dipeptide frequency for Apis mellifera associated microvirus 54

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
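As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs and compares them with the uniform expectation. It is not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, the choice not to count pairs across protein boundaries, and the mapping of every non-standard letter to `X` (Xaa) are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: any letter outside the 20 standard ones is treated as X (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy sequences, not taken from the proteome described here.
toy = ["MKALLAG", "MAAGSSX"]
freqs = dipeptide_per_mille(toy)

expected = 1000.0 / 400  # uniform expectation over 400 dipeptides: 2.5 per mille
over = sorted(dp for dp, f in freqs.items() if f > expected)
under = sorted(dp for dp, f in freqs.items() if f < expected)
print(f"{len(over)} dipeptides above and {len(under)} below {expected} per mille")
```

With only a handful of residues almost every dipeptide is either absent or far above 2.5‰, so the comparison only becomes meaningful for realistically sized proteomes.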

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale, the expected frequency of 0.25% corresponds to 2.5‰.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.677 ± 1.604
AlaCys: 0.0 ± 0.0
AlaAsp: 3.709 ± 1.342
AlaGlu: 2.967 ± 0.717
AlaPhe: 2.967 ± 0.973
AlaGly: 7.418 ± 2.309
AlaHis: 2.226 ± 0.537
AlaIle: 2.967 ± 0.605
AlaLys: 3.709 ± 3.314
AlaLeu: 11.128 ± 1.893
AlaMet: 2.967 ± 0.973
AlaAsn: 4.451 ± 1.909
AlaPro: 3.709 ± 1.345
AlaGln: 3.709 ± 1.56
AlaArg: 8.16 ± 2.113
AlaSer: 2.967 ± 0.717
AlaThr: 4.451 ± 1.764
AlaVal: 3.709 ± 1.408
AlaTrp: 1.484 ± 0.636
AlaTyr: 2.967 ± 0.916
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.742 ± 0.548
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.742 ± 0.74
CysGly: 0.742 ± 0.74
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.742 ± 0.74
CysLeu: 1.484 ± 1.048
CysMet: 0.0 ± 0.0
CysAsn: 0.742 ± 0.548
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.484 ± 1.048
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.193 ± 1.73
AspCys: 0.0 ± 0.0
AspAsp: 3.709 ± 1.727
AspGlu: 2.226 ± 1.306
AspPhe: 3.709 ± 1.311
AspGly: 3.709 ± 1.945
AspHis: 0.0 ± 0.0
AspIle: 1.484 ± 1.48
AspLys: 2.226 ± 1.086
AspLeu: 5.193 ± 0.935
AspMet: 0.0 ± 0.0
AspAsn: 1.484 ± 1.753
AspPro: 4.451 ± 1.606
AspGln: 2.967 ± 1.236
AspArg: 1.484 ± 0.636
AspSer: 2.967 ± 2.192
AspThr: 2.226 ± 0.855
AspVal: 2.967 ± 0.698
AspTrp: 0.742 ± 0.548
AspTyr: 2.967 ± 1.538
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.935 ± 1.36
GluCys: 0.0 ± 0.0
GluAsp: 2.967 ± 0.916
GluGlu: 3.709 ± 1.399
GluPhe: 2.967 ± 1.349
GluGly: 2.967 ± 1.236
GluHis: 1.484 ± 0.636
GluIle: 2.226 ± 0.982
GluLys: 2.226 ± 1.139
GluLeu: 3.709 ± 0.631
GluMet: 0.742 ± 0.643
GluAsn: 2.226 ± 1.11
GluPro: 1.484 ± 1.56
GluGln: 3.709 ± 2.42
GluArg: 2.226 ± 1.288
GluSer: 3.709 ± 1.215
GluThr: 2.226 ± 1.663
GluVal: 5.193 ± 1.734
GluTrp: 2.226 ± 0.998
GluTyr: 3.709 ± 1.311
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.967 ± 0.698
PheCys: 0.0 ± 0.0
PheAsp: 2.226 ± 1.086
PheGlu: 1.484 ± 1.096
PhePhe: 1.484 ± 1.096
PheGly: 3.709 ± 0.915
PheHis: 0.0 ± 0.0
PheIle: 2.226 ± 0.982
PheLys: 0.742 ± 0.78
PheLeu: 2.967 ± 1.402
PheMet: 0.742 ± 0.643
PheAsn: 2.226 ± 1.37
PhePro: 2.226 ± 0.818
PheGln: 0.742 ± 0.548
PheArg: 1.484 ± 1.096
PheSer: 3.709 ± 1.286
PheThr: 1.484 ± 0.675
PheVal: 1.484 ± 0.675
PheTrp: 0.0 ± 0.0
PheTyr: 0.742 ± 0.74
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.709 ± 1.827
GlyCys: 0.0 ± 0.0
GlyAsp: 6.677 ± 1.078
GlyGlu: 3.709 ± 2.222
GlyPhe: 2.967 ± 0.717
GlyGly: 8.902 ± 2.087
GlyHis: 1.484 ± 0.636
GlyIle: 5.193 ± 1.4
GlyLys: 2.967 ± 2.127
GlyLeu: 3.709 ± 1.56
GlyMet: 0.742 ± 0.74
GlyAsn: 5.193 ± 1.324
GlyPro: 2.967 ± 1.273
GlyGln: 2.967 ± 1.125
GlyArg: 2.967 ± 2.012
GlySer: 8.902 ± 2.258
GlyThr: 2.967 ± 1.273
GlyVal: 5.935 ± 3.601
GlyTrp: 0.0 ± 0.0
GlyTyr: 4.451 ± 1.42
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.742 ± 0.877
HisCys: 0.742 ± 0.877
HisAsp: 0.0 ± 0.0
HisGlu: 0.742 ± 0.74
HisPhe: 0.0 ± 0.0
HisGly: 2.226 ± 0.982
HisHis: 0.742 ± 0.548
HisIle: 0.742 ± 0.548
HisLys: 2.226 ± 1.663
HisLeu: 2.226 ± 1.435
HisMet: 0.742 ± 0.643
HisAsn: 0.0 ± 0.0
HisPro: 2.967 ± 1.125
HisGln: 0.742 ± 0.643
HisArg: 2.967 ± 0.916
HisSer: 2.226 ± 1.644
HisThr: 0.742 ± 0.643
HisVal: 0.742 ± 0.548
HisTrp: 2.226 ± 0.537
HisTyr: 1.484 ± 0.675
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.451 ± 2.266
IleCys: 0.0 ± 0.0
IleAsp: 2.967 ± 1.242
IleGlu: 2.226 ± 1.306
IlePhe: 2.226 ± 0.982
IleGly: 4.451 ± 1.472
IleHis: 2.226 ± 0.837
IleIle: 5.193 ± 1.929
IleLys: 5.193 ± 1.346
IleLeu: 2.967 ± 1.349
IleMet: 2.226 ± 0.562
IleAsn: 1.484 ± 1.048
IlePro: 2.226 ± 1.288
IleGln: 3.709 ± 1.135
IleArg: 2.967 ± 0.605
IleSer: 4.451 ± 1.227
IleThr: 3.709 ± 2.037
IleVal: 0.742 ± 0.877
IleTrp: 1.484 ± 1.096
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.484 ± 0.83
LysCys: 0.0 ± 0.0
LysAsp: 2.967 ± 2.012
LysGlu: 5.193 ± 2.431
LysPhe: 2.226 ± 1.139
LysGly: 2.967 ± 1.305
LysHis: 3.709 ± 2.072
LysIle: 5.193 ± 2.436
LysLys: 3.709 ± 2.244
LysLeu: 5.935 ± 2.375
LysMet: 1.484 ± 0.8
LysAsn: 1.484 ± 1.068
LysPro: 5.935 ± 2.063
LysGln: 2.226 ± 1.11
LysArg: 4.451 ± 1.539
LysSer: 2.967 ± 1.223
LysThr: 2.226 ± 1.138
LysVal: 0.0 ± 0.0
LysTrp: 0.742 ± 0.643
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.935 ± 0.809
LeuCys: 0.742 ± 0.74
LeuAsp: 1.484 ± 0.675
LeuGlu: 4.451 ± 2.169
LeuPhe: 4.451 ± 1.694
LeuGly: 8.16 ± 1.57
LeuHis: 1.484 ± 0.855
LeuIle: 5.193 ± 1.556
LeuLys: 4.451 ± 2.3
LeuLeu: 2.226 ± 0.818
LeuMet: 2.226 ± 1.342
LeuAsn: 2.226 ± 0.982
LeuPro: 5.193 ± 1.277
LeuGln: 8.902 ± 2.433
LeuArg: 7.418 ± 2.722
LeuSer: 7.418 ± 2.621
LeuThr: 5.193 ± 1.721
LeuVal: 4.451 ± 1.153
LeuTrp: 0.742 ± 0.74
LeuTyr: 2.967 ± 0.916
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.709 ± 1.766
MetCys: 0.0 ± 0.0
MetAsp: 0.742 ± 0.643
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 2.967 ± 1.759
MetHis: 0.742 ± 0.548
MetIle: 0.742 ± 0.643
MetLys: 0.742 ± 0.548
MetLeu: 2.226 ± 1.306
MetMet: 0.0 ± 0.522
MetAsn: 0.742 ± 0.643
MetPro: 1.484 ± 0.83
MetGln: 1.484 ± 0.77
MetArg: 0.0 ± 0.0
MetSer: 1.484 ± 1.068
MetThr: 1.484 ± 0.83
MetVal: 1.484 ± 0.986
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.709 ± 1.269
AsnCys: 0.0 ± 0.0
AsnAsp: 3.709 ± 0.631
AsnGlu: 0.742 ± 0.548
AsnPhe: 0.0 ± 0.0
AsnGly: 1.484 ± 0.636
AsnHis: 1.484 ± 0.636
AsnIle: 3.709 ± 2.705
AsnLys: 2.226 ± 0.537
AsnLeu: 5.935 ± 1.183
AsnMet: 2.226 ± 0.998
AsnAsn: 1.484 ± 0.636
AsnPro: 2.226 ± 1.139
AsnGln: 1.484 ± 0.83
AsnArg: 5.935 ± 1.578
AsnSer: 2.967 ± 2.021
AsnThr: 0.0 ± 0.0
AsnVal: 1.484 ± 0.636
AsnTrp: 0.742 ± 0.643
AsnTyr: 3.709 ± 1.799
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.709 ± 1.06
ProCys: 0.742 ± 0.74
ProAsp: 2.967 ± 1.568
ProGlu: 4.451 ± 1.636
ProPhe: 1.484 ± 0.675
ProGly: 0.742 ± 0.548
ProHis: 1.484 ± 0.675
ProIle: 2.226 ± 1.086
ProLys: 5.193 ± 1.568
ProLeu: 6.677 ± 0.942
ProMet: 0.0 ± 0.0
ProAsn: 2.226 ± 0.537
ProPro: 5.935 ± 2.741
ProGln: 2.226 ± 2.341
ProArg: 1.484 ± 1.048
ProSer: 6.677 ± 4.149
ProThr: 2.967 ± 1.71
ProVal: 2.226 ± 0.998
ProTrp: 1.484 ± 0.77
ProTyr: 3.709 ± 1.999
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.451 ± 1.702
GlnCys: 0.742 ± 0.74
GlnAsp: 2.226 ± 0.998
GlnGlu: 4.451 ± 1.42
GlnPhe: 1.484 ± 0.77
GlnGly: 3.709 ± 1.855
GlnHis: 1.484 ± 0.675
GlnIle: 2.226 ± 1.288
GlnLys: 2.967 ± 1.349
GlnLeu: 0.742 ± 0.877
GlnMet: 0.742 ± 0.643
GlnAsn: 2.226 ± 0.537
GlnPro: 1.484 ± 0.986
GlnGln: 3.709 ± 0.915
GlnArg: 4.451 ± 1.261
GlnSer: 3.709 ± 1.766
GlnThr: 4.451 ± 1.217
GlnVal: 1.484 ± 1.56
GlnTrp: 0.742 ± 0.643
GlnTyr: 0.742 ± 0.74
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.677 ± 1.7
ArgCys: 0.742 ± 0.548
ArgAsp: 4.451 ± 0.831
ArgGlu: 6.677 ± 2.167
ArgPhe: 0.742 ± 0.74
ArgGly: 0.0 ± 0.0
ArgHis: 1.484 ± 1.48
ArgIle: 4.451 ± 1.963
ArgLys: 2.967 ± 1.223
ArgLeu: 6.677 ± 2.458
ArgMet: 1.484 ± 1.026
ArgAsn: 4.451 ± 4.353
ArgPro: 2.967 ± 0.916
ArgGln: 0.742 ± 0.548
ArgArg: 8.16 ± 3.938
ArgSer: 5.193 ± 1.648
ArgThr: 4.451 ± 1.673
ArgVal: 2.226 ± 1.643
ArgTrp: 0.742 ± 0.74
ArgTyr: 5.935 ± 2.028
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.902 ± 2.648
SerCys: 1.484 ± 0.675
SerAsp: 2.967 ± 1.236
SerGlu: 3.709 ± 1.223
SerPhe: 1.484 ± 0.636
SerGly: 8.16 ± 2.382
SerHis: 0.0 ± 0.0
SerIle: 2.967 ± 0.717
SerLys: 3.709 ± 1.062
SerLeu: 8.16 ± 1.334
SerMet: 1.484 ± 0.986
SerAsn: 2.967 ± 1.48
SerPro: 3.709 ± 0.631
SerGln: 2.967 ± 1.749
SerArg: 5.193 ± 2.717
SerSer: 11.869 ± 2.212
SerThr: 4.451 ± 1.41
SerVal: 5.193 ± 2.464
SerTrp: 0.0 ± 0.0
SerTyr: 1.484 ± 1.753
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.709 ± 1.06
ThrCys: 0.742 ± 0.74
ThrAsp: 1.484 ± 0.675
ThrGlu: 2.967 ± 1.48
ThrPhe: 0.742 ± 0.877
ThrGly: 5.193 ± 2.26
ThrHis: 2.226 ± 1.589
ThrIle: 4.451 ± 1.722
ThrLys: 5.193 ± 2.436
ThrLeu: 4.451 ± 1.606
ThrMet: 1.484 ± 1.286
ThrAsn: 1.484 ± 0.636
ThrPro: 1.484 ± 1.023
ThrGln: 0.742 ± 0.643
ThrArg: 3.709 ± 1.062
ThrSer: 5.935 ± 1.848
ThrThr: 3.709 ± 2.055
ThrVal: 0.742 ± 0.548
ThrTrp: 0.742 ± 0.548
ThrTyr: 1.484 ± 1.48
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.193 ± 2.251
ValCys: 0.742 ± 0.877
ValAsp: 2.226 ± 0.805
ValGlu: 1.484 ± 1.286
ValPhe: 1.484 ± 1.096
ValGly: 3.709 ± 1.135
ValHis: 0.742 ± 0.548
ValIle: 1.484 ± 1.753
ValLys: 1.484 ± 0.636
ValLeu: 2.967 ± 1.48
ValMet: 0.742 ± 0.548
ValAsn: 3.709 ± 1.2
ValPro: 5.935 ± 1.156
ValGln: 0.742 ± 0.643
ValArg: 2.226 ± 1.644
ValSer: 2.967 ± 1.236
ValThr: 2.226 ± 1.435
ValVal: 1.484 ± 1.286
ValTrp: 0.0 ± 0.0
ValTyr: 2.226 ± 1.138
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.742 ± 0.548
TrpCys: 0.0 ± 0.0
TrpAsp: 0.742 ± 0.548
TrpGlu: 2.967 ± 0.961
TrpPhe: 0.742 ± 0.548
TrpGly: 1.484 ± 0.83
TrpHis: 0.742 ± 0.548
TrpIle: 0.0 ± 0.0
TrpLys: 0.742 ± 0.643
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 2.226 ± 0.998
TrpPro: 0.742 ± 0.74
TrpGln: 0.742 ± 0.548
TrpArg: 0.742 ± 0.643
TrpSer: 0.0 ± 0.0
TrpThr: 0.742 ± 0.74
TrpVal: 0.742 ± 0.643
TrpTrp: 0.742 ± 0.643
TrpTyr: 0.742 ± 0.548
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.709 ± 1.945
TyrCys: 0.0 ± 0.0
TyrAsp: 1.484 ± 0.77
TyrGlu: 1.484 ± 0.986
TyrPhe: 0.742 ± 0.548
TyrGly: 3.709 ± 1.942
TyrHis: 1.484 ± 0.83
TyrIle: 2.226 ± 0.982
TyrLys: 1.484 ± 0.855
TyrLeu: 5.935 ± 1.156
TyrMet: 0.0 ± 0.0
TyrAsn: 2.226 ± 1.051
TyrPro: 1.484 ± 1.048
TyrGln: 3.709 ± 0.477
TyrArg: 3.709 ± 1.345
TyrSer: 0.742 ± 0.877
TyrThr: 2.967 ± 1.305
TyrVal: 1.484 ± 1.048
TyrTrp: 0.742 ± 0.548
TyrTyr: 0.742 ± 0.643
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1349 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
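One way to read the note above is sketched below in Python: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those replicates is reported as the error. Only "bootstrapping (×100) at the protein level" comes from the note itself. Using the standard deviation of the replicates as the error, the fixed random seed, the function names and the toy sequences are assumptions made for this example.

```python
import random
import statistics


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the replicate spread."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        replicates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.pstdev(replicates)


# Hypothetical toy proteome, not the five proteins summarised above.
toy = ["MKALLAG", "MAAGSSA", "MSSGLLK"]
point = dipeptide_per_mille(toy, "AG")
error = bootstrap_error(toy, "AG")
print(f"AG: {point:.3f} ± {error:.3f}")
```

Because whole proteins are resampled rather than individual residues, a dipeptide concentrated in a single protein gets a large error, which is consistent with the wide intervals seen for a proteome of only 5 proteins.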

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski