Amino acid dipeptide frequency for Apis mellifera associated microvirus 52

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
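A minimal sketch of how such a frequency could be computed is shown here. It assumes overlapping adjacent pairs are counted within each protein, that no pair spans two proteins, and that counts are normalised by the total number of pairs; the page does not state the exact counting rule used by the database, so treat these choices as assumptions. The function name dipeptide_frequencies and the toy sequences are hypothetical, and one-letter codes are used for brevity, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein; assumption: no pair
        # is formed across the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not the real proteome of this virus.
toy_proteome = ["MAAL", "MAL"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The toy input yields five pairs in total (MA, AA, AL from the first sequence; MA, AL from the second), so the frequencies sum to 1000‰.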

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
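The counts and the uniform expectation can be checked with a few lines of Python. This is only an illustration of the arithmetic; the variable names are hypothetical, and the residue order follows the column header of the table.

```python
from itertools import product

STANDARD = [
    "Ala", "Cys", "Asp", "Glu", "Phe", "Gly", "His", "Ile", "Lys", "Leu",
    "Met", "Asn", "Pro", "Gln", "Arg", "Ser", "Thr", "Val", "Trp", "Tyr",
]
WITH_XAA = STANDARD + ["Xaa"]  # Xaa stands for non-standard or unknown residues

# Ordered pairs: AlaCys and CysAla are different dipeptides.
pairs_20 = ["".join(p) for p in product(STANDARD, repeat=2)]
pairs_21 = ["".join(p) for p in product(WITH_XAA, repeat=2)]

expected_percent = 100.0 / len(pairs_20)     # uniform expectation in %
expected_per_mille = 1000.0 / len(pairs_20)  # the same value in per mille

print(len(pairs_20), len(pairs_21))
print(expected_percent, expected_per_mille)
```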

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
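As a sketch of how the table values might be handled, the snippet below converts per mille values to fractions and labels each value relative to the 0.25% (2.5‰) uniform expectation. The function names are hypothetical, and using exactly 2.5‰ as the colour threshold is an assumption; the page does not say how the site decides which cells to colour. The three sample values are taken from the table.

```python
EXPECTED_PER_MILLE = 2.5  # 0.25 % uniform expectation for 400 dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def colour(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Assumed rule: red above the expectation, blue below it."""
    if value_per_mille > expected:
        return "red"
    if value_per_mille < expected:
        return "blue"
    return "none"


# Values copied from the table (in per mille).
sample = {"AlaAla": 10.309, "AlaLeu": 13.255, "CysAla": 0.0}
for name, value in sample.items():
    print(name, per_mille_to_fraction(value), colour(value))
```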

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.309AlaAla: 10.309 ± 2.051
0.736AlaCys: 0.736 ± 0.497
5.891AlaAsp: 5.891 ± 2.56
6.627AlaGlu: 6.627 ± 2.827
2.209AlaPhe: 2.209 ± 1.173
8.1AlaGly: 8.1 ± 1.776
1.473AlaHis: 1.473 ± 0.87
5.155AlaIle: 5.155 ± 3.119
3.682AlaLys: 3.682 ± 1.222
13.255AlaLeu: 13.255 ± 2.906
1.473AlaMet: 1.473 ± 0.999
2.209AlaAsn: 2.209 ± 0.853
4.418AlaPro: 4.418 ± 1.225
5.155AlaGln: 5.155 ± 0.941
7.364AlaArg: 7.364 ± 1.639
5.155AlaSer: 5.155 ± 1.589
5.891AlaThr: 5.891 ± 1.956
4.418AlaVal: 4.418 ± 1.744
1.473AlaTrp: 1.473 ± 0.837
2.209AlaTyr: 2.209 ± 0.928
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.209CysAsp: 2.209 ± 1.173
0.0CysGlu: 0.0 ± 0.0
0.736CysPhe: 0.736 ± 0.63
0.736CysGly: 0.736 ± 0.63
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.473CysLys: 1.473 ± 1.021
0.736CysLeu: 0.736 ± 0.63
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.736CysVal: 0.736 ± 0.997
0.736CysTrp: 0.736 ± 0.497
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.891AspAla: 5.891 ± 2.311
0.0AspCys: 0.0 ± 0.0
2.209AspAsp: 2.209 ± 0.822
4.418AspGlu: 4.418 ± 2.395
2.946AspPhe: 2.946 ± 0.962
3.682AspGly: 3.682 ± 1.103
2.946AspHis: 2.946 ± 1.108
1.473AspIle: 1.473 ± 0.837
1.473AspLys: 1.473 ± 0.999
4.418AspLeu: 4.418 ± 1.428
2.946AspMet: 2.946 ± 2.098
0.736AspAsn: 0.736 ± 0.646
2.209AspPro: 2.209 ± 0.822
2.209AspGln: 2.209 ± 1.077
4.418AspArg: 4.418 ± 1.673
2.209AspSer: 2.209 ± 0.822
2.946AspThr: 2.946 ± 1.389
4.418AspVal: 4.418 ± 1.632
0.736AspTrp: 0.736 ± 0.497
2.209AspTyr: 2.209 ± 1.491
0.0AspXaa: 0.0 ± 0.0
Glu
4.418GluAla: 4.418 ± 2.796
0.0GluCys: 0.0 ± 0.0
5.891GluAsp: 5.891 ± 2.317
4.418GluGlu: 4.418 ± 1.734
1.473GluPhe: 1.473 ± 0.994
1.473GluGly: 1.473 ± 0.837
1.473GluHis: 1.473 ± 1.006
1.473GluIle: 1.473 ± 0.57
4.418GluLys: 4.418 ± 2.304
5.155GluLeu: 5.155 ± 1.632
0.736GluMet: 0.736 ± 0.63
2.946GluAsn: 2.946 ± 1.108
2.946GluPro: 2.946 ± 0.817
2.209GluGln: 2.209 ± 2.764
2.946GluArg: 2.946 ± 1.237
2.946GluSer: 2.946 ± 1.911
4.418GluThr: 4.418 ± 1.734
2.946GluVal: 2.946 ± 1.59
0.736GluTrp: 0.736 ± 0.646
2.946GluTyr: 2.946 ± 0.743
0.0GluXaa: 0.0 ± 0.0
Phe
3.682PheAla: 3.682 ± 1.006
0.0PheCys: 0.0 ± 0.0
2.209PheAsp: 2.209 ± 1.722
1.473PheGlu: 1.473 ± 0.644
2.946PhePhe: 2.946 ± 1.389
2.209PheGly: 2.209 ± 0.616
0.736PheHis: 0.736 ± 0.646
1.473PheIle: 1.473 ± 0.57
0.736PheLys: 0.736 ± 0.63
0.736PheLeu: 0.736 ± 0.646
1.473PheMet: 1.473 ± 0.551
0.736PheAsn: 0.736 ± 0.646
3.682PhePro: 3.682 ± 1.694
2.209PheGln: 2.209 ± 1.491
1.473PheArg: 1.473 ± 0.994
4.418PheSer: 4.418 ± 1.823
2.209PheThr: 2.209 ± 0.962
3.682PheVal: 3.682 ± 1.559
0.0PheTrp: 0.0 ± 0.0
1.473PheTyr: 1.473 ± 0.644
0.0PheXaa: 0.0 ± 0.0
Gly
9.573GlyAla: 9.573 ± 3.17
0.736GlyCys: 0.736 ± 0.63
2.209GlyAsp: 2.209 ± 0.616
5.155GlyGlu: 5.155 ± 1.679
2.946GlyPhe: 2.946 ± 0.743
8.837GlyGly: 8.837 ± 3.007
1.473GlyHis: 1.473 ± 0.57
4.418GlyIle: 4.418 ± 1.232
4.418GlyLys: 4.418 ± 2.063
5.155GlyLeu: 5.155 ± 1.67
0.0GlyMet: 0.0 ± 0.0
2.209GlyAsn: 2.209 ± 0.928
3.682GlyPro: 3.682 ± 1.006
5.155GlyGln: 5.155 ± 0.785
2.209GlyArg: 2.209 ± 1.247
3.682GlySer: 3.682 ± 1.301
5.155GlyThr: 5.155 ± 2.008
7.364GlyVal: 7.364 ± 1.173
0.736GlyTrp: 0.736 ± 0.646
3.682GlyTyr: 3.682 ± 1.082
0.0GlyXaa: 0.0 ± 0.0
His
2.946HisAla: 2.946 ± 1.466
0.736HisCys: 0.736 ± 0.997
0.0HisAsp: 0.0 ± 0.0
1.473HisGlu: 1.473 ± 1.137
2.209HisPhe: 2.209 ± 0.924
2.946HisGly: 2.946 ± 0.743
0.736HisHis: 0.736 ± 0.497
0.0HisIle: 0.0 ± 0.0
2.209HisLys: 2.209 ± 0.924
2.209HisLeu: 2.209 ± 1.393
0.736HisMet: 0.736 ± 0.497
0.736HisAsn: 0.736 ± 0.921
1.473HisPro: 1.473 ± 0.57
0.736HisGln: 0.736 ± 0.997
1.473HisArg: 1.473 ± 0.644
1.473HisSer: 1.473 ± 0.994
0.736HisThr: 0.736 ± 0.497
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
2.209HisTyr: 2.209 ± 1.393
0.0HisXaa: 0.0 ± 0.0
Ile
0.736IleAla: 0.736 ± 0.646
0.736IleCys: 0.736 ± 0.63
5.155IleAsp: 5.155 ± 0.68
2.209IleGlu: 2.209 ± 1.603
2.946IlePhe: 2.946 ± 1.108
3.682IleGly: 3.682 ± 1.006
2.209IleHis: 2.209 ± 0.924
0.0IleIle: 0.0 ± 0.0
0.736IleLys: 0.736 ± 0.646
0.736IleLeu: 0.736 ± 0.497
1.473IleMet: 1.473 ± 1.077
1.473IleAsn: 1.473 ± 0.57
0.736IlePro: 0.736 ± 0.646
3.682IleGln: 3.682 ± 1.68
4.418IleArg: 4.418 ± 0.995
3.682IleSer: 3.682 ± 1.384
2.209IleThr: 2.209 ± 0.962
1.473IleVal: 1.473 ± 0.644
0.0IleTrp: 0.0 ± 0.0
1.473IleTyr: 1.473 ± 0.994
0.0IleXaa: 0.0 ± 0.0
Lys
2.209LysAla: 2.209 ± 1.938
0.0LysCys: 0.0 ± 0.0
2.209LysAsp: 2.209 ± 1.334
3.682LysGlu: 3.682 ± 1.006
2.946LysPhe: 2.946 ± 1.002
1.473LysGly: 1.473 ± 0.644
2.209LysHis: 2.209 ± 0.962
8.1LysIle: 8.1 ± 3.139
5.155LysLys: 5.155 ± 2.269
3.682LysLeu: 3.682 ± 1.694
0.736LysMet: 0.736 ± 0.879
5.891LysAsn: 5.891 ± 2.311
2.209LysPro: 2.209 ± 1.981
0.0LysGln: 0.0 ± 0.0
2.209LysArg: 2.209 ± 1.173
2.946LysSer: 2.946 ± 1.054
4.418LysThr: 4.418 ± 1.848
2.946LysVal: 2.946 ± 1.579
0.736LysTrp: 0.736 ± 0.646
0.736LysTyr: 0.736 ± 0.63
0.0LysXaa: 0.0 ± 0.0
Leu
8.1LeuAla: 8.1 ± 2.22
0.736LeuCys: 0.736 ± 0.63
2.209LeuAsp: 2.209 ± 0.616
3.682LeuGlu: 3.682 ± 1.761
0.0LeuPhe: 0.0 ± 0.0
8.1LeuGly: 8.1 ± 2.096
0.736LeuHis: 0.736 ± 0.997
2.946LeuIle: 2.946 ± 0.962
1.473LeuLys: 1.473 ± 0.644
4.418LeuLeu: 4.418 ± 1.04
0.736LeuMet: 0.736 ± 0.497
5.155LeuAsn: 5.155 ± 1.512
5.155LeuPro: 5.155 ± 1.568
9.573LeuGln: 9.573 ± 1.682
7.364LeuArg: 7.364 ± 3.297
5.155LeuSer: 5.155 ± 2.207
4.418LeuThr: 4.418 ± 2.346
4.418LeuVal: 4.418 ± 1.04
1.473LeuTrp: 1.473 ± 1.26
2.209LeuTyr: 2.209 ± 0.962
0.0LeuXaa: 0.0 ± 0.0
Met
1.473MetAla: 1.473 ± 0.87
0.0MetCys: 0.0 ± 0.0
0.736MetAsp: 0.736 ± 0.63
0.736MetGlu: 0.736 ± 0.646
0.0MetPhe: 0.0 ± 0.0
2.946MetGly: 2.946 ± 1.141
0.736MetHis: 0.736 ± 0.497
0.0MetIle: 0.0 ± 0.0
2.209MetLys: 2.209 ± 0.928
1.473MetLeu: 1.473 ± 1.26
0.736MetMet: 0.736 ± 0.997
0.0MetAsn: 0.0 ± 0.0
0.736MetPro: 0.736 ± 0.646
0.0MetGln: 0.0 ± 0.0
0.736MetArg: 0.736 ± 0.921
1.473MetSer: 1.473 ± 1.26
0.736MetThr: 0.736 ± 0.63
0.736MetVal: 0.736 ± 0.497
0.736MetTrp: 0.736 ± 0.646
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
7.364AsnAla: 7.364 ± 1.892
0.0AsnCys: 0.0 ± 0.0
0.736AsnAsp: 0.736 ± 0.497
2.946AsnGlu: 2.946 ± 0.743
1.473AsnPhe: 1.473 ± 1.292
2.946AsnGly: 2.946 ± 0.962
0.736AsnHis: 0.736 ± 0.63
0.736AsnIle: 0.736 ± 0.646
2.209AsnLys: 2.209 ± 0.616
0.736AsnLeu: 0.736 ± 0.497
0.0AsnMet: 0.0 ± 0.0
3.682AsnAsn: 3.682 ± 1.937
2.946AsnPro: 2.946 ± 0.921
1.473AsnGln: 1.473 ± 1.292
7.364AsnArg: 7.364 ± 2.87
3.682AsnSer: 3.682 ± 1.235
2.209AsnThr: 2.209 ± 0.924
1.473AsnVal: 1.473 ± 0.57
0.0AsnTrp: 0.0 ± 0.0
1.473AsnTyr: 1.473 ± 1.006
0.0AsnXaa: 0.0 ± 0.0
Pro
5.155ProAla: 5.155 ± 0.908
1.473ProCys: 1.473 ± 1.26
5.891ProAsp: 5.891 ± 1.347
5.891ProGlu: 5.891 ± 2.311
2.946ProPhe: 2.946 ± 1.77
5.155ProGly: 5.155 ± 2.106
2.946ProHis: 2.946 ± 1.623
0.736ProIle: 0.736 ± 0.497
1.473ProLys: 1.473 ± 1.006
3.682ProLeu: 3.682 ± 1.218
0.736ProMet: 0.736 ± 0.646
2.209ProAsn: 2.209 ± 1.113
3.682ProPro: 3.682 ± 3.045
1.473ProGln: 1.473 ± 0.837
3.682ProArg: 3.682 ± 1.932
5.891ProSer: 5.891 ± 2.723
2.209ProThr: 2.209 ± 1.077
4.418ProVal: 4.418 ± 1.689
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.473GlnAla: 1.473 ± 1.137
0.736GlnCys: 0.736 ± 0.63
3.682GlnAsp: 3.682 ± 1.235
1.473GlnGlu: 1.473 ± 0.57
2.209GlnPhe: 2.209 ± 0.962
2.946GlnGly: 2.946 ± 1.141
0.736GlnHis: 0.736 ± 0.921
2.946GlnIle: 2.946 ± 1.76
4.418GlnLys: 4.418 ± 1.006
4.418GlnLeu: 4.418 ± 1.645
0.0GlnMet: 0.0 ± 0.0
3.682GlnAsn: 3.682 ± 1.168
1.473GlnPro: 1.473 ± 1.084
2.946GlnGln: 2.946 ± 1.237
3.682GlnArg: 3.682 ± 0.872
2.946GlnSer: 2.946 ± 1.141
1.473GlnThr: 1.473 ± 0.994
0.736GlnVal: 0.736 ± 0.997
0.736GlnTrp: 0.736 ± 0.646
2.209GlnTyr: 2.209 ± 0.962
0.0GlnXaa: 0.0 ± 0.0
Arg
9.573ArgAla: 9.573 ± 3.611
0.0ArgCys: 0.0 ± 0.0
4.418ArgAsp: 4.418 ± 1.157
2.209ArgGlu: 2.209 ± 1.247
1.473ArgPhe: 1.473 ± 0.994
1.473ArgGly: 1.473 ± 0.837
1.473ArgHis: 1.473 ± 0.999
4.418ArgIle: 4.418 ± 1.157
5.155ArgLys: 5.155 ± 2.653
5.891ArgLeu: 5.891 ± 2.91
0.736ArgMet: 0.736 ± 0.63
2.209ArgAsn: 2.209 ± 1.48
3.682ArgPro: 3.682 ± 1.154
0.736ArgGln: 0.736 ± 0.921
8.837ArgArg: 8.837 ± 3.437
6.627ArgSer: 6.627 ± 3.782
6.627ArgThr: 6.627 ± 2.4
1.473ArgVal: 1.473 ± 1.006
0.736ArgTrp: 0.736 ± 0.646
3.682ArgTyr: 3.682 ± 1.231
0.0ArgXaa: 0.0 ± 0.0
Ser
5.891SerAla: 5.891 ± 1.568
0.736SerCys: 0.736 ± 0.497
2.946SerAsp: 2.946 ± 1.268
3.682SerGlu: 3.682 ± 0.872
2.209SerPhe: 2.209 ± 1.113
4.418SerGly: 4.418 ± 1.519
0.0SerHis: 0.0 ± 0.0
0.736SerIle: 0.736 ± 0.646
2.946SerLys: 2.946 ± 0.748
6.627SerLeu: 6.627 ± 2.131
0.0SerMet: 0.0 ± 0.0
5.155SerAsn: 5.155 ± 2.116
8.837SerPro: 8.837 ± 1.974
2.209SerGln: 2.209 ± 0.911
5.155SerArg: 5.155 ± 2.587
2.209SerSer: 2.209 ± 1.234
8.837SerThr: 8.837 ± 3.773
2.209SerVal: 2.209 ± 1.491
0.736SerTrp: 0.736 ± 0.497
1.473SerTyr: 1.473 ± 0.837
0.0SerXaa: 0.0 ± 0.0
Thr
6.627ThrAla: 6.627 ± 1.463
0.0ThrCys: 0.0 ± 0.0
2.946ThrAsp: 2.946 ± 1.389
0.736ThrGlu: 0.736 ± 0.921
2.946ThrPhe: 2.946 ± 1.389
9.573ThrGly: 9.573 ± 2.204
1.473ThrHis: 1.473 ± 1.084
3.682ThrIle: 3.682 ± 1.852
2.946ThrLys: 2.946 ± 1.108
5.891ThrLeu: 5.891 ± 1.25
0.736ThrMet: 0.736 ± 0.646
2.209ThrAsn: 2.209 ± 0.853
5.891ThrPro: 5.891 ± 2.109
1.473ThrGln: 1.473 ± 0.57
3.682ThrArg: 3.682 ± 2.513
4.418ThrSer: 4.418 ± 0.969
12.518ThrThr: 12.518 ± 5.482
2.946ThrVal: 2.946 ± 1.59
0.736ThrTrp: 0.736 ± 0.497
1.473ThrTyr: 1.473 ± 0.644
0.0ThrXaa: 0.0 ± 0.0
Val
6.627ValAla: 6.627 ± 1.814
0.736ValCys: 0.736 ± 0.997
0.736ValAsp: 0.736 ± 0.921
2.209ValGlu: 2.209 ± 1.722
2.209ValPhe: 2.209 ± 0.962
3.682ValGly: 3.682 ± 2.007
0.736ValHis: 0.736 ± 0.63
1.473ValIle: 1.473 ± 0.994
3.682ValLys: 3.682 ± 1.857
4.418ValLeu: 4.418 ± 1.59
1.473ValMet: 1.473 ± 0.644
0.736ValAsn: 0.736 ± 0.497
3.682ValPro: 3.682 ± 1.006
1.473ValGln: 1.473 ± 0.57
1.473ValArg: 1.473 ± 1.006
3.682ValSer: 3.682 ± 1.652
3.682ValThr: 3.682 ± 1.103
0.736ValVal: 0.736 ± 0.63
1.473ValTrp: 1.473 ± 1.006
3.682ValTyr: 3.682 ± 1.066
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.473TrpGlu: 1.473 ± 0.994
0.0TrpPhe: 0.0 ± 0.0
2.946TrpGly: 2.946 ± 1.953
1.473TrpHis: 1.473 ± 1.006
0.0TrpIle: 0.0 ± 0.0
1.473TrpLys: 1.473 ± 0.837
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.473TrpAsn: 1.473 ± 0.994
0.0TrpPro: 0.0 ± 0.0
1.473TrpGln: 1.473 ± 1.292
0.0TrpArg: 0.0 ± 0.0
1.473TrpSer: 1.473 ± 0.837
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.736TrpTyr: 0.736 ± 0.63
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.155TyrAla: 5.155 ± 2.218
0.0TyrCys: 0.0 ± 0.0
2.209TyrAsp: 2.209 ± 1.077
1.473TyrGlu: 1.473 ± 0.87
0.736TyrPhe: 0.736 ± 0.646
1.473TyrGly: 1.473 ± 0.644
0.736TyrHis: 0.736 ± 0.63
0.736TyrIle: 0.736 ± 0.497
2.209TyrLys: 2.209 ± 1.173
4.418TyrLeu: 4.418 ± 1.924
0.736TyrMet: 0.736 ± 0.921
0.736TyrAsn: 0.736 ± 0.497
2.209TyrPro: 2.209 ± 0.616
0.736TyrGln: 0.736 ± 0.921
2.946TyrArg: 2.946 ± 1.274
2.946TyrSer: 2.946 ± 1.202
2.209TyrThr: 2.209 ± 1.356
1.473TyrVal: 1.473 ± 0.644
0.736TyrTrp: 0.736 ± 0.63
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1359 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
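One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is an assumption about the procedure, not the database's documented method; the function names, the fixed seed, the use of the population standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(resample, pair))
    return statistics.pstdev(estimates)


# Made-up toy sequences, not the real proteome of this virus.
toy_proteome = ["MAAL", "MAL", "MKAAAL", "MGL", "MAAK"]
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {dipeptide_per_mille(toy_proteome, 'AA'):.3f} ± {error:.3f}")
```

With only a handful of proteins, as in this proteome, each resample can differ considerably from the original set, which is consistent with the relatively large errors shown in the table.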

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski