Amino acid dipeptide frequency for Apis mellifera associated microvirus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
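As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input alphabet, and the choice to count pairs within each protein separately are assumptions made for this example; the page does not state how the database normalizes its counts, and the two sequences are toy data, not the real proteome.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return all 441 dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (assumption of this sketch).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy sequences, not the real proteome.
freqs = dipeptide_frequencies(["MKTAYIAKQR", "MSSAAGLV"])

# Expected value if all 400 standard dipeptides were equally likely.
uniform_expectation = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %
```

The resulting dictionary always has 441 keys, so absent dipeptides appear with a frequency of 0.0, as in the table.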

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
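The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the rule of comparing against 2.5‰ simply restates the 1/400 expectation given above; the two example values are taken from the table (SerAla and AlaHis).

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"   # would be marked in red
    if value < expected:
        return "under"  # would be marked in blue
    return "expected"


# Two values copied from the table.
examples = {"SerAla": 12.987, "AlaHis": 0.649}
labels = {name: classify(value) for name, value in examples.items()}
fractions = {name: per_mille_to_fraction(value) for name, value in examples.items()}
```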

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency in ‰ ± bootstrap error, grouped by first residue. Second residue order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 8.442 ± 2.635
AlaCys: 1.299 ± 0.908
AlaAsp: 4.545 ± 1.954
AlaGlu: 4.545 ± 3.079
AlaPhe: 1.948 ± 0.867
AlaGly: 9.74 ± 3.883
AlaHis: 0.649 ± 0.464
AlaIle: 5.195 ± 1.234
AlaLys: 2.597 ± 1.831
AlaLeu: 3.896 ± 1.051
AlaMet: 0.649 ± 0.464
AlaAsn: 3.896 ± 2.219
AlaPro: 3.247 ± 1.79
AlaGln: 7.143 ± 2.967
AlaArg: 8.442 ± 3.69
AlaSer: 7.143 ± 2.277
AlaThr: 6.494 ± 2.77
AlaVal: 5.844 ± 2.019
AlaTrp: 0.649 ± 0.595
AlaTyr: 3.247 ± 1.586
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.649 ± 0.595
CysAsp: 1.299 ± 0.966
CysGlu: 1.299 ± 0.825
CysPhe: 0.649 ± 0.595
CysGly: 1.299 ± 0.893
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 1.299 ± 0.825
CysMet: 0.649 ± 0.595
CysAsn: 0.0 ± 0.0
CysPro: 0.649 ± 0.65
CysGln: 0.0 ± 0.0
CysArg: 1.299 ± 0.893
CysSer: 3.247 ± 1.96
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.649 ± 0.772
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.896 ± 1.314
AspCys: 0.0 ± 0.0
AspAsp: 4.545 ± 1.815
AspGlu: 1.948 ± 1.087
AspPhe: 4.545 ± 1.441
AspGly: 1.948 ± 0.867
AspHis: 0.649 ± 0.464
AspIle: 1.948 ± 1.399
AspLys: 1.948 ± 1.212
AspLeu: 6.494 ± 1.93
AspMet: 3.896 ± 1.462
AspAsn: 0.649 ± 0.464
AspPro: 1.948 ± 1.699
AspGln: 1.948 ± 0.566
AspArg: 3.896 ± 1.263
AspSer: 0.649 ± 0.464
AspThr: 1.948 ± 0.98
AspVal: 5.844 ± 2.019
AspTrp: 0.0 ± 0.0
AspTyr: 3.247 ± 1.087
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.948 ± 1.491
GluCys: 0.649 ± 0.595
GluAsp: 1.299 ± 1.146
GluGlu: 1.299 ± 0.928
GluPhe: 5.195 ± 2.79
GluGly: 1.948 ± 1.032
GluHis: 1.299 ± 0.916
GluIle: 2.597 ± 0.967
GluLys: 0.0 ± 0.0
GluLeu: 4.545 ± 1.279
GluMet: 1.299 ± 0.581
GluAsn: 2.597 ± 1.247
GluPro: 1.299 ± 1.023
GluGln: 3.896 ± 2.409
GluArg: 3.247 ± 1.052
GluSer: 5.195 ± 1.52
GluThr: 1.948 ± 0.957
GluVal: 1.299 ± 0.928
GluTrp: 1.299 ± 0.581
GluTyr: 1.948 ± 1.081
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.299 ± 0.928
PheCys: 3.247 ± 3.256
PheAsp: 1.948 ± 1.822
PheGlu: 2.597 ± 0.949
PhePhe: 5.195 ± 2.173
PheGly: 3.896 ± 1.502
PheHis: 0.649 ± 0.595
PheIle: 1.299 ± 0.825
PheLys: 2.597 ± 1.931
PheLeu: 3.896 ± 1.519
PheMet: 2.597 ± 1.027
PheAsn: 1.299 ± 0.732
PhePro: 3.896 ± 1.411
PheGln: 3.247 ± 1.643
PheArg: 6.494 ± 1.628
PheSer: 1.948 ± 1.237
PheThr: 1.948 ± 0.914
PheVal: 3.247 ± 2.409
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.195 ± 1.834
GlyCys: 0.649 ± 0.772
GlyAsp: 4.545 ± 1.697
GlyGlu: 7.792 ± 2.166
GlyPhe: 1.299 ± 0.825
GlyGly: 5.195 ± 1.841
GlyHis: 1.299 ± 0.581
GlyIle: 2.597 ± 2.052
GlyLys: 2.597 ± 0.732
GlyLeu: 7.792 ± 2.954
GlyMet: 0.0 ± 0.0
GlyAsn: 2.597 ± 1.398
GlyPro: 5.195 ± 2.048
GlyGln: 3.896 ± 1.168
GlyArg: 0.649 ± 0.595
GlySer: 10.39 ± 2.2
GlyThr: 5.195 ± 3.092
GlyVal: 9.74 ± 1.966
GlyTrp: 0.649 ± 0.65
GlyTyr: 1.948 ± 0.933
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.948 ± 0.748
HisCys: 0.0 ± 0.0
HisAsp: 3.247 ± 1.284
HisGlu: 0.649 ± 0.595
HisPhe: 3.247 ± 0.854
HisGly: 1.299 ± 0.581
HisHis: 0.0 ± 0.0
HisIle: 0.649 ± 0.772
HisLys: 0.649 ± 0.464
HisLeu: 1.299 ± 0.928
HisMet: 0.0 ± 0.0
HisAsn: 0.649 ± 0.639
HisPro: 2.597 ± 1.833
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.649 ± 0.464
HisThr: 1.299 ± 0.581
HisVal: 0.649 ± 0.595
HisTrp: 0.649 ± 0.464
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.545 ± 1.508
IleCys: 0.0 ± 0.0
IleAsp: 1.948 ± 0.834
IleGlu: 1.948 ± 0.566
IlePhe: 0.649 ± 0.772
IleGly: 5.195 ± 1.085
IleHis: 0.0 ± 0.0
IleIle: 1.948 ± 1.822
IleLys: 0.649 ± 0.639
IleLeu: 1.948 ± 1.491
IleMet: 1.948 ± 0.887
IleAsn: 3.247 ± 1.254
IlePro: 1.299 ± 1.246
IleGln: 0.649 ± 0.464
IleArg: 5.195 ± 1.886
IleSer: 4.545 ± 1.715
IleThr: 5.195 ± 1.91
IleVal: 1.948 ± 1.094
IleTrp: 0.0 ± 0.0
IleTyr: 3.896 ± 1.502
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.948 ± 1.071
LysCys: 1.299 ± 0.966
LysAsp: 1.299 ± 0.966
LysGlu: 2.597 ± 0.702
LysPhe: 0.649 ± 0.464
LysGly: 1.299 ± 0.581
LysHis: 0.0 ± 0.0
LysIle: 2.597 ± 1.465
LysLys: 1.948 ± 0.566
LysLeu: 1.948 ± 1.463
LysMet: 1.948 ± 1.399
LysAsn: 1.299 ± 1.061
LysPro: 2.597 ± 0.983
LysGln: 1.948 ± 0.891
LysArg: 3.247 ± 2.226
LysSer: 1.948 ± 1.42
LysThr: 1.948 ± 0.915
LysVal: 1.948 ± 1.699
LysTrp: 0.0 ± 0.0
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.442 ± 2.065
LeuCys: 1.299 ± 0.893
LeuAsp: 3.896 ± 1.201
LeuGlu: 3.896 ± 1.593
LeuPhe: 4.545 ± 2.239
LeuGly: 5.844 ± 1.05
LeuHis: 0.649 ± 0.772
LeuIle: 3.247 ± 1.288
LeuLys: 2.597 ± 1.338
LeuLeu: 7.792 ± 1.966
LeuMet: 0.649 ± 0.652
LeuAsn: 4.545 ± 1.673
LeuPro: 8.442 ± 2.827
LeuGln: 3.247 ± 0.803
LeuArg: 5.844 ± 2.308
LeuSer: 6.494 ± 2.337
LeuThr: 6.494 ± 1.302
LeuVal: 3.896 ± 1.247
LeuTrp: 0.649 ± 0.464
LeuTyr: 3.896 ± 1.315
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.948 ± 1.661
MetCys: 0.649 ± 0.595
MetAsp: 1.299 ± 0.74
MetGlu: 1.299 ± 0.966
MetPhe: 0.0 ± 0.0
MetGly: 3.247 ± 0.914
MetHis: 2.597 ± 1.299
MetIle: 2.597 ± 0.99
MetLys: 1.299 ± 1.926
MetLeu: 1.299 ± 0.651
MetMet: 0.0 ± 0.0
MetAsn: 1.299 ± 1.105
MetPro: 1.299 ± 1.19
MetGln: 0.649 ± 0.595
MetArg: 3.247 ± 1.283
MetSer: 2.597 ± 1.388
MetThr: 0.0 ± 0.0
MetVal: 1.948 ± 0.867
MetTrp: 0.0 ± 0.0
MetTyr: 1.299 ± 0.928
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.896 ± 1.954
AsnCys: 0.0 ± 0.0
AsnAsp: 1.299 ± 0.74
AsnGlu: 1.948 ± 1.502
AsnPhe: 0.0 ± 0.0
AsnGly: 1.299 ± 0.749
AsnHis: 0.649 ± 0.464
AsnIle: 1.299 ± 0.74
AsnLys: 1.299 ± 0.651
AsnLeu: 7.143 ± 2.003
AsnMet: 0.649 ± 0.595
AsnAsn: 0.649 ± 0.639
AsnPro: 2.597 ± 0.732
AsnGln: 0.649 ± 0.464
AsnArg: 1.948 ± 0.566
AsnSer: 1.299 ± 0.749
AsnThr: 1.299 ± 0.916
AsnVal: 5.195 ± 2.684
AsnTrp: 0.649 ± 0.464
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.896 ± 1.289
ProCys: 0.0 ± 0.0
ProAsp: 2.597 ± 1.052
ProGlu: 4.545 ± 2.628
ProPhe: 2.597 ± 1.188
ProGly: 3.247 ± 1.541
ProHis: 1.299 ± 1.19
ProIle: 2.597 ± 1.069
ProLys: 1.299 ± 0.966
ProLeu: 4.545 ± 1.836
ProMet: 2.597 ± 0.876
ProAsn: 1.299 ± 0.581
ProPro: 0.649 ± 0.464
ProGln: 3.247 ± 1.449
ProArg: 3.247 ± 0.954
ProSer: 4.545 ± 2.459
ProThr: 3.247 ± 1.362
ProVal: 7.143 ± 1.156
ProTrp: 2.597 ± 0.975
ProTyr: 0.649 ± 0.65
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.494 ± 3.618
GlnCys: 0.649 ± 0.595
GlnAsp: 1.299 ± 0.732
GlnGlu: 1.299 ± 0.651
GlnPhe: 1.299 ± 0.69
GlnGly: 3.247 ± 0.593
GlnHis: 1.299 ± 1.023
GlnIle: 5.195 ± 2.292
GlnLys: 1.299 ± 0.749
GlnLeu: 3.247 ± 1.376
GlnMet: 0.649 ± 0.639
GlnAsn: 1.299 ± 1.023
GlnPro: 0.0 ± 0.0
GlnGln: 1.299 ± 0.74
GlnArg: 3.247 ± 1.288
GlnSer: 1.299 ± 1.023
GlnThr: 2.597 ± 1.303
GlnVal: 2.597 ± 1.247
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.299 ± 1.278
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.545 ± 0.992
ArgCys: 0.649 ± 0.595
ArgAsp: 3.896 ± 1.411
ArgGlu: 1.948 ± 0.957
ArgPhe: 3.896 ± 2.021
ArgGly: 6.494 ± 1.298
ArgHis: 3.247 ± 1.052
ArgIle: 1.948 ± 1.785
ArgLys: 1.948 ± 1.445
ArgLeu: 8.442 ± 2.573
ArgMet: 3.247 ± 1.073
ArgAsn: 1.299 ± 1.356
ArgPro: 2.597 ± 1.163
ArgGln: 1.948 ± 1.491
ArgArg: 3.896 ± 1.744
ArgSer: 7.143 ± 1.948
ArgThr: 1.948 ± 1.081
ArgVal: 1.948 ± 0.805
ArgTrp: 0.0 ± 0.0
ArgTyr: 5.844 ± 2.205
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 12.987 ± 2.819
SerCys: 0.649 ± 0.464
SerAsp: 5.195 ± 1.637
SerGlu: 1.948 ± 0.566
SerPhe: 3.247 ± 1.29
SerGly: 3.896 ± 1.696
SerHis: 1.948 ± 0.867
SerIle: 2.597 ± 0.964
SerLys: 2.597 ± 1.796
SerLeu: 9.091 ± 2.652
SerMet: 2.597 ± 1.718
SerAsn: 1.948 ± 1.435
SerPro: 3.247 ± 1.647
SerGln: 3.247 ± 3.196
SerArg: 3.896 ± 1.252
SerSer: 10.39 ± 3.078
SerThr: 5.844 ± 2.942
SerVal: 7.792 ± 2.273
SerTrp: 0.0 ± 0.0
SerTyr: 1.948 ± 1.094
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.792 ± 2.18
ThrCys: 0.649 ± 0.772
ThrAsp: 1.299 ± 0.966
ThrGlu: 1.948 ± 0.933
ThrPhe: 3.896 ± 1.502
ThrGly: 8.442 ± 2.573
ThrHis: 1.299 ± 0.928
ThrIle: 1.299 ± 0.581
ThrLys: 0.649 ± 0.595
ThrLeu: 3.247 ± 1.695
ThrMet: 1.299 ± 0.797
ThrAsn: 1.299 ± 0.651
ThrPro: 4.545 ± 2.061
ThrGln: 0.649 ± 0.464
ThrArg: 3.896 ± 1.228
ThrSer: 5.844 ± 2.029
ThrThr: 1.948 ± 0.867
ThrVal: 5.195 ± 0.995
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.948 ± 0.748
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.844 ± 1.7
ValCys: 0.0 ± 0.0
ValAsp: 3.247 ± 1.288
ValGlu: 1.299 ± 1.659
ValPhe: 3.896 ± 1.613
ValGly: 7.143 ± 1.834
ValHis: 0.649 ± 0.963
ValIle: 4.545 ± 1.722
ValLys: 5.195 ± 1.959
ValLeu: 4.545 ± 1.735
ValMet: 3.247 ± 1.109
ValAsn: 1.948 ± 0.867
ValPro: 7.143 ± 2.488
ValGln: 0.0 ± 0.0
ValArg: 4.545 ± 1.771
ValSer: 5.195 ± 1.635
ValThr: 6.494 ± 2.175
ValVal: 3.896 ± 1.751
ValTrp: 1.299 ± 0.581
ValTyr: 3.247 ± 2.098
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.948 ± 0.805
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.299 ± 0.581
TrpGly: 0.649 ± 0.595
TrpHis: 0.649 ± 0.464
TrpIle: 0.0 ± 0.0
TrpLys: 0.649 ± 0.464
TrpLeu: 0.649 ± 0.772
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.649 ± 0.464
TrpGln: 0.0 ± 0.0
TrpArg: 0.649 ± 0.65
TrpSer: 0.649 ± 0.464
TrpThr: 0.649 ± 0.595
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.597 ± 0.949
TyrCys: 0.649 ± 0.464
TyrAsp: 3.247 ± 1.402
TyrGlu: 0.0 ± 0.0
TyrPhe: 3.896 ± 1.081
TyrGly: 3.896 ± 1.168
TyrHis: 0.649 ± 0.464
TyrIle: 2.597 ± 1.163
TyrLys: 0.649 ± 0.595
TyrLeu: 3.247 ± 2.098
TyrMet: 0.0 ± 0.0
TyrAsn: 1.948 ± 0.933
TyrPro: 1.948 ± 0.566
TyrGln: 1.948 ± 0.933
TyrArg: 0.0 ± 0.0
TyrSer: 3.896 ± 1.293
TyrThr: 0.649 ± 0.464
TyrVal: 3.247 ± 1.175
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.299 ± 0.581
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 8 proteins (1541 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
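A protein-level bootstrap of this kind can be sketched in Python as shown below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread is reported as the sample standard deviation. The exact statistic used by the database is not given here, and the function names, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences, not the real proteome.
toy_proteome = ["MKTAYIAKQR", "MSSAAGLV", "MAAKLLSA", "MGSVTPLR"]
error_aa = bootstrap_error(toy_proteome, "AA")
error_ww = bootstrap_error(toy_proteome, "WW")  # dipeptide absent everywhere
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in one protein yields a larger error than one spread evenly across the proteome. A dipeptide that never occurs has zero frequency in every resample and therefore zero error, which matches the 0.0 ± 0.0 entries in the table.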

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski