Amino acid dipeptide frequency for Apis mellifera associated microvirus 43

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
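The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names (`dipeptide_per_mille`, `expected_uniform`, `classify`) are hypothetical, the input is assumed to be a list of protein sequences in one-letter code, and the over/under threshold is assumed to be the uniform expectation mentioned in the text.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter code
ALPHABET = STANDARD + "X"           # X (Xaa) stands for any non-standard/unknown residue


def dipeptide_per_mille(proteins):
    """Return the frequency of all 441 dipeptides in per mille.

    Dipeptides are counted as overlapping adjacent pairs within each
    protein; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (assumption).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def expected_uniform(n_letters=20):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / n_letters ** 2


def classify(value, expected):
    """Label a dipeptide relative to the expected frequency."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKAAL", "MAAGR"])  # toy sequences, not real data
    exp = expected_uniform(20)                       # 2.5 per mille for 400 dipeptides
    print("AA", round(freqs["AA"], 3), classify(freqs["AA"], exp))
```

With 20 letters the uniform expectation is 2.5‰; passing 21 to `expected_uniform` gives the value for the 441-dipeptide case that includes Xaa.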

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain a fraction (for example, 7.983‰ = 0.007983).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.983AlaAla: 7.983 ± 3.052
0.0AlaCys: 0.0 ± 0.0
5.08AlaAsp: 5.08 ± 1.707
2.903AlaGlu: 2.903 ± 1.111
3.628AlaPhe: 3.628 ± 1.09
7.983AlaGly: 7.983 ± 2.661
0.0AlaHis: 0.0 ± 0.0
2.177AlaIle: 2.177 ± 0.837
7.257AlaLys: 7.257 ± 3.174
8.708AlaLeu: 8.708 ± 1.642
1.451AlaMet: 1.451 ± 0.659
4.354AlaAsn: 4.354 ± 1.009
6.531AlaPro: 6.531 ± 3.463
6.531AlaGln: 6.531 ± 1.927
7.257AlaArg: 7.257 ± 1.941
5.806AlaSer: 5.806 ± 1.526
4.354AlaThr: 4.354 ± 2.314
2.903AlaVal: 2.903 ± 0.736
1.451AlaTrp: 1.451 ± 0.659
2.903AlaTyr: 2.903 ± 1.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.726CysAla: 0.726 ± 0.658
0.726CysCys: 0.726 ± 0.658
1.451CysAsp: 1.451 ± 0.676
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.451CysGly: 1.451 ± 0.676
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.726CysLys: 0.726 ± 0.872
0.0CysLeu: 0.0 ± 0.0
1.451CysMet: 1.451 ± 0.915
0.0CysAsn: 0.0 ± 0.0
0.726CysPro: 0.726 ± 0.658
0.726CysGln: 0.726 ± 0.658
0.726CysArg: 0.726 ± 0.658
0.726CysSer: 0.726 ± 0.495
0.0CysThr: 0.0 ± 0.0
1.451CysVal: 1.451 ± 0.814
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.354AspAla: 4.354 ± 2.401
0.0AspCys: 0.0 ± 0.0
4.354AspAsp: 4.354 ± 3.014
4.354AspGlu: 4.354 ± 1.673
3.628AspPhe: 3.628 ± 1.178
2.903AspGly: 2.903 ± 1.639
2.177AspHis: 2.177 ± 1.271
2.903AspIle: 2.903 ± 1.08
1.451AspLys: 1.451 ± 0.773
5.806AspLeu: 5.806 ± 1.659
2.177AspMet: 2.177 ± 0.832
0.0AspAsn: 0.0 ± 0.0
2.903AspPro: 2.903 ± 2.009
1.451AspGln: 1.451 ± 0.82
1.451AspArg: 1.451 ± 0.814
2.177AspSer: 2.177 ± 1.255
3.628AspThr: 3.628 ± 1.117
2.903AspVal: 2.903 ± 1.006
0.0AspTrp: 0.0 ± 0.0
3.628AspTyr: 3.628 ± 1.178
0.0AspXaa: 0.0 ± 0.0
Glu
4.354GluAla: 4.354 ± 2.294
0.0GluCys: 0.0 ± 0.0
4.354GluAsp: 4.354 ± 1.268
2.177GluGlu: 2.177 ± 0.832
2.903GluPhe: 2.903 ± 0.631
2.177GluGly: 2.177 ± 0.837
1.451GluHis: 1.451 ± 0.676
1.451GluIle: 1.451 ± 0.82
2.177GluLys: 2.177 ± 0.57
2.177GluLeu: 2.177 ± 1.484
2.903GluMet: 2.903 ± 1.57
2.903GluAsn: 2.903 ± 1.405
2.903GluPro: 2.903 ± 2.613
2.177GluGln: 2.177 ± 0.832
5.806GluArg: 5.806 ± 2.062
3.628GluSer: 3.628 ± 1.039
0.726GluThr: 0.726 ± 0.872
3.628GluVal: 3.628 ± 1.862
1.451GluTrp: 1.451 ± 0.989
2.903GluTyr: 2.903 ± 1.405
0.0GluXaa: 0.0 ± 0.0
Phe
5.08PheAla: 5.08 ± 2.032
0.0PheCys: 0.0 ± 0.0
0.726PheAsp: 0.726 ± 0.872
2.177PheGlu: 2.177 ± 1.958
0.0PhePhe: 0.0 ± 0.0
3.628PheGly: 3.628 ± 1.861
0.0PheHis: 0.0 ± 0.0
0.726PheIle: 0.726 ± 0.495
0.0PheLys: 0.0 ± 0.0
1.451PheLeu: 1.451 ± 0.659
0.0PheMet: 0.0 ± 0.0
2.177PheAsn: 2.177 ± 0.837
0.726PhePro: 0.726 ± 0.697
2.177PheGln: 2.177 ± 0.934
2.177PheArg: 2.177 ± 1.049
3.628PheSer: 3.628 ± 1.266
2.177PheThr: 2.177 ± 0.832
1.451PheVal: 1.451 ± 0.676
0.726PheTrp: 0.726 ± 0.495
0.726PheTyr: 0.726 ± 0.697
0.0PheXaa: 0.0 ± 0.0
Gly
7.257GlyAla: 7.257 ± 1.878
0.726GlyCys: 0.726 ± 0.658
4.354GlyAsp: 4.354 ± 1.15
3.628GlyGlu: 3.628 ± 1.244
0.726GlyPhe: 0.726 ± 0.697
6.531GlyGly: 6.531 ± 2.194
0.726GlyHis: 0.726 ± 0.495
5.08GlyIle: 5.08 ± 2.256
4.354GlyLys: 4.354 ± 0.943
4.354GlyLeu: 4.354 ± 1.223
0.0GlyMet: 0.0 ± 0.0
1.451GlyAsn: 1.451 ± 1.395
2.177GlyPro: 2.177 ± 1.484
5.08GlyGln: 5.08 ± 1.562
5.806GlyArg: 5.806 ± 2.398
7.983GlySer: 7.983 ± 1.639
3.628GlyThr: 3.628 ± 1.793
2.177GlyVal: 2.177 ± 1.484
0.726GlyTrp: 0.726 ± 0.872
5.08GlyTyr: 5.08 ± 1.361
0.0GlyXaa: 0.0 ± 0.0
His
0.726HisAla: 0.726 ± 0.495
0.726HisCys: 0.726 ± 0.658
0.726HisAsp: 0.726 ± 0.658
0.726HisGlu: 0.726 ± 0.658
1.451HisPhe: 1.451 ± 0.989
2.177HisGly: 2.177 ± 0.985
0.726HisHis: 0.726 ± 0.495
0.726HisIle: 0.726 ± 0.495
0.0HisLys: 0.0 ± 0.0
2.177HisLeu: 2.177 ± 1.457
0.0HisMet: 0.0 ± 0.0
0.726HisAsn: 0.726 ± 0.658
2.903HisPro: 2.903 ± 0.758
0.0HisGln: 0.0 ± 0.0
0.726HisArg: 0.726 ± 0.658
2.177HisSer: 2.177 ± 1.582
0.726HisThr: 0.726 ± 0.658
0.726HisVal: 0.726 ± 0.658
0.726HisTrp: 0.726 ± 0.495
0.726HisTyr: 0.726 ± 0.658
0.0HisXaa: 0.0 ± 0.0
Ile
5.08IleAla: 5.08 ± 1.775
0.726IleCys: 0.726 ± 0.844
0.726IleAsp: 0.726 ± 0.495
3.628IleGlu: 3.628 ± 2.473
1.451IlePhe: 1.451 ± 0.814
2.903IleGly: 2.903 ± 0.736
1.451IleHis: 1.451 ± 0.659
0.0IleIle: 0.0 ± 0.0
1.451IleLys: 1.451 ± 1.316
3.628IleLeu: 3.628 ± 1.831
0.726IleMet: 0.726 ± 0.658
0.0IleAsn: 0.0 ± 0.0
0.726IlePro: 0.726 ± 0.495
3.628IleGln: 3.628 ± 1.042
3.628IleArg: 3.628 ± 1.678
1.451IleSer: 1.451 ± 1.395
3.628IleThr: 3.628 ± 2.473
1.451IleVal: 1.451 ± 0.773
2.177IleTrp: 2.177 ± 0.985
3.628IleTyr: 3.628 ± 1.793
0.0IleXaa: 0.0 ± 0.0
Lys
2.177LysAla: 2.177 ± 1.616
1.451LysCys: 1.451 ± 0.676
3.628LysAsp: 3.628 ± 0.986
2.903LysGlu: 2.903 ± 0.892
0.726LysPhe: 0.726 ± 0.495
3.628LysGly: 3.628 ± 1.58
0.726LysHis: 0.726 ± 0.658
2.177LysIle: 2.177 ± 1.239
4.354LysLys: 4.354 ± 3.115
4.354LysLeu: 4.354 ± 1.15
2.903LysMet: 2.903 ± 1.502
2.177LysAsn: 2.177 ± 1.468
1.451LysPro: 1.451 ± 0.814
2.903LysGln: 2.903 ± 1.57
4.354LysArg: 4.354 ± 1.056
1.451LysSer: 1.451 ± 0.676
0.726LysThr: 0.726 ± 0.495
1.451LysVal: 1.451 ± 0.773
0.0LysTrp: 0.0 ± 0.0
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.531LeuAla: 6.531 ± 1.874
0.0LeuCys: 0.0 ± 0.0
3.628LeuAsp: 3.628 ± 1.52
3.628LeuGlu: 3.628 ± 2.045
0.726LeuPhe: 0.726 ± 0.658
9.434LeuGly: 9.434 ± 2.574
0.726LeuHis: 0.726 ± 0.658
7.257LeuIle: 7.257 ± 0.352
1.451LeuLys: 1.451 ± 1.0
5.08LeuLeu: 5.08 ± 1.541
0.726LeuMet: 0.726 ± 0.686
5.08LeuAsn: 5.08 ± 1.544
6.531LeuPro: 6.531 ± 1.827
4.354LeuGln: 4.354 ± 2.282
8.708LeuArg: 8.708 ± 2.066
8.708LeuSer: 8.708 ± 2.695
5.08LeuThr: 5.08 ± 2.528
5.08LeuVal: 5.08 ± 1.256
0.726LeuTrp: 0.726 ± 0.658
2.903LeuTyr: 2.903 ± 0.892
0.0LeuXaa: 0.0 ± 0.0
Met
5.08MetAla: 5.08 ± 1.043
0.726MetCys: 0.726 ± 0.872
0.0MetAsp: 0.0 ± 0.0
1.451MetGlu: 1.451 ± 1.395
0.0MetPhe: 0.0 ± 0.0
2.903MetGly: 2.903 ± 1.931
1.451MetHis: 1.451 ± 0.676
0.0MetIle: 0.0 ± 0.0
0.726MetLys: 0.726 ± 0.495
1.451MetLeu: 1.451 ± 0.676
0.726MetMet: 0.726 ± 0.697
0.726MetAsn: 0.726 ± 0.872
1.451MetPro: 1.451 ± 1.038
0.726MetGln: 0.726 ± 0.658
2.903MetArg: 2.903 ± 0.974
4.354MetSer: 4.354 ± 2.005
2.177MetThr: 2.177 ± 1.457
0.726MetVal: 0.726 ± 0.697
0.726MetTrp: 0.726 ± 0.697
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.354AsnAla: 4.354 ± 1.867
0.0AsnCys: 0.0 ± 0.0
2.903AsnAsp: 2.903 ± 0.938
2.177AsnGlu: 2.177 ± 0.837
0.0AsnPhe: 0.0 ± 0.0
1.451AsnGly: 1.451 ± 0.773
0.726AsnHis: 0.726 ± 0.495
1.451AsnIle: 1.451 ± 0.989
1.451AsnLys: 1.451 ± 1.316
2.903AsnLeu: 2.903 ± 1.572
2.903AsnMet: 2.903 ± 1.405
1.451AsnAsn: 1.451 ± 0.659
3.628AsnPro: 3.628 ± 0.705
2.177AsnGln: 2.177 ± 1.263
5.08AsnArg: 5.08 ± 2.163
2.903AsnSer: 2.903 ± 0.985
1.451AsnThr: 1.451 ± 0.773
2.177AsnVal: 2.177 ± 0.91
0.0AsnTrp: 0.0 ± 0.0
0.726AsnTyr: 0.726 ± 0.495
0.0AsnXaa: 0.0 ± 0.0
Pro
5.806ProAla: 5.806 ± 3.594
0.726ProCys: 0.726 ± 0.495
4.354ProAsp: 4.354 ± 1.242
5.08ProGlu: 5.08 ± 2.038
1.451ProPhe: 1.451 ± 0.989
2.177ProGly: 2.177 ± 0.57
0.726ProHis: 0.726 ± 0.658
2.177ProIle: 2.177 ± 0.854
1.451ProLys: 1.451 ± 0.82
6.531ProLeu: 6.531 ± 3.136
1.451ProMet: 1.451 ± 0.659
2.903ProAsn: 2.903 ± 0.985
5.806ProPro: 5.806 ± 2.4
1.451ProGln: 1.451 ± 1.745
2.177ProArg: 2.177 ± 1.239
2.903ProSer: 2.903 ± 1.352
5.08ProThr: 5.08 ± 1.777
8.708ProVal: 8.708 ± 2.003
0.726ProTrp: 0.726 ± 0.495
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.628GlnAla: 3.628 ± 1.117
0.726GlnCys: 0.726 ± 0.658
4.354GlnAsp: 4.354 ± 1.337
4.354GlnGlu: 4.354 ± 1.242
1.451GlnPhe: 1.451 ± 0.989
2.903GlnGly: 2.903 ± 1.572
1.451GlnHis: 1.451 ± 0.676
2.177GlnIle: 2.177 ± 1.317
0.726GlnLys: 0.726 ± 0.495
5.08GlnLeu: 5.08 ± 0.792
1.451GlnMet: 1.451 ± 0.917
0.726GlnAsn: 0.726 ± 0.495
1.451GlnPro: 1.451 ± 0.676
3.628GlnGln: 3.628 ± 1.891
5.806GlnArg: 5.806 ± 1.335
2.177GlnSer: 2.177 ± 2.092
5.806GlnThr: 5.806 ± 1.983
1.451GlnVal: 1.451 ± 1.0
0.726GlnTrp: 0.726 ± 0.697
1.451GlnTyr: 1.451 ± 0.773
0.0GlnXaa: 0.0 ± 0.0
Arg
5.806ArgAla: 5.806 ± 1.73
1.451ArgCys: 1.451 ± 0.814
2.903ArgAsp: 2.903 ± 0.758
2.903ArgGlu: 2.903 ± 1.572
1.451ArgPhe: 1.451 ± 0.659
5.08ArgGly: 5.08 ± 0.792
2.903ArgHis: 2.903 ± 1.866
2.903ArgIle: 2.903 ± 1.29
6.531ArgLys: 6.531 ± 3.272
8.708ArgLeu: 8.708 ± 2.222
2.177ArgMet: 2.177 ± 1.216
1.451ArgAsn: 1.451 ± 1.084
5.806ArgPro: 5.806 ± 1.607
2.177ArgGln: 2.177 ± 0.91
7.983ArgArg: 7.983 ± 2.936
6.531ArgSer: 6.531 ± 2.559
5.08ArgThr: 5.08 ± 0.594
5.08ArgVal: 5.08 ± 3.045
0.726ArgTrp: 0.726 ± 0.844
6.531ArgTyr: 6.531 ± 2.441
0.0ArgXaa: 0.0 ± 0.0
Ser
7.983SerAla: 7.983 ± 0.635
2.903SerCys: 2.903 ± 1.866
2.903SerAsp: 2.903 ± 0.758
2.903SerGlu: 2.903 ± 1.572
2.903SerPhe: 2.903 ± 1.735
5.806SerGly: 5.806 ± 1.449
0.0SerHis: 0.0 ± 0.0
1.451SerIle: 1.451 ± 0.676
2.903SerLys: 2.903 ± 0.758
7.257SerLeu: 7.257 ± 1.572
1.451SerMet: 1.451 ± 0.773
3.628SerAsn: 3.628 ± 2.473
5.08SerPro: 5.08 ± 1.268
4.354SerGln: 4.354 ± 1.223
6.531SerArg: 6.531 ± 1.639
5.806SerSer: 5.806 ± 1.335
5.806SerThr: 5.806 ± 1.526
3.628SerVal: 3.628 ± 1.857
0.0SerTrp: 0.0 ± 0.0
2.177SerTyr: 2.177 ± 0.854
0.0SerXaa: 0.0 ± 0.0
Thr
4.354ThrAla: 4.354 ± 1.704
0.0ThrCys: 0.0 ± 0.0
2.177ThrAsp: 2.177 ± 1.003
2.903ThrGlu: 2.903 ± 0.736
5.806ThrPhe: 5.806 ± 1.236
2.177ThrGly: 2.177 ± 1.484
0.726ThrHis: 0.726 ± 0.844
5.806ThrIle: 5.806 ± 2.537
2.903ThrLys: 2.903 ± 0.631
2.177ThrLeu: 2.177 ± 0.837
0.0ThrMet: 0.0 ± 0.0
2.903ThrAsn: 2.903 ± 1.341
3.628ThrPro: 3.628 ± 2.804
2.903ThrGln: 2.903 ± 1.961
2.903ThrArg: 2.903 ± 1.931
7.983ThrSer: 7.983 ± 2.662
2.903ThrThr: 2.903 ± 1.405
2.177ThrVal: 2.177 ± 1.484
0.726ThrTrp: 0.726 ± 0.658
2.903ThrTyr: 2.903 ± 0.758
0.0ThrXaa: 0.0 ± 0.0
Val
3.628ValAla: 3.628 ± 0.531
0.0ValCys: 0.0 ± 0.0
2.177ValAsp: 2.177 ± 1.069
0.726ValGlu: 0.726 ± 0.697
0.0ValPhe: 0.0 ± 0.0
3.628ValGly: 3.628 ± 1.244
1.451ValHis: 1.451 ± 0.676
0.0ValIle: 0.0 ± 0.0
2.177ValLys: 2.177 ± 1.484
7.983ValLeu: 7.983 ± 2.142
2.903ValMet: 2.903 ± 0.892
2.177ValAsn: 2.177 ± 1.035
5.806ValPro: 5.806 ± 1.693
2.903ValGln: 2.903 ± 1.061
6.531ValArg: 6.531 ± 2.968
4.354ValSer: 4.354 ± 1.55
2.903ValThr: 2.903 ± 1.547
0.0ValVal: 0.0 ± 0.0
0.0ValTrp: 0.0 ± 0.0
0.726ValTyr: 0.726 ± 0.658
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.726TrpAsp: 0.726 ± 0.495
2.177TrpGlu: 2.177 ± 1.484
0.726TrpPhe: 0.726 ± 0.697
0.726TrpGly: 0.726 ± 0.658
0.726TrpHis: 0.726 ± 0.495
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.726TrpLeu: 0.726 ± 0.658
0.726TrpMet: 0.726 ± 0.658
2.177TrpAsn: 2.177 ± 0.91
0.726TrpPro: 0.726 ± 0.872
0.726TrpGln: 0.726 ± 0.495
0.726TrpArg: 0.726 ± 0.697
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.726TrpTrp: 0.726 ± 0.872
1.451TrpTyr: 1.451 ± 0.989
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.354TyrAla: 4.354 ± 1.11
0.0TyrCys: 0.0 ± 0.0
0.726TyrAsp: 0.726 ± 0.495
1.451TyrGlu: 1.451 ± 1.038
0.726TyrPhe: 0.726 ± 0.495
2.177TyrGly: 2.177 ± 1.239
1.451TyrHis: 1.451 ± 1.0
3.628TyrIle: 3.628 ± 1.862
1.451TyrLys: 1.451 ± 0.989
6.531TyrLeu: 6.531 ± 1.485
1.451TyrMet: 1.451 ± 0.686
2.903TyrAsn: 2.903 ± 1.061
0.726TyrPro: 0.726 ± 0.658
1.451TyrGln: 1.451 ± 0.659
2.903TyrArg: 2.903 ± 1.341
0.726TyrSer: 0.726 ± 0.658
2.177TyrThr: 2.177 ± 0.57
2.903TyrVal: 2.903 ± 0.736
0.726TyrTrp: 0.726 ± 0.495
1.451TyrTyr: 1.451 ± 1.005
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1379 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
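As an illustration of what a protein-level bootstrap can look like, the sketch below resamples whole proteins with replacement, recomputes the frequency of one dipeptide for each replicate, and reports the spread. The page does not state the exact procedure or which statistic is reported after the ± sign, so the use of the sample standard deviation, the function names (`pair_per_mille`, `bootstrap_error`) and the fixed random seed are all assumptions.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code, e.g. "AL") in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of a dipeptide frequency across bootstrap replicates.

    Each replicate draws len(proteins) proteins with replacement, so the
    resampling unit is the whole protein, not the individual dipeptide.
    The standard deviation used here is an assumed choice of statistic.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_per_mille(sample, pair))
    return statistics.stdev(values)


if __name__ == "__main__":
    toy = ["MKAAL", "MAAGR", "MLLSA"]  # toy sequences, not real data
    print(round(pair_per_mille(toy, "AA"), 3), "±", round(bootstrap_error(toy, "AA"), 3))
```

A dipeptide that never occurs in any protein yields 0.0 in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.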

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski