Amino acid dipeptide frequency for Apis mellifera associated microvirus 48

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
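The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function name `dipeptide_permille`, the one-letter alphabet with `X` standing in for Xaa, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption for this sketch)

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the 20 standard ones to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence of length >= 2")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKRSAA", "AALLGX"])  # toy sequences, not real data
    for pair in ("AA", "KR", "WW"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

Multiplying a value returned by this sketch by 10^-3 gives the plain fraction, matching the unit convention of the table.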

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.403 ± 3.679
AlaCys: 0.745 ± 0.727
AlaAsp: 5.212 ± 0.614
AlaGlu: 7.446 ± 2.59
AlaPhe: 4.468 ± 0.792
AlaGly: 5.212 ± 1.882
AlaHis: 1.489 ± 1.455
AlaIle: 3.723 ± 0.45
AlaLys: 5.212 ± 1.226
AlaLeu: 5.212 ± 0.754
AlaMet: 4.468 ± 1.616
AlaAsn: 1.489 ± 0.634
AlaPro: 4.468 ± 1.732
AlaGln: 3.723 ± 0.83
AlaArg: 8.935 ± 1.991
AlaSer: 6.701 ± 0.978
AlaThr: 8.191 ± 1.482
AlaVal: 5.212 ± 1.226
AlaTrp: 1.489 ± 0.634
AlaTyr: 2.234 ± 0.455
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.745 ± 0.684
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.745 ± 0.727
CysGly: 0.745 ± 0.727
CysHis: 0.0 ± 0.0
CysIle: 0.745 ± 0.727
CysLys: 0.745 ± 0.727
CysLeu: 0.0 ± 0.0
CysMet: 0.745 ± 0.646
CysAsn: 0.0 ± 0.0
CysPro: 0.745 ± 0.481
CysGln: 0.0 ± 0.0
CysArg: 1.489 ± 0.653
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.745 ± 0.727
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.957 ± 1.179
AspCys: 0.0 ± 0.0
AspAsp: 5.212 ± 1.982
AspGlu: 1.489 ± 0.875
AspPhe: 2.978 ± 0.873
AspGly: 3.723 ± 1.919
AspHis: 0.0 ± 0.0
AspIle: 2.978 ± 0.571
AspLys: 1.489 ± 0.653
AspLeu: 2.978 ± 0.809
AspMet: 1.489 ± 0.739
AspAsn: 0.0 ± 0.0
AspPro: 3.723 ± 2.484
AspGln: 2.234 ± 0.83
AspArg: 2.978 ± 1.331
AspSer: 0.0 ± 0.0
AspThr: 5.212 ± 1.741
AspVal: 0.745 ± 0.727
AspTrp: 2.234 ± 0.887
AspTyr: 3.723 ± 1.123
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.701 ± 0.273
GluCys: 0.0 ± 0.0
GluAsp: 2.978 ± 1.416
GluGlu: 4.468 ± 2.493
GluPhe: 4.468 ± 1.208
GluGly: 1.489 ± 0.875
GluHis: 1.489 ± 0.962
GluIle: 0.745 ± 0.662
GluLys: 3.723 ± 1.157
GluLeu: 2.234 ± 0.73
GluMet: 0.745 ± 0.715
GluAsn: 1.489 ± 0.634
GluPro: 2.978 ± 1.156
GluGln: 2.234 ± 1.454
GluArg: 1.489 ± 1.324
GluSer: 2.978 ± 0.652
GluThr: 2.234 ± 0.91
GluVal: 5.957 ± 0.701
GluTrp: 0.0 ± 0.0
GluTyr: 2.978 ± 1.306
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.212 ± 0.836
PheCys: 0.0 ± 0.0
PheAsp: 1.489 ± 0.76
PheGlu: 2.978 ± 0.652
PhePhe: 1.489 ± 0.653
PheGly: 5.212 ± 1.903
PheHis: 0.0 ± 0.0
PheIle: 1.489 ± 0.962
PheLys: 0.0 ± 0.0
PheLeu: 1.489 ± 0.634
PheMet: 2.234 ± 0.843
PheAsn: 2.234 ± 0.91
PhePro: 1.489 ± 1.455
PheGln: 2.978 ± 0.571
PheArg: 3.723 ± 0.911
PheSer: 6.701 ± 1.873
PheThr: 2.978 ± 1.044
PheVal: 2.234 ± 1.296
PheTrp: 0.745 ± 0.481
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.701 ± 3.072
GlyCys: 0.745 ± 0.481
GlyAsp: 1.489 ± 0.653
GlyGlu: 2.234 ± 0.455
GlyPhe: 5.957 ± 1.864
GlyGly: 3.723 ± 1.751
GlyHis: 3.723 ± 1.447
GlyIle: 3.723 ± 0.911
GlyLys: 2.978 ± 1.393
GlyLeu: 5.212 ± 3.794
GlyMet: 0.745 ± 0.481
GlyAsn: 3.723 ± 0.911
GlyPro: 4.468 ± 1.616
GlyGln: 5.212 ± 1.389
GlyArg: 4.468 ± 2.248
GlySer: 5.212 ± 1.882
GlyThr: 3.723 ± 1.751
GlyVal: 6.701 ± 2.327
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.234 ± 0.887
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.745 ± 0.727
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.745 ± 0.715
HisPhe: 1.489 ± 0.653
HisGly: 3.723 ± 0.924
HisHis: 0.745 ± 0.481
HisIle: 1.489 ± 0.653
HisLys: 0.0 ± 0.0
HisLeu: 1.489 ± 1.455
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.489 ± 1.324
HisGln: 0.0 ± 0.0
HisArg: 2.978 ± 1.268
HisSer: 1.489 ± 0.653
HisThr: 1.489 ± 0.91
HisVal: 3.723 ± 0.911
HisTrp: 0.745 ± 0.481
HisTyr: 0.745 ± 0.727
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.745 ± 0.481
IleCys: 0.0 ± 0.0
IleAsp: 2.234 ± 1.225
IleGlu: 1.489 ± 0.653
IlePhe: 2.978 ± 0.809
IleGly: 5.212 ± 0.884
IleHis: 1.489 ± 0.653
IleIle: 1.489 ± 0.634
IleLys: 2.978 ± 0.652
IleLeu: 2.978 ± 1.369
IleMet: 0.745 ± 0.481
IleAsn: 2.234 ± 0.887
IlePro: 2.234 ± 1.332
IleGln: 0.745 ± 0.662
IleArg: 4.468 ± 1.98
IleSer: 2.234 ± 1.203
IleThr: 0.745 ± 0.481
IleVal: 0.745 ± 0.481
IleTrp: 0.0 ± 0.0
IleTyr: 0.745 ± 0.715
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.978 ± 1.369
LysCys: 1.489 ± 0.91
LysAsp: 2.234 ± 0.91
LysGlu: 2.234 ± 1.443
LysPhe: 1.489 ± 0.962
LysGly: 2.234 ± 0.83
LysHis: 0.0 ± 0.0
LysIle: 1.489 ± 0.962
LysLys: 3.723 ± 2.71
LysLeu: 2.234 ± 1.332
LysMet: 2.234 ± 1.332
LysAsn: 1.489 ± 1.455
LysPro: 0.745 ± 0.662
LysGln: 0.0 ± 0.0
LysArg: 8.935 ± 2.864
LysSer: 5.957 ± 2.662
LysThr: 0.745 ± 0.727
LysVal: 1.489 ± 0.91
LysTrp: 0.0 ± 0.0
LysTyr: 0.745 ± 0.662
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.424 ± 1.093
LeuCys: 1.489 ± 0.695
LeuAsp: 2.978 ± 1.501
LeuGlu: 3.723 ± 2.149
LeuPhe: 0.745 ± 0.481
LeuGly: 9.68 ± 2.647
LeuHis: 2.234 ± 1.203
LeuIle: 2.978 ± 1.369
LeuLys: 2.978 ± 2.007
LeuLeu: 9.68 ± 2.486
LeuMet: 1.489 ± 0.634
LeuAsn: 2.978 ± 0.948
LeuPro: 8.191 ± 1.035
LeuGln: 5.212 ± 2.768
LeuArg: 4.468 ± 2.832
LeuSer: 5.957 ± 1.535
LeuThr: 4.468 ± 0.751
LeuVal: 2.234 ± 1.443
LeuTrp: 0.745 ± 0.727
LeuTyr: 1.489 ± 0.653
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.234 ± 1.225
MetCys: 0.0 ± 0.0
MetAsp: 0.745 ± 0.481
MetGlu: 2.978 ± 0.873
MetPhe: 0.745 ± 0.684
MetGly: 2.234 ± 0.455
MetHis: 0.745 ± 0.481
MetIle: 1.489 ± 1.43
MetLys: 2.978 ± 1.268
MetLeu: 1.489 ± 0.875
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 2.234 ± 1.373
MetGln: 0.0 ± 0.0
MetArg: 2.234 ± 0.714
MetSer: 2.978 ± 1.044
MetThr: 2.234 ± 1.203
MetVal: 0.745 ± 0.481
MetTrp: 0.0 ± 0.0
MetTyr: 1.489 ± 0.962
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.745 ± 0.684
AsnCys: 0.745 ± 0.727
AsnAsp: 1.489 ± 0.634
AsnGlu: 1.489 ± 0.653
AsnPhe: 0.0 ± 0.0
AsnGly: 2.234 ± 1.022
AsnHis: 0.745 ± 0.727
AsnIle: 0.0 ± 0.0
AsnLys: 0.0 ± 0.0
AsnLeu: 4.468 ± 1.774
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 2.234 ± 0.455
AsnGln: 0.0 ± 0.0
AsnArg: 2.234 ± 0.91
AsnSer: 1.489 ± 0.962
AsnThr: 3.723 ± 1.751
AsnVal: 4.468 ± 1.208
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.745 ± 0.481
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.701 ± 2.917
ProCys: 0.745 ± 0.727
ProAsp: 5.212 ± 1.226
ProGlu: 3.723 ± 0.45
ProPhe: 3.723 ± 0.924
ProGly: 2.978 ± 0.929
ProHis: 2.234 ± 0.455
ProIle: 2.978 ± 1.035
ProLys: 0.0 ± 0.0
ProLeu: 5.957 ± 1.823
ProMet: 2.234 ± 0.867
ProAsn: 0.745 ± 0.715
ProPro: 4.468 ± 2.194
ProGln: 2.234 ± 0.73
ProArg: 4.468 ± 2.329
ProSer: 2.978 ± 0.929
ProThr: 6.701 ± 1.657
ProVal: 6.701 ± 1.964
ProTrp: 0.745 ± 0.481
ProTyr: 1.489 ± 0.995
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.468 ± 0.792
GlnCys: 0.745 ± 0.727
GlnAsp: 2.234 ± 0.91
GlnGlu: 2.234 ± 0.455
GlnPhe: 0.745 ± 0.727
GlnGly: 2.978 ± 0.873
GlnHis: 0.745 ± 0.481
GlnIle: 2.978 ± 1.316
GlnLys: 2.234 ± 1.203
GlnLeu: 0.745 ± 0.481
GlnMet: 0.745 ± 0.727
GlnAsn: 1.489 ± 0.739
GlnPro: 4.468 ± 1.365
GlnGln: 1.489 ± 0.653
GlnArg: 2.978 ± 0.547
GlnSer: 2.978 ± 1.156
GlnThr: 5.212 ± 1.268
GlnVal: 0.745 ± 0.662
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.489 ± 0.739
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 11.169 ± 2.263
ArgCys: 0.0 ± 0.0
ArgAsp: 3.723 ± 1.011
ArgGlu: 3.723 ± 1.397
ArgPhe: 3.723 ± 2.043
ArgGly: 2.978 ± 1.384
ArgHis: 1.489 ± 0.653
ArgIle: 1.489 ± 0.653
ArgLys: 3.723 ± 2.019
ArgLeu: 6.701 ± 1.221
ArgMet: 2.978 ± 1.315
ArgAsn: 1.489 ± 0.653
ArgPro: 3.723 ± 1.254
ArgGln: 4.468 ± 2.151
ArgArg: 6.701 ± 1.555
ArgSer: 10.424 ± 3.407
ArgThr: 2.978 ± 0.96
ArgVal: 4.468 ± 1.526
ArgTrp: 0.745 ± 0.662
ArgTyr: 5.212 ± 1.965
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.212 ± 2.48
SerCys: 0.745 ± 0.727
SerAsp: 4.468 ± 1.021
SerGlu: 2.978 ± 1.299
SerPhe: 3.723 ± 1.165
SerGly: 4.468 ± 1.456
SerHis: 2.234 ± 0.455
SerIle: 1.489 ± 0.971
SerLys: 4.468 ± 1.609
SerLeu: 9.68 ± 0.757
SerMet: 2.234 ± 1.454
SerAsn: 3.723 ± 0.911
SerPro: 4.468 ± 1.98
SerGln: 2.234 ± 0.996
SerArg: 8.935 ± 2.06
SerSer: 8.935 ± 2.499
SerThr: 4.468 ± 1.368
SerVal: 2.978 ± 0.547
SerTrp: 0.0 ± 0.0
SerTyr: 0.745 ± 0.662
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.446 ± 2.969
ThrCys: 0.0 ± 0.0
ThrAsp: 2.234 ± 1.225
ThrGlu: 2.978 ± 1.473
ThrPhe: 2.978 ± 1.035
ThrGly: 6.701 ± 1.295
ThrHis: 1.489 ± 1.455
ThrIle: 2.234 ± 1.443
ThrLys: 1.489 ± 0.695
ThrLeu: 8.191 ± 2.038
ThrMet: 3.723 ± 1.437
ThrAsn: 2.234 ± 1.443
ThrPro: 5.212 ± 2.17
ThrGln: 1.489 ± 0.76
ThrArg: 2.978 ± 1.715
ThrSer: 4.468 ± 1.924
ThrThr: 6.701 ± 4.328
ThrVal: 5.212 ± 1.812
ThrTrp: 1.489 ± 0.962
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.212 ± 2.075
ValCys: 0.0 ± 0.0
ValAsp: 2.978 ± 0.547
ValGlu: 0.745 ± 0.662
ValPhe: 1.489 ± 0.653
ValGly: 3.723 ± 0.567
ValHis: 0.745 ± 0.481
ValIle: 1.489 ± 0.971
ValLys: 2.978 ± 1.263
ValLeu: 6.701 ± 1.337
ValMet: 0.0 ± 0.0
ValAsn: 0.0 ± 0.0
ValPro: 6.701 ± 1.28
ValGln: 3.723 ± 1.806
ValArg: 4.468 ± 1.613
ValSer: 6.701 ± 1.824
ValThr: 5.212 ± 0.614
ValVal: 0.745 ± 0.481
ValTrp: 0.745 ± 0.727
ValTyr: 2.234 ± 0.455
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.745 ± 0.481
TrpGlu: 1.489 ± 0.962
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.745 ± 0.481
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.489 ± 1.324
TrpMet: 0.0 ± 0.0
TrpAsn: 1.489 ± 0.962
TrpPro: 2.234 ± 2.182
TrpGln: 1.489 ± 0.653
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.745 ± 0.481
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.234 ± 1.443
TyrCys: 0.0 ± 0.0
TyrAsp: 1.489 ± 0.653
TyrGlu: 2.234 ± 1.377
TyrPhe: 0.745 ± 0.481
TyrGly: 3.723 ± 0.83
TyrHis: 0.745 ± 0.727
TyrIle: 1.489 ± 0.653
TyrLys: 0.745 ± 0.727
TyrLeu: 5.212 ± 1.553
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 1.489 ± 0.695
TyrGln: 2.234 ± 1.022
TyrArg: 2.978 ± 1.268
TyrSer: 0.0 ± 0.0
TyrThr: 2.234 ± 0.714
TyrVal: 1.489 ± 0.739
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.745 ± 0.684
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1344 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
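A protein-level bootstrap of this kind can be sketched as follows. The exact procedure used by Proteome-pI is not described here, so everything in this block is an assumption: whole proteins are resampled with replacement, 100 replicates are drawn, and the reported error is taken to be the standard deviation of the replicate frequencies. The names `bootstrap_error` and `dipeptide_counts` are hypothetical, and the sequences are expected in one-letter code.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so the
    spread reflects protein-to-protein variation (assumed procedure).
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKRSAAL", "AALLGKR", "MSSRAKR"]  # toy sequences, not real data
    mean, err = bootstrap_error(toy, "KR")
    print(f"KR: {mean:.3f} ± {err:.3f}")
```

Under these assumptions, a dipeptide that occurs in none of the proteins has a frequency of 0 in every replicate, which is consistent with the "0.0 ± 0.0" entries in the table.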

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski