Amino acid dipeptide frequency for Apis mellifera associated microvirus 37

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all 20 standard amino acids were equally frequent, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
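To make the counting concrete, here is a minimal Python sketch of how such dipeptide frequencies could be computed. It is not the database's own code: the function name `dipeptide_per_mille`, the one-letter alphabet, the toy sequences, and the choice to count overlapping pairs only within each protein are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}

# Hypothetical toy sequences, not the microvirus proteome.
toy = ["MAAGL", "MKLAA"]
freqs = dipeptide_per_mille(toy)
uniform = 1000.0 / 400  # expected value for 400 equally likely dipeptides
print(len(freqs), uniform, freqs["AA"])  # 441 2.5 250.0
```

In the toy input, "AA" occurs in 2 of the 8 overlapping pairs, which gives 250‰; a real proteome yields far smaller values, as in the table.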

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.598 ± 3.091
AlaCys: 1.466 ± 0.906
AlaAsp: 5.865 ± 2.967
AlaGlu: 4.399 ± 2.46
AlaPhe: 0.0 ± 0.0
AlaGly: 7.331 ± 4.374
AlaHis: 2.199 ± 0.545
AlaIle: 2.199 ± 1.631
AlaLys: 3.666 ± 0.664
AlaLeu: 8.065 ± 2.732
AlaMet: 2.199 ± 0.545
AlaAsn: 0.733 ± 0.483
AlaPro: 6.598 ± 3.701
AlaGln: 2.933 ± 1.485
AlaArg: 5.132 ± 4.081
AlaSer: 4.399 ± 2.556
AlaThr: 4.399 ± 0.882
AlaVal: 2.933 ± 0.99
AlaTrp: 1.466 ± 0.752
AlaTyr: 1.466 ± 0.834
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.733 ± 0.903
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.733 ± 0.644
CysPhe: 0.733 ± 0.483
CysGly: 1.466 ± 0.634
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.733 ± 0.644
CysMet: 0.733 ± 1.133
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.466 ± 1.289
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.399 ± 2.46
AspCys: 0.0 ± 0.0
AspAsp: 2.933 ± 2.315
AspGlu: 4.399 ± 1.09
AspPhe: 2.933 ± 1.017
AspGly: 3.666 ± 1.776
AspHis: 0.733 ± 0.483
AspIle: 2.199 ± 0.924
AspLys: 2.199 ± 0.88
AspLeu: 5.132 ± 2.367
AspMet: 3.666 ± 1.3
AspAsn: 0.733 ± 0.705
AspPro: 3.666 ± 2.222
AspGln: 1.466 ± 0.69
AspArg: 5.132 ± 1.181
AspSer: 1.466 ± 0.634
AspThr: 4.399 ± 1.593
AspVal: 2.933 ± 0.703
AspTrp: 1.466 ± 0.634
AspTyr: 4.399 ± 1.901
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.199 ± 0.88
GluCys: 0.0 ± 0.0
GluAsp: 2.199 ± 1.535
GluGlu: 2.199 ± 0.79
GluPhe: 3.666 ± 2.191
GluGly: 2.933 ± 1.331
GluHis: 1.466 ± 0.965
GluIle: 0.733 ± 0.789
GluLys: 5.865 ± 3.444
GluLeu: 3.666 ± 2.679
GluMet: 0.733 ± 0.644
GluAsn: 4.399 ± 0.905
GluPro: 2.199 ± 0.924
GluGln: 0.0 ± 0.0
GluArg: 2.199 ± 1.61
GluSer: 2.933 ± 0.703
GluThr: 1.466 ± 0.965
GluVal: 4.399 ± 1.196
GluTrp: 0.733 ± 0.483
GluTyr: 3.666 ± 1.195
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.399 ± 1.655
PheCys: 0.0 ± 0.0
PheAsp: 2.933 ± 1.362
PheGlu: 2.933 ± 1.64
PhePhe: 2.199 ± 0.924
PheGly: 2.199 ± 1.933
PheHis: 0.733 ± 0.644
PheIle: 1.466 ± 0.82
PheLys: 2.933 ± 1.267
PheLeu: 4.399 ± 1.09
PheMet: 0.733 ± 0.789
PheAsn: 2.199 ± 1.309
PhePro: 2.199 ± 0.88
PheGln: 0.733 ± 0.789
PheArg: 0.733 ± 0.483
PheSer: 2.199 ± 0.924
PheThr: 2.199 ± 1.305
PheVal: 3.666 ± 1.402
PheTrp: 0.0 ± 0.0
PheTyr: 2.933 ± 0.703
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.666 ± 1.747
GlyCys: 0.0 ± 0.0
GlyAsp: 2.933 ± 0.703
GlyGlu: 3.666 ± 1.076
GlyPhe: 4.399 ± 2.247
GlyGly: 6.598 ± 3.932
GlyHis: 3.666 ± 0.948
GlyIle: 8.065 ± 1.842
GlyLys: 1.466 ± 1.114
GlyLeu: 5.865 ± 1.948
GlyMet: 2.933 ± 0.703
GlyAsn: 2.199 ± 1.448
GlyPro: 0.0 ± 0.0
GlyGln: 3.666 ± 1.307
GlyArg: 5.132 ± 2.026
GlySer: 6.598 ± 2.943
GlyThr: 6.598 ± 1.635
GlyVal: 2.199 ± 0.924
GlyTrp: 0.733 ± 0.483
GlyTyr: 2.199 ± 0.79
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.199 ± 0.545
HisCys: 0.0 ± 0.0
HisAsp: 2.199 ± 1.21
HisGlu: 0.0 ± 0.0
HisPhe: 0.733 ± 0.483
HisGly: 1.466 ± 0.634
HisHis: 1.466 ± 0.69
HisIle: 0.0 ± 0.0
HisLys: 1.466 ± 0.82
HisLeu: 2.933 ± 1.353
HisMet: 0.733 ± 0.483
HisAsn: 0.733 ± 0.483
HisPro: 2.199 ± 1.631
HisGln: 1.466 ± 0.752
HisArg: 4.399 ± 1.131
HisSer: 0.733 ± 0.644
HisThr: 0.733 ± 0.644
HisVal: 2.199 ± 0.79
HisTrp: 0.733 ± 0.483
HisTyr: 0.733 ± 0.644
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.933 ± 2.124
IleCys: 0.0 ± 0.0
IleAsp: 1.466 ± 0.634
IleGlu: 2.199 ± 0.848
IlePhe: 2.199 ± 0.848
IleGly: 2.933 ± 0.974
IleHis: 2.199 ± 0.96
IleIle: 2.933 ± 0.703
IleLys: 3.666 ± 1.027
IleLeu: 0.733 ± 0.483
IleMet: 0.733 ± 0.705
IleAsn: 0.733 ± 0.483
IlePro: 5.865 ± 1.257
IleGln: 2.199 ± 0.545
IleArg: 2.933 ± 1.667
IleSer: 3.666 ± 1.076
IleThr: 5.865 ± 1.968
IleVal: 0.733 ± 0.705
IleTrp: 0.733 ± 0.483
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.933 ± 1.313
LysCys: 0.733 ± 0.903
LysAsp: 2.933 ± 0.703
LysGlu: 4.399 ± 1.579
LysPhe: 2.933 ± 1.588
LysGly: 5.132 ± 0.829
LysHis: 0.0 ± 0.0
LysIle: 4.399 ± 1.473
LysLys: 0.733 ± 0.644
LysLeu: 6.598 ± 2.334
LysMet: 1.466 ± 0.748
LysAsn: 1.466 ± 0.69
LysPro: 1.466 ± 0.634
LysGln: 2.199 ± 0.96
LysArg: 5.132 ± 1.378
LysSer: 4.399 ± 1.625
LysThr: 2.933 ± 1.473
LysVal: 2.933 ± 2.07
LysTrp: 0.0 ± 0.0
LysTyr: 1.466 ± 0.634
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.065 ± 3.261
LeuCys: 1.466 ± 0.634
LeuAsp: 2.199 ± 0.79
LeuGlu: 4.399 ± 1.051
LeuPhe: 2.933 ± 1.297
LeuGly: 9.531 ± 2.901
LeuHis: 3.666 ± 0.812
LeuIle: 5.132 ± 1.483
LeuLys: 5.132 ± 1.673
LeuLeu: 7.331 ± 1.827
LeuMet: 3.666 ± 1.198
LeuAsn: 3.666 ± 1.786
LeuPro: 6.598 ± 2.334
LeuGln: 5.132 ± 1.002
LeuArg: 6.598 ± 1.531
LeuSer: 3.666 ± 1.239
LeuThr: 2.199 ± 1.103
LeuVal: 7.331 ± 1.324
LeuTrp: 2.199 ± 1.21
LeuTyr: 2.199 ± 1.103
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.466 ± 1.215
MetCys: 0.0 ± 0.0
MetAsp: 3.666 ± 0.948
MetGlu: 0.733 ± 0.644
MetPhe: 0.0 ± 0.0
MetGly: 2.199 ± 0.96
MetHis: 1.466 ± 0.82
MetIle: 0.0 ± 0.0
MetLys: 0.733 ± 0.483
MetLeu: 0.733 ± 0.644
MetMet: 0.0 ± 0.0
MetAsn: 1.466 ± 0.69
MetPro: 0.733 ± 0.705
MetGln: 3.666 ± 0.581
MetArg: 1.466 ± 0.82
MetSer: 0.0 ± 0.0
MetThr: 1.466 ± 0.906
MetVal: 1.466 ± 0.961
MetTrp: 1.466 ± 0.965
MetTyr: 2.199 ± 0.924
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.666 ± 1.307
AsnCys: 0.0 ± 0.0
AsnAsp: 4.399 ± 1.848
AsnGlu: 1.466 ± 1.411
AsnPhe: 2.199 ± 0.88
AsnGly: 0.733 ± 0.483
AsnHis: 1.466 ± 0.69
AsnIle: 0.733 ± 0.483
AsnLys: 1.466 ± 0.965
AsnLeu: 1.466 ± 0.906
AsnMet: 0.733 ± 0.483
AsnAsn: 0.733 ± 0.483
AsnPro: 2.933 ± 1.485
AsnGln: 3.666 ± 0.948
AsnArg: 2.199 ± 0.88
AsnSer: 2.199 ± 1.309
AsnThr: 1.466 ± 0.965
AsnVal: 2.199 ± 0.924
AsnTrp: 1.466 ± 0.82
AsnTyr: 2.199 ± 0.924
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.598 ± 4.746
ProCys: 2.199 ± 1.183
ProAsp: 3.666 ± 1.475
ProGlu: 3.666 ± 0.581
ProPhe: 1.466 ± 0.965
ProGly: 0.733 ± 0.483
ProHis: 1.466 ± 1.289
ProIle: 2.199 ± 1.36
ProLys: 5.865 ± 1.995
ProLeu: 7.331 ± 2.423
ProMet: 0.733 ± 0.483
ProAsn: 2.199 ± 1.021
ProPro: 3.666 ± 2.421
ProGln: 2.933 ± 0.99
ProArg: 3.666 ± 2.054
ProSer: 9.531 ± 3.924
ProThr: 3.666 ± 1.188
ProVal: 7.331 ± 1.318
ProTrp: 0.733 ± 0.644
ProTyr: 1.466 ± 1.411
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.399 ± 0.882
GlnCys: 0.0 ± 0.0
GlnAsp: 4.399 ± 1.051
GlnGlu: 1.466 ± 0.69
GlnPhe: 2.933 ± 0.703
GlnGly: 2.933 ± 2.124
GlnHis: 0.733 ± 0.705
GlnIle: 1.466 ± 0.69
GlnLys: 1.466 ± 0.752
GlnLeu: 4.399 ± 1.682
GlnMet: 1.466 ± 0.906
GlnAsn: 3.666 ± 0.87
GlnPro: 2.933 ± 1.64
GlnGln: 0.733 ± 0.483
GlnArg: 6.598 ± 0.252
GlnSer: 1.466 ± 0.634
GlnThr: 1.466 ± 1.411
GlnVal: 2.933 ± 0.703
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.733 ± 0.644
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.399 ± 1.808
ArgCys: 0.0 ± 0.0
ArgAsp: 3.666 ± 1.076
ArgGlu: 2.199 ± 1.259
ArgPhe: 3.666 ± 1.199
ArgGly: 2.933 ± 1.711
ArgHis: 0.0 ± 0.0
ArgIle: 5.865 ± 2.543
ArgLys: 2.933 ± 2.578
ArgLeu: 8.798 ± 2.526
ArgMet: 0.733 ± 0.471
ArgAsn: 2.933 ± 0.945
ArgPro: 7.331 ± 2.937
ArgGln: 3.666 ± 1.722
ArgArg: 8.065 ± 5.95
ArgSer: 2.933 ± 1.071
ArgThr: 2.933 ± 1.425
ArgVal: 2.933 ± 1.669
ArgTrp: 0.733 ± 0.644
ArgTyr: 5.132 ± 1.842
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.132 ± 2.322
SerCys: 0.0 ± 0.0
SerAsp: 2.199 ± 0.545
SerGlu: 3.666 ± 1.509
SerPhe: 3.666 ± 0.664
SerGly: 7.331 ± 1.208
SerHis: 1.466 ± 0.752
SerIle: 3.666 ± 0.87
SerLys: 4.399 ± 2.961
SerLeu: 7.331 ± 1.658
SerMet: 2.199 ± 0.848
SerAsn: 2.199 ± 1.09
SerPro: 2.933 ± 0.935
SerGln: 2.199 ± 1.49
SerArg: 2.933 ± 2.761
SerSer: 5.865 ± 2.392
SerThr: 0.733 ± 0.483
SerVal: 4.399 ± 2.837
SerTrp: 1.466 ± 0.752
SerTyr: 2.199 ± 0.848
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.399 ± 1.147
ThrCys: 0.0 ± 0.0
ThrAsp: 3.666 ± 1.076
ThrGlu: 0.733 ± 0.789
ThrPhe: 0.0 ± 0.0
ThrGly: 2.933 ± 1.473
ThrHis: 2.199 ± 0.88
ThrIle: 1.466 ± 1.114
ThrLys: 5.865 ± 1.431
ThrLeu: 5.865 ± 1.31
ThrMet: 0.0 ± 0.0
ThrAsn: 1.466 ± 0.965
ThrPro: 4.399 ± 1.131
ThrGln: 2.199 ± 0.545
ThrArg: 2.933 ± 1.09
ThrSer: 3.666 ± 1.59
ThrThr: 2.933 ± 1.473
ThrVal: 5.132 ± 1.162
ThrTrp: 1.466 ± 0.965
ThrTyr: 1.466 ± 0.634
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.132 ± 1.847
ValCys: 1.466 ± 1.068
ValAsp: 3.666 ± 1.276
ValGlu: 1.466 ± 0.634
ValPhe: 1.466 ± 1.114
ValGly: 4.399 ± 1.513
ValHis: 0.0 ± 0.0
ValIle: 0.733 ± 0.483
ValLys: 2.199 ± 0.924
ValLeu: 5.865 ± 1.544
ValMet: 0.0 ± 0.0
ValAsn: 4.399 ± 1.194
ValPro: 8.798 ± 2.427
ValGln: 2.199 ± 1.631
ValArg: 2.933 ± 1.267
ValSer: 5.865 ± 1.802
ValThr: 4.399 ± 0.905
ValVal: 1.466 ± 0.82
ValTrp: 0.0 ± 0.0
ValTyr: 1.466 ± 0.634
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.733 ± 0.483
TrpPhe: 2.199 ± 1.448
TrpGly: 0.733 ± 0.483
TrpHis: 0.733 ± 0.483
TrpIle: 0.733 ± 0.644
TrpLys: 0.733 ± 0.483
TrpLeu: 1.466 ± 0.82
TrpMet: 0.0 ± 0.0
TrpAsn: 1.466 ± 0.752
TrpPro: 1.466 ± 0.752
TrpGln: 2.199 ± 0.545
TrpArg: 0.733 ± 0.644
TrpSer: 0.733 ± 0.644
TrpThr: 0.733 ± 0.483
TrpVal: 0.733 ± 0.644
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.733 ± 0.789
TyrCys: 0.0 ± 0.0
TyrAsp: 2.933 ± 1.156
TyrGlu: 2.199 ± 0.924
TyrPhe: 1.466 ± 1.289
TyrGly: 4.399 ± 1.62
TyrHis: 1.466 ± 1.068
TyrIle: 0.733 ± 0.903
TyrLys: 1.466 ± 0.69
TyrLeu: 4.399 ± 1.513
TyrMet: 0.733 ± 0.483
TyrAsn: 0.0 ± 0.0
TyrPro: 5.132 ± 1.237
TyrGln: 2.933 ± 1.797
TyrArg: 1.466 ± 0.634
TyrSer: 3.666 ± 1.753
TyrThr: 2.199 ± 1.183
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.733 ± 0.483
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1365 amino acids)
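
Since the red and blue marking does not survive in plain text, the table entries can be read back and compared with the uniform expectation programmatically. The following Python sketch is only an illustration: the names `ENTRY`, `parse_entries` and `classify` are hypothetical, and using 2.5‰ as the cut-off between over- and underrepresented is an assumption derived from the 0.25% figure above, not a documented rule of the database.

```python
import re

# Matches entries such as "AlaAla: 6.598 ± 3.091"; text before the
# dipeptide name (e.g. a repeated value) is ignored by search().
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")

def parse_entries(lines):
    """Return {(first, second): (frequency, error)}, both in per mille."""
    table = {}
    for line in lines:
        m = ENTRY.search(line)
        if m:
            table[(m.group(1), m.group(2))] = (float(m.group(3)), float(m.group(4)))
    return table

def classify(freq, expected=2.5):
    """Label a frequency relative to the assumed uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"

# Three entries copied from the table above; the "Ala" row label is skipped.
sample = ["Ala", "AlaAla: 6.598 ± 3.091", "AlaCys: 1.466 ± 0.906", "LeuGly: 9.531 ± 2.901"]
table = parse_entries(sample)
for (first, second), (freq, err) in table.items():
    print(first + second, freq, err, classify(freq))
```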

Note: The error has been estimated by bootstrapping (x100) at the protein level.
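
As a rough illustration of what bootstrapping at the protein level means, the Python sketch below resamples whole proteins with replacement 100 times and takes the spread of the resulting frequencies as the error. This is an assumed procedure, not the database's published code: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation are all choices made for this example.

```python
import random
import statistics
from collections import Counter

def dipeptide_freq(proteins, pair):
    """Frequency (per mille) of one dipeptide, e.g. "AA", over all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(resample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not the five microvirus proteins.
toy = ["MAAGL", "MKLAA", "MGGKL"]
print(dipeptide_freq(toy, "AA"), bootstrap_error(toy, "AA"))
```

A dipeptide that never occurs yields 0.0 in every resample and therefore an error of 0.0, which is consistent with the "0.0 ± 0.0" entries in the table.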

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski