Amino acid dipeptide frequency for Apis mellifera associated microvirus 26

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
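The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names, the toy sequences, and the choice to count overlapping pairs within each protein separately are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes; anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1000 / 400 = 2.5 per mille (i.e. 0.25%).
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # would be marked red in the table
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # would be marked blue in the table
    return "expected"


# Hypothetical toy proteome, for illustration only.
toy = ["MKKLA", "AKKT"]
freqs = dipeptide_permille(toy)
for dp in sorted(freqs):
    print(dp, round(freqs[dp], 3), classify(freqs[dp]))
```

Dipeptides that never occur are simply absent from the returned dictionary; looking them up with `freqs.get(dp, 0.0)` gives the 0.0 entries seen in the table.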

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.536 ± 1.786
AlaCys: 0.0 ± 0.0
AlaAsp: 5.084 ± 1.724
AlaGlu: 5.084 ± 1.478
AlaPhe: 2.179 ± 1.002
AlaGly: 4.357 ± 2.004
AlaHis: 1.452 ± 1.247
AlaIle: 2.905 ± 2.15
AlaLys: 7.988 ± 2.548
AlaLeu: 7.262 ± 1.76
AlaMet: 2.179 ± 0.965
AlaAsn: 2.905 ± 1.389
AlaPro: 4.357 ± 0.943
AlaGln: 5.084 ± 2.793
AlaArg: 5.084 ± 1.749
AlaSer: 7.988 ± 2.688
AlaThr: 3.631 ± 1.34
AlaVal: 3.631 ± 1.319
AlaTrp: 0.726 ± 0.877
AlaTyr: 3.631 ± 1.831
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.726 ± 0.896
CysAsp: 0.0 ± 0.0
CysGlu: 0.726 ± 0.896
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.726 ± 0.624
CysLys: 1.452 ± 1.313
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.726 ± 0.896
CysVal: 0.726 ± 0.896
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.631 ± 1.332
AspCys: 0.0 ± 0.0
AspAsp: 2.179 ± 0.921
AspGlu: 2.179 ± 1.116
AspPhe: 3.631 ± 1.774
AspGly: 1.452 ± 0.609
AspHis: 0.726 ± 0.498
AspIle: 1.452 ± 1.06
AspLys: 0.726 ± 0.624
AspLeu: 3.631 ± 1.238
AspMet: 2.179 ± 1.062
AspAsn: 2.905 ± 1.396
AspPro: 4.357 ± 2.38
AspGln: 1.452 ± 0.609
AspArg: 2.179 ± 0.896
AspSer: 3.631 ± 0.999
AspThr: 3.631 ± 1.486
AspVal: 2.905 ± 0.99
AspTrp: 0.726 ± 0.624
AspTyr: 3.631 ± 1.774
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.536 ± 1.637
GluCys: 0.0 ± 0.0
GluAsp: 2.179 ± 1.081
GluGlu: 3.631 ± 1.608
GluPhe: 2.905 ± 0.818
GluGly: 0.726 ± 0.802
GluHis: 2.179 ± 0.928
GluIle: 5.81 ± 1.396
GluLys: 7.988 ± 3.311
GluLeu: 2.179 ± 0.921
GluMet: 0.726 ± 1.584
GluAsn: 2.179 ± 1.606
GluPro: 2.905 ± 1.693
GluGln: 2.179 ± 0.985
GluArg: 3.631 ± 0.999
GluSer: 2.905 ± 1.872
GluThr: 5.81 ± 2.518
GluVal: 1.452 ± 1.435
GluTrp: 0.726 ± 0.498
GluTyr: 3.631 ± 2.353
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.179 ± 1.116
PheCys: 0.726 ± 1.153
PheAsp: 2.179 ± 1.495
PheGlu: 1.452 ± 1.429
PhePhe: 4.357 ± 2.296
PheGly: 2.905 ± 1.263
PheHis: 2.179 ± 1.402
PheIle: 1.452 ± 0.997
PheLys: 1.452 ± 1.06
PheLeu: 2.179 ± 1.062
PheMet: 0.726 ± 0.457
PheAsn: 3.631 ± 1.815
PhePro: 0.726 ± 0.877
PheGln: 2.179 ± 1.002
PheArg: 4.357 ± 1.635
PheSer: 1.452 ± 0.967
PheThr: 2.179 ± 0.921
PheVal: 2.905 ± 1.64
PheTrp: 1.452 ± 0.997
PheTyr: 1.452 ± 0.609
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.357 ± 2.411
GlyCys: 0.0 ± 0.0
GlyAsp: 2.905 ± 1.217
GlyGlu: 4.357 ± 2.26
GlyPhe: 0.726 ± 0.498
GlyGly: 4.357 ± 2.274
GlyHis: 0.726 ± 0.624
GlyIle: 4.357 ± 1.17
GlyLys: 1.452 ± 1.429
GlyLeu: 6.536 ± 2.022
GlyMet: 0.0 ± 0.0
GlyAsn: 3.631 ± 1.152
GlyPro: 2.905 ± 1.64
GlyGln: 1.452 ± 0.997
GlyArg: 0.726 ± 0.498
GlySer: 5.084 ± 1.318
GlyThr: 5.084 ± 1.49
GlyVal: 5.084 ± 1.395
GlyTrp: 0.726 ± 0.498
GlyTyr: 0.726 ± 0.498
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.905 ± 0.989
HisCys: 0.0 ± 0.0
HisAsp: 0.726 ± 0.498
HisGlu: 2.179 ± 1.664
HisPhe: 0.726 ± 0.498
HisGly: 2.179 ± 1.081
HisHis: 0.0 ± 0.0
HisIle: 0.726 ± 0.624
HisLys: 0.726 ± 0.624
HisLeu: 2.179 ± 1.116
HisMet: 0.726 ± 0.498
HisAsn: 1.452 ± 1.06
HisPro: 2.905 ± 1.102
HisGln: 1.452 ± 0.918
HisArg: 2.179 ± 1.019
HisSer: 2.179 ± 1.612
HisThr: 2.179 ± 1.402
HisVal: 0.0 ± 0.0
HisTrp: 0.726 ± 0.624
HisTyr: 1.452 ± 1.247
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.905 ± 1.516
IleCys: 0.0 ± 0.0
IleAsp: 0.726 ± 0.498
IleGlu: 3.631 ± 1.335
IlePhe: 0.0 ± 0.0
IleGly: 2.905 ± 1.389
IleHis: 0.726 ± 0.624
IleIle: 2.905 ± 2.108
IleLys: 0.0 ± 0.0
IleLeu: 5.084 ± 1.453
IleMet: 0.0 ± 0.0
IleAsn: 3.631 ± 1.831
IlePro: 5.81 ± 1.755
IleGln: 2.179 ± 1.629
IleArg: 2.905 ± 1.151
IleSer: 2.179 ± 1.062
IleThr: 3.631 ± 1.898
IleVal: 1.452 ± 0.609
IleTrp: 1.452 ± 0.758
IleTyr: 2.905 ± 1.389
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.084 ± 3.622
LysCys: 0.0 ± 0.0
LysAsp: 2.179 ± 0.896
LysGlu: 4.357 ± 2.377
LysPhe: 3.631 ± 1.322
LysGly: 3.631 ± 1.524
LysHis: 3.631 ± 1.055
LysIle: 3.631 ± 1.328
LysLys: 8.715 ± 3.532
LysLeu: 7.988 ± 3.253
LysMet: 0.0 ± 0.0
LysAsn: 5.81 ± 4.581
LysPro: 3.631 ± 1.568
LysGln: 3.631 ± 1.743
LysArg: 5.084 ± 2.09
LysSer: 5.084 ± 1.327
LysThr: 8.715 ± 5.409
LysVal: 1.452 ± 0.758
LysTrp: 2.179 ± 1.229
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.262 ± 2.236
LeuCys: 0.726 ± 0.624
LeuAsp: 4.357 ± 1.826
LeuGlu: 4.357 ± 1.322
LeuPhe: 3.631 ± 1.597
LeuGly: 5.81 ± 1.236
LeuHis: 0.726 ± 0.498
LeuIle: 3.631 ± 1.018
LeuLys: 4.357 ± 2.257
LeuLeu: 2.905 ± 1.173
LeuMet: 4.357 ± 1.342
LeuAsn: 3.631 ± 1.238
LeuPro: 4.357 ± 1.354
LeuGln: 7.262 ± 1.771
LeuArg: 2.179 ± 0.994
LeuSer: 1.452 ± 0.997
LeuThr: 2.905 ± 1.349
LeuVal: 5.81 ± 1.768
LeuTrp: 1.452 ± 1.247
LeuTyr: 1.452 ± 1.313
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.726 ± 0.802
MetCys: 1.452 ± 1.791
MetAsp: 0.726 ± 0.498
MetGlu: 1.452 ± 0.758
MetPhe: 0.726 ± 0.859
MetGly: 2.179 ± 1.062
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 2.905 ± 1.798
MetLeu: 1.452 ± 1.313
MetMet: 0.726 ± 0.498
MetAsn: 1.452 ± 0.967
MetPro: 0.726 ± 0.498
MetGln: 1.452 ± 0.997
MetArg: 1.452 ± 0.997
MetSer: 4.357 ± 2.937
MetThr: 0.726 ± 0.877
MetVal: 1.452 ± 1.06
MetTrp: 0.726 ± 0.498
MetTyr: 0.726 ± 0.498
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.084 ± 1.453
AsnCys: 0.0 ± 0.0
AsnAsp: 3.631 ± 1.725
AsnGlu: 2.905 ± 1.236
AsnPhe: 0.726 ± 0.498
AsnGly: 3.631 ± 0.729
AsnHis: 1.452 ± 1.223
AsnIle: 2.905 ± 2.254
AsnLys: 2.905 ± 2.448
AsnLeu: 7.262 ± 1.245
AsnMet: 1.452 ± 0.609
AsnAsn: 1.452 ± 1.429
AsnPro: 4.357 ± 1.676
AsnGln: 2.905 ± 1.396
AsnArg: 2.905 ± 0.818
AsnSer: 3.631 ± 1.641
AsnThr: 1.452 ± 1.054
AsnVal: 1.452 ± 0.758
AsnTrp: 0.726 ± 0.498
AsnTyr: 3.631 ± 1.701
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.262 ± 4.135
ProCys: 1.452 ± 1.06
ProAsp: 4.357 ± 1.68
ProGlu: 3.631 ± 2.214
ProPhe: 3.631 ± 1.997
ProGly: 3.631 ± 2.492
ProHis: 2.905 ± 1.133
ProIle: 2.905 ± 1.35
ProLys: 5.81 ± 2.216
ProLeu: 3.631 ± 1.018
ProMet: 2.905 ± 1.877
ProAsn: 1.452 ± 0.827
ProPro: 2.179 ± 1.079
ProGln: 5.084 ± 1.831
ProArg: 4.357 ± 1.911
ProSer: 7.262 ± 2.962
ProThr: 0.726 ± 0.877
ProVal: 3.631 ± 1.872
ProTrp: 0.726 ± 0.498
ProTyr: 1.452 ± 0.609
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.452 ± 0.609
GlnCys: 0.0 ± 0.0
GlnAsp: 0.726 ± 0.498
GlnGlu: 4.357 ± 1.17
GlnPhe: 1.452 ± 0.997
GlnGly: 2.905 ± 1.061
GlnHis: 2.905 ± 1.347
GlnIle: 2.179 ± 1.478
GlnLys: 5.084 ± 2.544
GlnLeu: 1.452 ± 0.609
GlnMet: 3.631 ± 2.209
GlnAsn: 4.357 ± 1.43
GlnPro: 2.905 ± 1.263
GlnGln: 1.452 ± 1.223
GlnArg: 2.179 ± 1.032
GlnSer: 2.905 ± 0.818
GlnThr: 2.179 ± 0.921
GlnVal: 1.452 ± 1.019
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.726 ± 1.153
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.357 ± 1.888
ArgCys: 0.0 ± 0.0
ArgAsp: 5.084 ± 2.519
ArgGlu: 3.631 ± 0.987
ArgPhe: 2.179 ± 0.896
ArgGly: 3.631 ± 1.701
ArgHis: 0.726 ± 0.498
ArgIle: 0.726 ± 0.877
ArgLys: 2.905 ± 1.103
ArgLeu: 3.631 ± 1.48
ArgMet: 1.452 ± 0.758
ArgAsn: 2.905 ± 2.309
ArgPro: 4.357 ± 1.42
ArgGln: 0.0 ± 0.0
ArgArg: 2.905 ± 1.244
ArgSer: 4.357 ± 1.388
ArgThr: 3.631 ± 2.007
ArgVal: 5.084 ± 1.002
ArgTrp: 0.726 ± 0.498
ArgTyr: 2.905 ± 1.35
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.988 ± 3.222
SerCys: 0.0 ± 0.0
SerAsp: 0.0 ± 0.0
SerGlu: 5.084 ± 1.187
SerPhe: 5.084 ± 3.008
SerGly: 3.631 ± 1.303
SerHis: 1.452 ± 0.758
SerIle: 0.726 ± 0.498
SerLys: 9.441 ± 3.654
SerLeu: 4.357 ± 1.888
SerMet: 0.0 ± 0.0
SerAsn: 3.631 ± 2.171
SerPro: 7.262 ± 3.579
SerGln: 1.452 ± 0.758
SerArg: 2.905 ± 0.819
SerSer: 6.536 ± 2.674
SerThr: 5.084 ± 2.054
SerVal: 2.905 ± 0.819
SerTrp: 2.179 ± 1.143
SerTyr: 1.452 ± 1.753
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.631 ± 1.522
ThrCys: 0.0 ± 0.0
ThrAsp: 3.631 ± 2.468
ThrGlu: 3.631 ± 1.238
ThrPhe: 2.179 ± 1.039
ThrGly: 3.631 ± 1.048
ThrHis: 0.726 ± 1.153
ThrIle: 3.631 ± 1.815
ThrLys: 5.084 ± 2.167
ThrLeu: 5.81 ± 3.598
ThrMet: 0.0 ± 0.0
ThrAsn: 4.357 ± 1.352
ThrPro: 6.536 ± 2.769
ThrGln: 2.179 ± 1.039
ThrArg: 3.631 ± 1.859
ThrSer: 7.262 ± 1.344
ThrThr: 1.452 ± 0.609
ThrVal: 2.905 ± 0.989
ThrTrp: 0.726 ± 0.877
ThrTyr: 1.452 ± 0.997
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.357 ± 2.229
ValCys: 0.0 ± 0.0
ValAsp: 2.905 ± 0.818
ValGlu: 1.452 ± 0.827
ValPhe: 0.726 ± 0.624
ValGly: 0.726 ± 0.498
ValHis: 1.452 ± 0.967
ValIle: 0.726 ± 0.498
ValLys: 5.084 ± 1.48
ValLeu: 1.452 ± 0.997
ValMet: 1.452 ± 1.375
ValAsn: 2.179 ± 1.211
ValPro: 5.81 ± 3.226
ValGln: 1.452 ± 1.06
ValArg: 4.357 ± 2.158
ValSer: 0.726 ± 0.802
ValThr: 7.262 ± 2.371
ValVal: 3.631 ± 1.837
ValTrp: 1.452 ± 0.997
ValTyr: 2.179 ± 1.039
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.905 ± 0.819
TrpCys: 0.0 ± 0.0
TrpAsp: 1.452 ± 0.997
TrpGlu: 1.452 ± 0.609
TrpPhe: 1.452 ± 0.997
TrpGly: 0.0 ± 0.0
TrpHis: 2.905 ± 0.994
TrpIle: 0.726 ± 0.624
TrpLys: 0.726 ± 0.624
TrpLeu: 1.452 ± 1.28
TrpMet: 0.726 ± 0.498
TrpAsn: 1.452 ± 0.758
TrpPro: 0.726 ± 0.498
TrpGln: 0.0 ± 0.0
TrpArg: 0.726 ± 0.498
TrpSer: 0.726 ± 0.859
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.726 ± 0.877
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.905 ± 0.99
TyrCys: 0.0 ± 0.0
TyrAsp: 2.179 ± 2.118
TyrGlu: 1.452 ± 1.115
TyrPhe: 2.179 ± 1.116
TyrGly: 2.905 ± 0.818
TyrHis: 0.726 ± 0.624
TyrIle: 2.905 ± 1.217
TyrLys: 3.631 ± 1.261
TyrLeu: 2.179 ± 1.499
TyrMet: 1.452 ± 0.609
TyrAsn: 1.452 ± 0.997
TyrPro: 2.179 ± 1.116
TyrGln: 1.452 ± 0.997
TyrArg: 1.452 ± 0.609
TyrSer: 1.452 ± 0.809
TyrThr: 1.452 ± 0.997
TyrVal: 1.452 ± 0.609
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.452 ± 1.313
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1378 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
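As a rough illustration of what a protein-level bootstrap looks like, the sketch below resamples whole proteins with replacement 100 times and reports the spread of the resulting frequency estimates. The function names and toy sequences are hypothetical, and using the standard deviation of the bootstrap estimates as the error is an assumption; the page does not state which spread measure is used.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency estimate under protein-level resampling.

    Each round draws len(proteins) proteins with replacement, so whole
    proteins (not individual residues) are the resampling unit.
    Assumption: the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, for illustration only.
toy = ["MKKLA", "AKKT", "MSSKKR"]
print(dipeptide_permille(toy, "KK"), "±", bootstrap_error(toy, "KK"))
```

With only a handful of proteins, as here, each resample can differ substantially from the original set, which is consistent with errors in the table that are often comparable in size to the values themselves.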

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski