Amino acid dipepetide frequency for BtMr-AlphaCoV/SAX2011

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.445AlaAla: 6.445 ± 2.213
3.0AlaCys: 3.0 ± 0.6
4.112AlaAsp: 4.112 ± 0.813
2.334AlaGlu: 2.334 ± 0.448
5.112AlaPhe: 5.112 ± 1.18
4.112AlaGly: 4.112 ± 0.638
1.556AlaHis: 1.556 ± 0.353
3.334AlaIle: 3.334 ± 0.507
3.445AlaLys: 3.445 ± 0.783
6.89AlaLeu: 6.89 ± 0.768
2.111AlaMet: 2.111 ± 0.389
4.334AlaAsn: 4.334 ± 0.669
3.223AlaPro: 3.223 ± 1.424
2.556AlaGln: 2.556 ± 0.775
2.778AlaArg: 2.778 ± 0.567
4.334AlaSer: 4.334 ± 1.028
3.667AlaThr: 3.667 ± 0.601
6.556AlaVal: 6.556 ± 0.812
0.667AlaTrp: 0.667 ± 0.623
3.111AlaTyr: 3.111 ± 0.416
0.0AlaXaa: 0.0 ± 0.0
Cys
2.334CysAla: 2.334 ± 0.622
1.556CysCys: 1.556 ± 0.481
2.667CysAsp: 2.667 ± 0.45
1.0CysGlu: 1.0 ± 0.531
2.111CysPhe: 2.111 ± 0.481
2.334CysGly: 2.334 ± 0.694
0.222CysHis: 0.222 ± 0.319
1.667CysIle: 1.667 ± 0.738
1.667CysLys: 1.667 ± 0.448
2.222CysLeu: 2.222 ± 0.369
0.111CysMet: 0.111 ± 0.197
3.223CysAsn: 3.223 ± 0.835
1.0CysPro: 1.0 ± 0.338
0.778CysGln: 0.778 ± 0.351
0.889CysArg: 0.889 ± 0.283
2.778CysSer: 2.778 ± 0.743
2.0CysThr: 2.0 ± 0.55
3.334CysVal: 3.334 ± 0.745
0.667CysTrp: 0.667 ± 0.354
2.222CysTyr: 2.222 ± 0.947
0.0CysXaa: 0.0 ± 0.0
Asp
3.889AspAla: 3.889 ± 0.499
2.889AspCys: 2.889 ± 1.129
3.111AspAsp: 3.111 ± 0.45
2.111AspGlu: 2.111 ± 0.297
5.223AspPhe: 5.223 ± 0.991
5.223AspGly: 5.223 ± 0.733
1.222AspHis: 1.222 ± 0.45
2.889AspIle: 2.889 ± 0.431
1.667AspLys: 1.667 ± 0.568
3.445AspLeu: 3.445 ± 0.577
1.222AspMet: 1.222 ± 0.45
3.223AspAsn: 3.223 ± 0.295
1.667AspPro: 1.667 ± 0.356
1.445AspGln: 1.445 ± 0.661
2.111AspArg: 2.111 ± 0.602
3.0AspSer: 3.0 ± 0.742
2.667AspThr: 2.667 ± 0.45
5.112AspVal: 5.112 ± 0.995
1.0AspTrp: 1.0 ± 0.338
3.445AspTyr: 3.445 ± 0.833
0.0AspXaa: 0.0 ± 0.0
Glu
2.445GluAla: 2.445 ± 0.196
1.0GluCys: 1.0 ± 0.378
1.889GluAsp: 1.889 ± 0.601
1.333GluGlu: 1.333 ± 0.707
1.778GluPhe: 1.778 ± 0.567
3.223GluGly: 3.223 ± 0.874
1.333GluHis: 1.333 ± 0.532
2.111GluIle: 2.111 ± 0.365
2.0GluLys: 2.0 ± 0.754
2.667GluLeu: 2.667 ± 0.571
0.333GluMet: 0.333 ± 0.177
1.667GluAsn: 1.667 ± 0.68
1.333GluPro: 1.333 ± 0.331
1.333GluGln: 1.333 ± 0.198
2.334GluArg: 2.334 ± 0.797
2.667GluSer: 2.667 ± 0.888
2.111GluThr: 2.111 ± 0.523
3.334GluVal: 3.334 ± 0.654
0.667GluTrp: 0.667 ± 0.183
1.556GluTyr: 1.556 ± 0.396
0.0GluXaa: 0.0 ± 0.0
Phe
3.111PheAla: 3.111 ± 0.997
1.889PheCys: 1.889 ± 0.621
5.001PheAsp: 5.001 ± 1.211
2.111PheGlu: 2.111 ± 0.504
2.556PhePhe: 2.556 ± 0.592
4.334PheGly: 4.334 ± 0.57
0.556PheHis: 0.556 ± 0.307
2.222PheIle: 2.222 ± 0.369
3.111PheLys: 3.111 ± 0.795
3.667PheLeu: 3.667 ± 0.887
1.222PheMet: 1.222 ± 0.402
3.778PheAsn: 3.778 ± 0.459
1.0PhePro: 1.0 ± 0.214
0.667PheGln: 0.667 ± 0.183
1.333PheArg: 1.333 ± 0.291
4.667PheSer: 4.667 ± 0.913
3.445PheThr: 3.445 ± 0.919
6.445PheVal: 6.445 ± 1.172
1.445PheTrp: 1.445 ± 0.251
2.556PheTyr: 2.556 ± 0.775
0.0PheXaa: 0.0 ± 0.0
Gly
4.667GlyAla: 4.667 ± 0.897
2.0GlyCys: 2.0 ± 0.52
4.667GlyAsp: 4.667 ± 1.474
2.222GlyGlu: 2.222 ± 0.559
3.889GlyPhe: 3.889 ± 0.631
5.001GlyGly: 5.001 ± 1.756
0.556GlyHis: 0.556 ± 0.28
3.667GlyIle: 3.667 ± 0.878
3.778GlyLys: 3.778 ± 1.25
5.778GlyLeu: 5.778 ± 0.947
1.333GlyMet: 1.333 ± 0.512
3.889GlyAsn: 3.889 ± 1.188
1.667GlyPro: 1.667 ± 0.696
1.445GlyGln: 1.445 ± 0.297
1.889GlyArg: 1.889 ± 0.762
4.0GlySer: 4.0 ± 0.809
4.334GlyThr: 4.334 ± 1.433
7.445GlyVal: 7.445 ± 0.994
0.667GlyTrp: 0.667 ± 0.311
3.334GlyTyr: 3.334 ± 0.152
0.0GlyXaa: 0.0 ± 0.0
His
1.889HisAla: 1.889 ± 0.596
0.556HisCys: 0.556 ± 0.295
0.556HisAsp: 0.556 ± 0.144
0.778HisGlu: 0.778 ± 0.413
1.445HisPhe: 1.445 ± 0.637
1.222HisGly: 1.222 ± 0.45
0.222HisHis: 0.222 ± 0.31
1.333HisIle: 1.333 ± 0.332
0.778HisLys: 0.778 ± 0.413
1.667HisLeu: 1.667 ± 0.47
0.0HisMet: 0.0 ± 0.0
0.889HisAsn: 0.889 ± 0.283
0.444HisPro: 0.444 ± 0.236
0.667HisGln: 0.667 ± 0.456
0.667HisArg: 0.667 ± 0.275
0.778HisSer: 0.778 ± 0.643
1.667HisThr: 1.667 ± 0.26
2.334HisVal: 2.334 ± 0.537
0.0HisTrp: 0.0 ± 0.0
0.667HisTyr: 0.667 ± 0.282
0.0HisXaa: 0.0 ± 0.0
Ile
3.445IleAla: 3.445 ± 1.292
1.0IleCys: 1.0 ± 0.538
1.667IleAsp: 1.667 ± 0.481
2.222IleGlu: 2.222 ± 1.579
2.445IlePhe: 2.445 ± 0.505
2.778IleGly: 2.778 ± 0.615
0.222IleHis: 0.222 ± 0.118
2.556IleIle: 2.556 ± 0.724
2.222IleLys: 2.222 ± 0.326
3.889IleLeu: 3.889 ± 1.172
1.333IleMet: 1.333 ± 0.29
2.556IleAsn: 2.556 ± 0.442
2.334IlePro: 2.334 ± 1.148
1.445IleGln: 1.445 ± 0.522
1.556IleArg: 1.556 ± 0.469
3.778IleSer: 3.778 ± 0.638
4.556IleThr: 4.556 ± 1.09
5.556IleVal: 5.556 ± 0.868
0.556IleTrp: 0.556 ± 0.311
1.556IleTyr: 1.556 ± 0.683
0.0IleXaa: 0.0 ± 0.0
Lys
4.0LysAla: 4.0 ± 1.783
1.445LysCys: 1.445 ± 0.564
2.556LysAsp: 2.556 ± 0.971
2.445LysGlu: 2.445 ± 0.498
3.334LysPhe: 3.334 ± 0.955
2.556LysGly: 2.556 ± 1.046
2.222LysHis: 2.222 ± 0.559
2.111LysIle: 2.111 ± 0.708
2.334LysLys: 2.334 ± 1.09
5.778LysLeu: 5.778 ± 1.123
1.0LysMet: 1.0 ± 0.248
2.0LysAsn: 2.0 ± 0.844
2.222LysPro: 2.222 ± 0.765
2.445LysGln: 2.445 ± 0.408
1.667LysArg: 1.667 ± 0.71
3.778LysSer: 3.778 ± 1.101
2.445LysThr: 2.445 ± 0.632
4.334LysVal: 4.334 ± 0.778
0.444LysTrp: 0.444 ± 0.545
2.889LysTyr: 2.889 ± 0.458
0.0LysXaa: 0.0 ± 0.0
Leu
6.445LeuAla: 6.445 ± 1.187
4.0LeuCys: 4.0 ± 0.993
4.334LeuAsp: 4.334 ± 0.7
3.556LeuGlu: 3.556 ± 0.572
4.223LeuPhe: 4.223 ± 0.631
4.889LeuGly: 4.889 ± 0.52
1.889LeuHis: 1.889 ± 0.574
2.556LeuIle: 2.556 ± 2.117
6.001LeuLys: 6.001 ± 1.211
8.779LeuLeu: 8.779 ± 3.598
1.667LeuMet: 1.667 ± 0.266
5.223LeuAsn: 5.223 ± 0.924
3.556LeuPro: 3.556 ± 1.94
4.112LeuGln: 4.112 ± 0.323
2.445LeuArg: 2.445 ± 0.461
7.668LeuSer: 7.668 ± 1.68
4.889LeuThr: 4.889 ± 0.784
5.445LeuVal: 5.445 ± 1.547
1.445LeuTrp: 1.445 ± 1.083
4.556LeuTyr: 4.556 ± 0.663
0.0LeuXaa: 0.0 ± 0.0
Met
1.445MetAla: 1.445 ± 0.251
1.0MetCys: 1.0 ± 0.531
1.0MetAsp: 1.0 ± 0.54
0.778MetGlu: 0.778 ± 0.241
1.333MetPhe: 1.333 ± 0.367
1.0MetGly: 1.0 ± 0.338
0.667MetHis: 0.667 ± 0.354
1.0MetIle: 1.0 ± 0.338
0.778MetLys: 0.778 ± 0.32
2.778MetLeu: 2.778 ± 0.663
1.0MetMet: 1.0 ± 0.21
0.444MetAsn: 0.444 ± 0.28
0.556MetPro: 0.556 ± 0.295
0.556MetGln: 0.556 ± 0.367
1.111MetArg: 1.111 ± 0.245
1.222MetSer: 1.222 ± 0.296
1.222MetThr: 1.222 ± 0.247
1.445MetVal: 1.445 ± 0.298
0.444MetTrp: 0.444 ± 0.28
1.222MetTyr: 1.222 ± 0.295
0.0MetXaa: 0.0 ± 0.0
Asn
4.445AsnAla: 4.445 ± 0.417
2.0AsnCys: 2.0 ± 0.855
2.667AsnAsp: 2.667 ± 0.562
2.0AsnGlu: 2.0 ± 0.767
3.111AsnPhe: 3.111 ± 1.421
6.667AsnGly: 6.667 ± 0.758
0.667AsnHis: 0.667 ± 0.354
2.778AsnIle: 2.778 ± 0.793
2.556AsnLys: 2.556 ± 0.68
3.556AsnLeu: 3.556 ± 1.049
0.333AsnMet: 0.333 ± 0.177
4.0AsnAsn: 4.0 ± 1.826
1.556AsnPro: 1.556 ± 1.17
0.667AsnGln: 0.667 ± 0.21
1.778AsnArg: 1.778 ± 0.654
4.889AsnSer: 4.889 ± 1.923
4.0AsnThr: 4.0 ± 0.817
6.223AsnVal: 6.223 ± 0.873
0.444AsnTrp: 0.444 ± 0.876
2.445AsnTyr: 2.445 ± 0.417
0.0AsnXaa: 0.0 ± 0.0
Pro
3.445ProAla: 3.445 ± 1.014
0.889ProCys: 0.889 ± 0.331
1.445ProAsp: 1.445 ± 0.47
1.333ProGlu: 1.333 ± 0.29
1.667ProPhe: 1.667 ± 0.428
2.556ProGly: 2.556 ± 0.84
0.667ProHis: 0.667 ± 0.21
1.556ProIle: 1.556 ± 0.725
2.111ProLys: 2.111 ± 2.251
3.667ProLeu: 3.667 ± 0.485
0.444ProMet: 0.444 ± 0.122
1.0ProAsn: 1.0 ± 0.518
1.556ProPro: 1.556 ± 0.865
1.333ProGln: 1.333 ± 0.962
1.222ProArg: 1.222 ± 0.76
2.778ProSer: 2.778 ± 0.893
2.556ProThr: 2.556 ± 0.577
3.445ProVal: 3.445 ± 1.164
0.556ProTrp: 0.556 ± 0.144
1.667ProTyr: 1.667 ± 0.432
0.0ProXaa: 0.0 ± 0.0
Gln
2.445GlnAla: 2.445 ± 0.489
0.667GlnCys: 0.667 ± 0.29
1.0GlnAsp: 1.0 ± 0.338
0.778GlnGlu: 0.778 ± 0.231
0.444GlnPhe: 0.444 ± 0.302
1.333GlnGly: 1.333 ± 0.454
0.333GlnHis: 0.333 ± 0.126
1.333GlnIle: 1.333 ± 0.433
1.222GlnLys: 1.222 ± 0.325
4.667GlnLeu: 4.667 ± 1.187
1.222GlnMet: 1.222 ± 0.302
1.445GlnAsn: 1.445 ± 0.601
1.778GlnPro: 1.778 ± 1.01
1.556GlnGln: 1.556 ± 0.894
1.667GlnArg: 1.667 ± 0.664
2.222GlnSer: 2.222 ± 0.729
1.889GlnThr: 1.889 ± 0.628
2.222GlnVal: 2.222 ± 0.55
0.333GlnTrp: 0.333 ± 0.305
2.0GlnTyr: 2.0 ± 0.849
0.0GlnXaa: 0.0 ± 0.0
Arg
3.0ArgAla: 3.0 ± 0.487
1.889ArgCys: 1.889 ± 0.507
1.333ArgAsp: 1.333 ± 0.512
0.889ArgGlu: 0.889 ± 0.479
1.556ArgPhe: 1.556 ± 0.463
1.778ArgGly: 1.778 ± 1.481
0.667ArgHis: 0.667 ± 0.244
1.889ArgIle: 1.889 ± 0.214
1.778ArgLys: 1.778 ± 0.738
3.667ArgLeu: 3.667 ± 0.728
0.556ArgMet: 0.556 ± 0.268
2.778ArgAsn: 2.778 ± 1.506
0.889ArgPro: 0.889 ± 0.283
1.0ArgGln: 1.0 ± 0.358
1.333ArgArg: 1.333 ± 0.332
1.667ArgSer: 1.667 ± 1.385
3.334ArgThr: 3.334 ± 0.831
3.667ArgVal: 3.667 ± 0.463
0.556ArgTrp: 0.556 ± 0.268
1.778ArgTyr: 1.778 ± 0.567
0.0ArgXaa: 0.0 ± 0.0
Ser
5.334SerAla: 5.334 ± 0.632
1.333SerCys: 1.333 ± 0.365
4.112SerAsp: 4.112 ± 0.398
2.556SerGlu: 2.556 ± 0.981
4.445SerPhe: 4.445 ± 0.99
4.445SerGly: 4.445 ± 0.652
1.222SerHis: 1.222 ± 0.45
4.0SerIle: 4.0 ± 1.095
3.667SerLys: 3.667 ± 1.134
5.89SerLeu: 5.89 ± 1.366
1.445SerMet: 1.445 ± 0.564
3.778SerAsn: 3.778 ± 1.288
1.889SerPro: 1.889 ± 0.271
2.334SerGln: 2.334 ± 0.634
2.667SerArg: 2.667 ± 2.434
5.556SerSer: 5.556 ± 1.67
4.667SerThr: 4.667 ± 0.904
6.556SerVal: 6.556 ± 0.791
0.889SerTrp: 0.889 ± 0.263
2.778SerTyr: 2.778 ± 0.241
0.0SerXaa: 0.0 ± 0.0
Thr
3.556ThrAla: 3.556 ± 1.606
2.556ThrCys: 2.556 ± 0.853
3.111ThrAsp: 3.111 ± 0.569
1.889ThrGlu: 1.889 ± 0.202
2.778ThrPhe: 2.778 ± 0.72
4.889ThrGly: 4.889 ± 2.692
0.778ThrHis: 0.778 ± 0.241
2.889ThrIle: 2.889 ± 0.633
2.445ThrLys: 2.445 ± 0.649
5.667ThrLeu: 5.667 ± 0.797
2.334ThrMet: 2.334 ± 0.615
2.778ThrAsn: 2.778 ± 0.605
3.445ThrPro: 3.445 ± 0.856
2.111ThrGln: 2.111 ± 0.701
2.556ThrArg: 2.556 ± 0.306
4.445ThrSer: 4.445 ± 0.81
4.556ThrThr: 4.556 ± 1.56
6.89ThrVal: 6.89 ± 1.387
0.444ThrTrp: 0.444 ± 0.236
3.0ThrTyr: 3.0 ± 0.514
0.0ThrXaa: 0.0 ± 0.0
Val
6.89ValAla: 6.89 ± 0.959
3.223ValCys: 3.223 ± 1.0
6.112ValAsp: 6.112 ± 0.526
4.334ValGlu: 4.334 ± 0.687
4.667ValPhe: 4.667 ± 0.705
5.001ValGly: 5.001 ± 0.712
2.334ValHis: 2.334 ± 0.573
5.223ValIle: 5.223 ± 2.019
7.223ValLys: 7.223 ± 2.764
7.445ValLeu: 7.445 ± 0.835
1.667ValMet: 1.667 ± 0.333
5.445ValAsn: 5.445 ± 0.909
4.0ValPro: 4.0 ± 1.058
2.556ValGln: 2.556 ± 0.723
3.445ValArg: 3.445 ± 0.65
5.89ValSer: 5.89 ± 0.823
5.556ValThr: 5.556 ± 0.917
10.668ValVal: 10.668 ± 2.182
0.556ValTrp: 0.556 ± 0.24
4.223ValTyr: 4.223 ± 0.998
0.0ValXaa: 0.0 ± 0.0
Trp
0.667TrpAla: 0.667 ± 1.151
0.444TrpCys: 0.444 ± 0.236
1.111TrpAsp: 1.111 ± 0.229
0.444TrpGlu: 0.444 ± 0.236
0.444TrpPhe: 0.444 ± 0.122
0.556TrpGly: 0.556 ± 0.144
0.444TrpHis: 0.444 ± 0.28
0.444TrpIle: 0.444 ± 0.3
0.444TrpLys: 0.444 ± 0.259
1.667TrpLeu: 1.667 ± 0.737
0.111TrpMet: 0.111 ± 0.059
1.111TrpAsn: 1.111 ± 0.613
0.667TrpPro: 0.667 ± 0.42
0.556TrpGln: 0.556 ± 0.275
0.667TrpArg: 0.667 ± 0.29
0.778TrpSer: 0.778 ± 0.253
0.556TrpThr: 0.556 ± 0.502
0.778TrpVal: 0.778 ± 0.194
0.333TrpTrp: 0.333 ± 0.126
0.778TrpTyr: 0.778 ± 0.458
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.889TyrAla: 3.889 ± 0.501
1.445TyrCys: 1.445 ± 0.21
4.112TyrAsp: 4.112 ± 1.929
2.0TyrGlu: 2.0 ± 0.675
2.556TyrPhe: 2.556 ± 0.785
2.334TyrGly: 2.334 ± 1.118
0.778TyrHis: 0.778 ± 0.302
2.111TyrIle: 2.111 ± 0.602
2.778TyrLys: 2.778 ± 0.458
3.889TyrLeu: 3.889 ± 1.464
1.445TyrMet: 1.445 ± 0.402
3.223TyrAsn: 3.223 ± 0.412
1.111TyrPro: 1.111 ± 0.393
1.0TyrGln: 1.0 ± 0.55
1.889TyrArg: 1.889 ± 0.623
2.667TyrSer: 2.667 ± 1.084
3.0TyrThr: 3.0 ± 0.792
4.667TyrVal: 4.667 ± 0.739
0.889TyrTrp: 0.889 ± 0.705
2.445TyrTyr: 2.445 ± 0.457
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (9000 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski