Amino acid dipepetide frequency for Mycobacterium phage SwissCheese

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.8AlaAla: 12.8 ± 1.579
0.753AlaCys: 0.753 ± 0.208
6.714AlaAsp: 6.714 ± 0.606
6.651AlaGlu: 6.651 ± 0.736
3.137AlaPhe: 3.137 ± 0.487
7.655AlaGly: 7.655 ± 0.747
1.506AlaHis: 1.506 ± 0.352
4.267AlaIle: 4.267 ± 0.561
4.392AlaLys: 4.392 ± 0.518
8.847AlaLeu: 8.847 ± 0.798
2.573AlaMet: 2.573 ± 0.462
2.447AlaAsn: 2.447 ± 0.394
4.894AlaPro: 4.894 ± 0.79
2.761AlaGln: 2.761 ± 0.442
6.086AlaArg: 6.086 ± 0.513
4.894AlaSer: 4.894 ± 0.554
5.773AlaThr: 5.773 ± 0.689
8.471AlaVal: 8.471 ± 0.808
1.882AlaTrp: 1.882 ± 0.342
2.573AlaTyr: 2.573 ± 0.404
0.0AlaXaa: 0.0 ± 0.0
Cys
0.816CysAla: 0.816 ± 0.259
0.0CysCys: 0.0 ± 0.0
0.565CysAsp: 0.565 ± 0.192
0.69CysGlu: 0.69 ± 0.2
0.125CysPhe: 0.125 ± 0.081
0.376CysGly: 0.376 ± 0.168
0.188CysHis: 0.188 ± 0.105
0.251CysIle: 0.251 ± 0.12
0.251CysLys: 0.251 ± 0.143
0.376CysLeu: 0.376 ± 0.166
0.125CysMet: 0.125 ± 0.092
0.251CysAsn: 0.251 ± 0.11
0.314CysPro: 0.314 ± 0.127
0.188CysGln: 0.188 ± 0.099
0.627CysArg: 0.627 ± 0.208
0.565CysSer: 0.565 ± 0.237
0.314CysThr: 0.314 ± 0.14
0.314CysVal: 0.314 ± 0.17
0.188CysTrp: 0.188 ± 0.105
0.188CysTyr: 0.188 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
6.086AspAla: 6.086 ± 0.714
0.565AspCys: 0.565 ± 0.181
4.33AspAsp: 4.33 ± 0.562
3.514AspGlu: 3.514 ± 0.452
2.51AspPhe: 2.51 ± 0.378
6.086AspGly: 6.086 ± 0.641
1.192AspHis: 1.192 ± 0.305
2.949AspIle: 2.949 ± 0.483
2.51AspLys: 2.51 ± 0.445
6.965AspLeu: 6.965 ± 0.669
1.129AspMet: 1.129 ± 0.208
1.82AspAsn: 1.82 ± 0.351
4.957AspPro: 4.957 ± 0.566
2.008AspGln: 2.008 ± 0.444
3.388AspArg: 3.388 ± 0.397
3.137AspSer: 3.137 ± 0.488
3.765AspThr: 3.765 ± 0.459
4.204AspVal: 4.204 ± 0.599
1.757AspTrp: 1.757 ± 0.327
2.259AspTyr: 2.259 ± 0.332
0.0AspXaa: 0.0 ± 0.0
Glu
6.212GluAla: 6.212 ± 0.675
0.439GluCys: 0.439 ± 0.244
4.267GluAsp: 4.267 ± 0.484
5.584GluGlu: 5.584 ± 0.561
2.322GluPhe: 2.322 ± 0.351
4.141GluGly: 4.141 ± 0.454
1.129GluHis: 1.129 ± 0.273
3.514GluIle: 3.514 ± 0.458
2.886GluLys: 2.886 ± 0.392
7.09GluLeu: 7.09 ± 0.601
1.569GluMet: 1.569 ± 0.3
2.008GluAsn: 2.008 ± 0.403
2.761GluPro: 2.761 ± 0.387
2.447GluGln: 2.447 ± 0.417
3.828GluArg: 3.828 ± 0.608
3.263GluSer: 3.263 ± 0.358
3.765GluThr: 3.765 ± 0.504
5.584GluVal: 5.584 ± 0.699
1.757GluTrp: 1.757 ± 0.366
2.447GluTyr: 2.447 ± 0.492
0.0GluXaa: 0.0 ± 0.0
Phe
2.51PheAla: 2.51 ± 0.32
0.314PheCys: 0.314 ± 0.152
3.012PheAsp: 3.012 ± 0.357
1.945PheGlu: 1.945 ± 0.361
0.439PhePhe: 0.439 ± 0.161
3.263PheGly: 3.263 ± 0.484
1.004PheHis: 1.004 ± 0.304
1.506PheIle: 1.506 ± 0.293
1.569PheLys: 1.569 ± 0.362
2.447PheLeu: 2.447 ± 0.453
0.502PheMet: 0.502 ± 0.206
1.255PheAsn: 1.255 ± 0.24
1.569PhePro: 1.569 ± 0.357
1.129PheGln: 1.129 ± 0.26
2.196PheArg: 2.196 ± 0.344
2.259PheSer: 2.259 ± 0.423
1.945PheThr: 1.945 ± 0.357
1.82PheVal: 1.82 ± 0.364
0.627PheTrp: 0.627 ± 0.177
1.004PheTyr: 1.004 ± 0.289
0.0PheXaa: 0.0 ± 0.0
Gly
6.902GlyAla: 6.902 ± 0.996
0.439GlyCys: 0.439 ± 0.173
5.459GlyAsp: 5.459 ± 0.556
5.083GlyGlu: 5.083 ± 0.489
2.886GlyPhe: 2.886 ± 0.482
9.726GlyGly: 9.726 ± 2.279
1.82GlyHis: 1.82 ± 0.367
4.769GlyIle: 4.769 ± 0.668
3.514GlyLys: 3.514 ± 0.493
7.655GlyLeu: 7.655 ± 0.849
1.882GlyMet: 1.882 ± 0.323
3.326GlyAsn: 3.326 ± 0.351
3.765GlyPro: 3.765 ± 0.561
2.071GlyGln: 2.071 ± 0.288
5.271GlyArg: 5.271 ± 0.558
6.086GlySer: 6.086 ± 0.897
4.643GlyThr: 4.643 ± 0.607
5.522GlyVal: 5.522 ± 0.656
2.51GlyTrp: 2.51 ± 0.418
2.824GlyTyr: 2.824 ± 0.426
0.0GlyXaa: 0.0 ± 0.0
His
1.631HisAla: 1.631 ± 0.349
0.251HisCys: 0.251 ± 0.176
1.067HisAsp: 1.067 ± 0.219
1.631HisGlu: 1.631 ± 0.301
0.941HisPhe: 0.941 ± 0.217
1.757HisGly: 1.757 ± 0.387
0.627HisHis: 0.627 ± 0.186
0.816HisIle: 0.816 ± 0.211
1.255HisLys: 1.255 ± 0.33
1.443HisLeu: 1.443 ± 0.369
0.188HisMet: 0.188 ± 0.139
0.376HisAsn: 0.376 ± 0.17
1.255HisPro: 1.255 ± 0.253
0.878HisGln: 0.878 ± 0.199
1.443HisArg: 1.443 ± 0.317
0.565HisSer: 0.565 ± 0.166
0.941HisThr: 0.941 ± 0.284
1.631HisVal: 1.631 ± 0.308
0.565HisTrp: 0.565 ± 0.179
0.565HisTyr: 0.565 ± 0.203
0.0HisXaa: 0.0 ± 0.0
Ile
6.651IleAla: 6.651 ± 0.85
0.188IleCys: 0.188 ± 0.095
3.012IleAsp: 3.012 ± 0.359
3.514IleGlu: 3.514 ± 0.473
0.878IlePhe: 0.878 ± 0.188
3.953IleGly: 3.953 ± 0.416
0.878IleHis: 0.878 ± 0.21
1.82IleIle: 1.82 ± 0.395
1.631IleLys: 1.631 ± 0.358
3.639IleLeu: 3.639 ± 0.459
0.753IleMet: 0.753 ± 0.198
2.008IleAsn: 2.008 ± 0.321
3.388IlePro: 3.388 ± 0.387
1.569IleGln: 1.569 ± 0.425
3.577IleArg: 3.577 ± 0.484
3.326IleSer: 3.326 ± 0.45
3.263IleThr: 3.263 ± 0.49
2.949IleVal: 2.949 ± 0.51
0.816IleTrp: 0.816 ± 0.176
1.757IleTyr: 1.757 ± 0.264
0.0IleXaa: 0.0 ± 0.0
Lys
4.267LysAla: 4.267 ± 0.526
0.188LysCys: 0.188 ± 0.118
2.761LysAsp: 2.761 ± 0.421
2.259LysGlu: 2.259 ± 0.427
1.882LysPhe: 1.882 ± 0.309
2.384LysGly: 2.384 ± 0.343
1.004LysHis: 1.004 ± 0.237
2.384LysIle: 2.384 ± 0.455
2.071LysLys: 2.071 ± 0.365
3.702LysLeu: 3.702 ± 0.462
1.004LysMet: 1.004 ± 0.261
1.569LysAsn: 1.569 ± 0.249
2.698LysPro: 2.698 ± 0.443
1.506LysGln: 1.506 ± 0.317
2.886LysArg: 2.886 ± 0.528
2.635LysSer: 2.635 ± 0.453
2.196LysThr: 2.196 ± 0.337
3.263LysVal: 3.263 ± 0.48
0.816LysTrp: 0.816 ± 0.22
0.878LysTyr: 0.878 ± 0.26
0.0LysXaa: 0.0 ± 0.0
Leu
9.349LeuAla: 9.349 ± 0.91
0.314LeuCys: 0.314 ± 0.122
5.773LeuAsp: 5.773 ± 0.535
5.71LeuGlu: 5.71 ± 0.605
2.071LeuPhe: 2.071 ± 0.335
7.843LeuGly: 7.843 ± 0.824
1.757LeuHis: 1.757 ± 0.37
4.957LeuIle: 4.957 ± 0.623
4.016LeuLys: 4.016 ± 0.562
5.647LeuLeu: 5.647 ± 0.652
1.757LeuMet: 1.757 ± 0.306
2.824LeuAsn: 2.824 ± 0.38
5.334LeuPro: 5.334 ± 0.584
2.949LeuGln: 2.949 ± 0.433
6.024LeuArg: 6.024 ± 0.54
5.647LeuSer: 5.647 ± 0.588
6.275LeuThr: 6.275 ± 0.572
4.957LeuVal: 4.957 ± 0.686
1.129LeuTrp: 1.129 ± 0.329
2.196LeuTyr: 2.196 ± 0.393
0.0LeuXaa: 0.0 ± 0.0
Met
2.573MetAla: 2.573 ± 0.346
0.063MetCys: 0.063 ± 0.06
1.443MetAsp: 1.443 ± 0.287
1.569MetGlu: 1.569 ± 0.304
0.565MetPhe: 0.565 ± 0.192
1.38MetGly: 1.38 ± 0.298
0.314MetHis: 0.314 ± 0.13
0.627MetIle: 0.627 ± 0.208
1.067MetLys: 1.067 ± 0.213
1.255MetLeu: 1.255 ± 0.262
0.125MetMet: 0.125 ± 0.086
1.129MetAsn: 1.129 ± 0.241
1.192MetPro: 1.192 ± 0.241
0.627MetGln: 0.627 ± 0.189
1.192MetArg: 1.192 ± 0.288
2.196MetSer: 2.196 ± 0.36
2.071MetThr: 2.071 ± 0.298
1.067MetVal: 1.067 ± 0.265
0.188MetTrp: 0.188 ± 0.105
0.376MetTyr: 0.376 ± 0.152
0.0MetXaa: 0.0 ± 0.0
Asn
3.137AsnAla: 3.137 ± 0.475
0.125AsnCys: 0.125 ± 0.086
2.384AsnAsp: 2.384 ± 0.432
2.196AsnGlu: 2.196 ± 0.375
1.004AsnPhe: 1.004 ± 0.282
3.89AsnGly: 3.89 ± 0.523
0.753AsnHis: 0.753 ± 0.215
1.38AsnIle: 1.38 ± 0.321
0.565AsnLys: 0.565 ± 0.213
2.322AsnLeu: 2.322 ± 0.311
0.565AsnMet: 0.565 ± 0.177
0.816AsnAsn: 0.816 ± 0.192
2.51AsnPro: 2.51 ± 0.343
0.941AsnGln: 0.941 ± 0.211
1.38AsnArg: 1.38 ± 0.297
2.133AsnSer: 2.133 ± 0.422
2.071AsnThr: 2.071 ± 0.338
2.384AsnVal: 2.384 ± 0.382
0.753AsnTrp: 0.753 ± 0.184
1.129AsnTyr: 1.129 ± 0.257
0.0AsnXaa: 0.0 ± 0.0
Pro
5.208ProAla: 5.208 ± 0.628
0.502ProCys: 0.502 ± 0.198
4.832ProAsp: 4.832 ± 0.472
4.141ProGlu: 4.141 ± 0.532
2.133ProPhe: 2.133 ± 0.387
4.581ProGly: 4.581 ± 0.601
0.878ProHis: 0.878 ± 0.229
2.322ProIle: 2.322 ± 0.416
2.133ProLys: 2.133 ± 0.295
4.455ProLeu: 4.455 ± 0.536
1.192ProMet: 1.192 ± 0.323
1.443ProAsn: 1.443 ± 0.336
2.824ProPro: 2.824 ± 0.491
1.38ProGln: 1.38 ± 0.286
2.573ProArg: 2.573 ± 0.447
3.639ProSer: 3.639 ± 0.489
4.016ProThr: 4.016 ± 0.515
3.828ProVal: 3.828 ± 0.417
0.627ProTrp: 0.627 ± 0.27
1.38ProTyr: 1.38 ± 0.343
0.0ProXaa: 0.0 ± 0.0
Gln
2.949GlnAla: 2.949 ± 0.476
0.063GlnCys: 0.063 ± 0.063
1.318GlnAsp: 1.318 ± 0.377
1.82GlnGlu: 1.82 ± 0.315
1.318GlnPhe: 1.318 ± 0.277
2.635GlnGly: 2.635 ± 0.399
0.565GlnHis: 0.565 ± 0.174
3.012GlnIle: 3.012 ± 0.519
1.192GlnLys: 1.192 ± 0.283
3.828GlnLeu: 3.828 ± 0.467
0.941GlnMet: 0.941 ± 0.244
0.627GlnAsn: 0.627 ± 0.199
1.443GlnPro: 1.443 ± 0.286
1.82GlnGln: 1.82 ± 0.423
1.631GlnArg: 1.631 ± 0.375
1.443GlnSer: 1.443 ± 0.326
2.071GlnThr: 2.071 ± 0.322
2.635GlnVal: 2.635 ± 0.375
0.627GlnTrp: 0.627 ± 0.17
0.502GlnTyr: 0.502 ± 0.158
0.0GlnXaa: 0.0 ± 0.0
Arg
5.396ArgAla: 5.396 ± 0.709
0.878ArgCys: 0.878 ± 0.267
2.573ArgAsp: 2.573 ± 0.45
4.581ArgGlu: 4.581 ± 0.606
1.694ArgPhe: 1.694 ± 0.354
5.145ArgGly: 5.145 ± 0.755
1.192ArgHis: 1.192 ± 0.344
2.949ArgIle: 2.949 ± 0.464
3.326ArgLys: 3.326 ± 0.524
6.149ArgLeu: 6.149 ± 0.678
1.882ArgMet: 1.882 ± 0.358
2.071ArgAsn: 2.071 ± 0.433
2.322ArgPro: 2.322 ± 0.338
1.82ArgGln: 1.82 ± 0.392
5.208ArgArg: 5.208 ± 0.859
4.079ArgSer: 4.079 ± 0.573
3.012ArgThr: 3.012 ± 0.492
5.334ArgVal: 5.334 ± 0.59
1.318ArgTrp: 1.318 ± 0.315
1.882ArgTyr: 1.882 ± 0.301
0.0ArgXaa: 0.0 ± 0.0
Ser
5.584SerAla: 5.584 ± 0.756
0.565SerCys: 0.565 ± 0.199
3.2SerAsp: 3.2 ± 0.379
4.079SerGlu: 4.079 ± 0.546
2.071SerPhe: 2.071 ± 0.393
6.965SerGly: 6.965 ± 0.906
1.318SerHis: 1.318 ± 0.292
2.949SerIle: 2.949 ± 0.437
2.698SerLys: 2.698 ± 0.391
5.02SerLeu: 5.02 ± 0.616
1.443SerMet: 1.443 ± 0.295
2.196SerAsn: 2.196 ± 0.439
2.949SerPro: 2.949 ± 0.465
2.196SerGln: 2.196 ± 0.34
3.137SerArg: 3.137 ± 0.42
3.326SerSer: 3.326 ± 0.616
3.326SerThr: 3.326 ± 0.491
3.765SerVal: 3.765 ± 0.525
1.38SerTrp: 1.38 ± 0.275
1.318SerTyr: 1.318 ± 0.369
0.0SerXaa: 0.0 ± 0.0
Thr
6.4ThrAla: 6.4 ± 0.861
0.376ThrCys: 0.376 ± 0.164
4.33ThrAsp: 4.33 ± 0.604
4.079ThrGlu: 4.079 ± 0.509
2.259ThrPhe: 2.259 ± 0.382
5.898ThrGly: 5.898 ± 0.636
1.067ThrHis: 1.067 ± 0.261
3.075ThrIle: 3.075 ± 0.577
2.635ThrLys: 2.635 ± 0.361
5.522ThrLeu: 5.522 ± 0.555
0.816ThrMet: 0.816 ± 0.165
1.882ThrAsn: 1.882 ± 0.403
4.079ThrPro: 4.079 ± 0.508
1.882ThrGln: 1.882 ± 0.348
3.388ThrArg: 3.388 ± 0.511
3.263ThrSer: 3.263 ± 0.406
4.079ThrThr: 4.079 ± 0.595
5.145ThrVal: 5.145 ± 0.62
1.067ThrTrp: 1.067 ± 0.28
1.945ThrTyr: 1.945 ± 0.33
0.0ThrXaa: 0.0 ± 0.0
Val
6.463ValAla: 6.463 ± 0.783
0.251ValCys: 0.251 ± 0.11
5.773ValAsp: 5.773 ± 0.646
4.832ValGlu: 4.832 ± 0.617
2.573ValPhe: 2.573 ± 0.4
4.392ValGly: 4.392 ± 0.727
1.443ValHis: 1.443 ± 0.263
3.451ValIle: 3.451 ± 0.417
3.012ValLys: 3.012 ± 0.452
5.773ValLeu: 5.773 ± 0.643
1.38ValMet: 1.38 ± 0.289
2.573ValAsn: 2.573 ± 0.317
3.828ValPro: 3.828 ± 0.517
2.322ValGln: 2.322 ± 0.396
5.208ValArg: 5.208 ± 0.673
4.581ValSer: 4.581 ± 0.408
5.522ValThr: 5.522 ± 0.589
4.706ValVal: 4.706 ± 0.577
1.129ValTrp: 1.129 ± 0.249
2.259ValTyr: 2.259 ± 0.382
0.0ValXaa: 0.0 ± 0.0
Trp
1.569TrpAla: 1.569 ± 0.277
0.314TrpCys: 0.314 ± 0.145
1.318TrpAsp: 1.318 ± 0.288
0.878TrpGlu: 0.878 ± 0.218
0.878TrpPhe: 0.878 ± 0.248
1.569TrpGly: 1.569 ± 0.274
0.565TrpHis: 0.565 ± 0.204
1.004TrpIle: 1.004 ± 0.249
0.376TrpLys: 0.376 ± 0.236
2.008TrpLeu: 2.008 ± 0.424
0.376TrpMet: 0.376 ± 0.177
0.627TrpAsn: 0.627 ± 0.185
0.753TrpPro: 0.753 ± 0.233
1.004TrpGln: 1.004 ± 0.204
1.192TrpArg: 1.192 ± 0.271
0.941TrpSer: 0.941 ± 0.254
1.882TrpThr: 1.882 ± 0.379
2.071TrpVal: 2.071 ± 0.316
0.565TrpTrp: 0.565 ± 0.222
0.251TrpTyr: 0.251 ± 0.123
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.573TyrAla: 2.573 ± 0.43
0.125TyrCys: 0.125 ± 0.099
1.004TyrAsp: 1.004 ± 0.264
2.259TyrGlu: 2.259 ± 0.338
0.627TyrPhe: 0.627 ± 0.191
2.51TyrGly: 2.51 ± 0.399
0.627TyrHis: 0.627 ± 0.211
1.569TyrIle: 1.569 ± 0.297
1.255TyrLys: 1.255 ± 0.301
2.384TyrLeu: 2.384 ± 0.397
0.753TyrMet: 0.753 ± 0.216
1.192TyrAsn: 1.192 ± 0.277
1.255TyrPro: 1.255 ± 0.274
0.941TyrGln: 0.941 ± 0.263
2.635TyrArg: 2.635 ± 0.38
1.506TyrSer: 1.506 ± 0.302
2.259TyrThr: 2.259 ± 0.445
1.82TyrVal: 1.82 ± 0.389
0.439TyrTrp: 0.439 ± 0.161
0.502TyrTyr: 0.502 ± 0.147
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (15938 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski