Amino acid dipeptide frequency for Mycobacterium phage NicoleTera

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
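As a rough illustration of the idea, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the choice to normalise by the total number of counted dipeptides are assumptions made for this example; the page does not state how its own values are normalised.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are overlapping adjacent residue pairs counted within each
    protein; a pair never spans two different proteins.
    Assumption: frequencies are normalised by the total dipeptide count.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy, made-up sequences in one-letter code (not from the NicoleTera proteome).
freqs = dipeptide_permille(["MAAG", "AAK"])
```

Here the two toy sequences contain five dipeptides in total (MA, AA, AG, AA, AK), so `freqs["AA"]` is two fifths of 1000.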

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
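The unit conversion and the comparison with the uniform expectation can be sketched as follows. The names `permille_to_fraction` and `classify` are hypothetical, and the plain greater-than or less-than comparison against 1/400 is an assumption; the page does not say which threshold it uses for the red and blue marking. The three example values are taken from the table.

```python
# Uniform expectation for the 400 dipeptides of the 20 standard amino acids:
# 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Values copied from the table, in per mille.
examples = {"AlaAla": 11.484, "GlyGly": 9.896, "CysMet": 0.061}
labels = {pair: classify(value) for pair, value in examples.items()}
```

With this rule AlaAla and GlyGly come out as overrepresented and CysMet as underrepresented.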

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.484 ± 1.134
AlaCys: 0.428 ± 0.165
AlaAsp: 5.498 ± 0.701
AlaGlu: 7.941 ± 0.876
AlaPhe: 3.91 ± 0.463
AlaGly: 7.88 ± 0.872
AlaHis: 1.833 ± 0.294
AlaIle: 3.971 ± 0.457
AlaLys: 4.52 ± 0.523
AlaLeu: 8.491 ± 0.939
AlaMet: 2.627 ± 0.449
AlaAsn: 3.665 ± 0.534
AlaPro: 4.459 ± 0.604
AlaGln: 3.604 ± 0.539
AlaArg: 6.597 ± 0.669
AlaSer: 5.437 ± 0.584
AlaThr: 5.498 ± 0.859
AlaVal: 7.33 ± 0.853
AlaTrp: 2.016 ± 0.335
AlaTyr: 2.81 ± 0.416
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.611 ± 0.207
CysCys: 0.0 ± 0.0
CysAsp: 0.611 ± 0.209
CysGlu: 0.55 ± 0.166
CysPhe: 0.305 ± 0.163
CysGly: 0.794 ± 0.222
CysHis: 0.305 ± 0.148
CysIle: 0.183 ± 0.109
CysLys: 0.489 ± 0.189
CysLeu: 0.794 ± 0.25
CysMet: 0.061 ± 0.053
CysAsn: 0.489 ± 0.158
CysPro: 0.55 ± 0.237
CysGln: 0.061 ± 0.067
CysArg: 0.55 ± 0.189
CysSer: 0.672 ± 0.24
CysThr: 0.489 ± 0.17
CysVal: 0.672 ± 0.19
CysTrp: 0.305 ± 0.141
CysTyr: 0.428 ± 0.156
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.048 ± 0.691
AspCys: 0.733 ± 0.227
AspAsp: 3.787 ± 0.542
AspGlu: 3.971 ± 0.635
AspPhe: 2.688 ± 0.554
AspGly: 6.17 ± 0.527
AspHis: 1.344 ± 0.296
AspIle: 3.482 ± 0.444
AspLys: 2.077 ± 0.347
AspLeu: 5.559 ± 0.752
AspMet: 1.833 ± 0.299
AspAsn: 1.405 ± 0.354
AspPro: 4.643 ± 0.697
AspGln: 1.772 ± 0.377
AspArg: 2.81 ± 0.425
AspSer: 2.627 ± 0.42
AspThr: 2.871 ± 0.429
AspVal: 4.337 ± 0.616
AspTrp: 1.344 ± 0.304
AspTyr: 2.321 ± 0.346
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.819 ± 0.895
GluCys: 0.244 ± 0.129
GluAsp: 4.887 ± 0.827
GluGlu: 5.009 ± 0.609
GluPhe: 2.749 ± 0.486
GluGly: 5.315 ± 0.699
GluHis: 1.344 ± 0.309
GluIle: 3.36 ± 0.466
GluLys: 2.443 ± 0.383
GluLeu: 7.697 ± 0.79
GluMet: 2.382 ± 0.369
GluAsn: 2.016 ± 0.354
GluPro: 2.871 ± 0.486
GluGln: 2.749 ± 0.377
GluArg: 4.582 ± 0.554
GluSer: 2.81 ± 0.378
GluThr: 3.91 ± 0.533
GluVal: 4.215 ± 0.519
GluTrp: 1.344 ± 0.26
GluTyr: 1.772 ± 0.282
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.932 ± 0.414
PheCys: 0.367 ± 0.156
PheAsp: 2.443 ± 0.508
PheGlu: 2.871 ± 0.495
PhePhe: 0.794 ± 0.231
PheGly: 2.749 ± 0.354
PheHis: 0.794 ± 0.234
PheIle: 1.588 ± 0.292
PheLys: 1.588 ± 0.31
PheLeu: 2.566 ± 0.516
PheMet: 0.611 ± 0.196
PheAsn: 1.833 ± 0.378
PhePro: 1.894 ± 0.364
PheGln: 1.038 ± 0.3
PheArg: 2.138 ± 0.373
PheSer: 2.016 ± 0.333
PheThr: 2.26 ± 0.321
PheVal: 1.955 ± 0.364
PheTrp: 0.489 ± 0.14
PheTyr: 0.733 ± 0.208
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.88 ± 0.97
GlyCys: 0.794 ± 0.252
GlyAsp: 5.559 ± 0.774
GlyGlu: 5.559 ± 0.501
GlyPhe: 2.932 ± 0.438
GlyGly: 9.896 ± 2.137
GlyHis: 1.527 ± 0.288
GlyIle: 4.337 ± 0.597
GlyLys: 3.91 ± 0.511
GlyLeu: 6.597 ± 0.658
GlyMet: 2.138 ± 0.461
GlyAsn: 3.849 ± 0.426
GlyPro: 4.948 ± 2.12
GlyGln: 3.177 ± 0.493
GlyArg: 3.726 ± 0.488
GlySer: 4.52 ± 0.721
GlyThr: 5.437 ± 0.774
GlyVal: 6.414 ± 0.686
GlyTrp: 1.588 ± 0.306
GlyTyr: 2.077 ± 0.302
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.833 ± 0.309
HisCys: 0.244 ± 0.121
HisAsp: 1.222 ± 0.281
HisGlu: 1.1 ± 0.287
HisPhe: 0.428 ± 0.159
HisGly: 1.527 ± 0.463
HisHis: 0.55 ± 0.169
HisIle: 1.405 ± 0.288
HisLys: 1.344 ± 0.301
HisLeu: 1.772 ± 0.351
HisMet: 0.244 ± 0.104
HisAsn: 0.611 ± 0.207
HisPro: 1.038 ± 0.223
HisGln: 0.611 ± 0.204
HisArg: 1.466 ± 0.366
HisSer: 0.855 ± 0.235
HisThr: 0.916 ± 0.255
HisVal: 1.038 ± 0.257
HisTrp: 0.367 ± 0.153
HisTyr: 0.733 ± 0.24
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.925 ± 0.661
IleCys: 0.611 ± 0.196
IleAsp: 3.543 ± 0.454
IleGlu: 4.032 ± 0.516
IlePhe: 1.222 ± 0.271
IleGly: 4.52 ± 0.886
IleHis: 0.855 ± 0.265
IleIle: 2.138 ± 0.351
IleLys: 2.505 ± 0.447
IleLeu: 3.421 ± 0.433
IleMet: 0.428 ± 0.15
IleAsn: 2.26 ± 0.466
IlePro: 3.299 ± 0.471
IleGln: 1.71 ± 0.326
IleArg: 2.566 ± 0.404
IleSer: 2.382 ± 0.456
IleThr: 2.871 ± 0.383
IleVal: 2.688 ± 0.403
IleTrp: 0.611 ± 0.158
IleTyr: 0.916 ± 0.222
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.704 ± 0.655
LysCys: 0.367 ± 0.155
LysAsp: 2.26 ± 0.425
LysGlu: 2.016 ± 0.402
LysPhe: 1.038 ± 0.265
LysGly: 3.421 ± 0.478
LysHis: 0.672 ± 0.236
LysIle: 1.833 ± 0.301
LysLys: 3.543 ± 0.633
LysLeu: 4.276 ± 0.534
LysMet: 0.916 ± 0.219
LysAsn: 1.283 ± 0.249
LysPro: 2.932 ± 0.516
LysGln: 1.466 ± 0.288
LysArg: 3.299 ± 0.504
LysSer: 2.077 ± 0.33
LysThr: 2.749 ± 0.446
LysVal: 4.215 ± 0.519
LysTrp: 0.733 ± 0.25
LysTyr: 1.344 ± 0.291
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.491 ± 0.867
LeuCys: 0.794 ± 0.242
LeuAsp: 5.009 ± 0.543
LeuGlu: 5.437 ± 0.626
LeuPhe: 2.688 ± 0.429
LeuGly: 5.742 ± 0.704
LeuHis: 1.588 ± 0.36
LeuIle: 4.093 ± 0.445
LeuLys: 2.871 ± 0.417
LeuLeu: 6.353 ± 0.816
LeuMet: 2.382 ± 0.427
LeuAsn: 2.321 ± 0.528
LeuPro: 4.398 ± 0.508
LeuGln: 2.688 ± 0.585
LeuArg: 6.353 ± 0.784
LeuSer: 5.925 ± 0.572
LeuThr: 6.414 ± 0.828
LeuVal: 4.643 ± 0.657
LeuTrp: 1.344 ± 0.264
LeuTyr: 2.443 ± 0.396
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.054 ± 0.416
MetCys: 0.061 ± 0.067
MetAsp: 1.038 ± 0.275
MetGlu: 1.161 ± 0.237
MetPhe: 0.489 ± 0.148
MetGly: 1.344 ± 0.297
MetHis: 0.428 ± 0.148
MetIle: 1.161 ± 0.305
MetLys: 1.588 ± 0.375
MetLeu: 1.71 ± 0.409
MetMet: 0.55 ± 0.189
MetAsn: 0.794 ± 0.217
MetPro: 1.283 ± 0.351
MetGln: 0.794 ± 0.259
MetArg: 1.71 ± 0.339
MetSer: 2.749 ± 0.397
MetThr: 2.26 ± 0.34
MetVal: 1.161 ± 0.275
MetTrp: 0.367 ± 0.177
MetTyr: 0.611 ± 0.174
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.115 ± 0.564
AsnCys: 0.611 ± 0.189
AsnAsp: 1.894 ± 0.288
AsnGlu: 2.199 ± 0.369
AsnPhe: 0.794 ± 0.268
AsnGly: 3.604 ± 0.507
AsnHis: 1.161 ± 0.248
AsnIle: 1.588 ± 0.329
AsnLys: 1.222 ± 0.249
AsnLeu: 2.871 ± 0.415
AsnMet: 0.611 ± 0.184
AsnAsn: 0.611 ± 0.156
AsnPro: 2.382 ± 0.365
AsnGln: 0.916 ± 0.244
AsnArg: 1.894 ± 0.395
AsnSer: 1.466 ± 0.294
AsnThr: 1.833 ± 0.391
AsnVal: 2.871 ± 0.405
AsnTrp: 1.038 ± 0.253
AsnTyr: 1.283 ± 0.251
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.437 ± 0.599
ProCys: 0.305 ± 0.13
ProAsp: 4.154 ± 0.54
ProGlu: 4.948 ± 0.566
ProPhe: 1.527 ± 0.372
ProGly: 4.459 ± 0.511
ProHis: 0.916 ± 0.209
ProIle: 2.321 ± 0.355
ProLys: 2.321 ± 0.582
ProLeu: 3.299 ± 0.436
ProMet: 1.1 ± 0.265
ProAsn: 2.199 ± 0.357
ProPro: 2.321 ± 0.404
ProGln: 2.871 ± 1.287
ProArg: 4.093 ± 0.656
ProSer: 2.627 ± 0.362
ProThr: 3.238 ± 0.435
ProVal: 3.726 ± 0.402
ProTrp: 1.038 ± 0.354
ProTyr: 1.405 ± 0.29
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.704 ± 0.595
GlnCys: 0.244 ± 0.125
GlnAsp: 1.283 ± 0.262
GlnGlu: 1.466 ± 0.333
GlnPhe: 1.71 ± 0.382
GlnGly: 4.337 ± 1.55
GlnHis: 0.672 ± 0.208
GlnIle: 2.566 ± 0.369
GlnLys: 1.466 ± 0.293
GlnLeu: 3.177 ± 0.693
GlnMet: 0.916 ± 0.237
GlnAsn: 0.916 ± 0.244
GlnPro: 1.405 ± 0.289
GlnGln: 1.894 ± 0.368
GlnArg: 2.138 ± 0.472
GlnSer: 1.466 ± 0.344
GlnThr: 2.077 ± 0.299
GlnVal: 1.955 ± 0.291
GlnTrp: 0.794 ± 0.215
GlnTyr: 0.672 ± 0.173
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.437 ± 0.524
ArgCys: 0.672 ± 0.256
ArgAsp: 4.215 ± 0.528
ArgGlu: 5.376 ± 0.791
ArgPhe: 2.627 ± 0.365
ArgGly: 4.582 ± 0.533
ArgHis: 1.527 ± 0.288
ArgIle: 3.421 ± 0.449
ArgLys: 3.421 ± 0.641
ArgLeu: 4.887 ± 0.567
ArgMet: 2.382 ± 0.325
ArgAsn: 1.649 ± 0.27
ArgPro: 2.627 ± 0.34
ArgGln: 1.71 ± 0.403
ArgArg: 5.742 ± 0.643
ArgSer: 3.238 ± 0.407
ArgThr: 2.932 ± 0.445
ArgVal: 4.826 ± 0.556
ArgTrp: 1.222 ± 0.287
ArgTyr: 2.077 ± 0.322
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.704 ± 0.584
SerCys: 0.672 ± 0.193
SerAsp: 3.054 ± 0.445
SerGlu: 4.032 ± 0.564
SerPhe: 2.199 ± 0.481
SerGly: 5.315 ± 0.725
SerHis: 0.977 ± 0.232
SerIle: 2.566 ± 0.333
SerLys: 2.321 ± 0.42
SerLeu: 4.765 ± 0.598
SerMet: 1.344 ± 0.236
SerAsn: 1.588 ± 0.359
SerPro: 3.054 ± 0.454
SerGln: 1.772 ± 0.349
SerArg: 3.421 ± 0.416
SerSer: 2.443 ± 0.37
SerThr: 3.482 ± 0.433
SerVal: 3.421 ± 0.489
SerTrp: 0.977 ± 0.251
SerTyr: 1.527 ± 0.263
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.742 ± 0.583
ThrCys: 0.489 ± 0.17
ThrAsp: 3.177 ± 0.553
ThrGlu: 3.421 ± 0.36
ThrPhe: 2.26 ± 0.316
ThrGly: 5.987 ± 1.284
ThrHis: 0.855 ± 0.238
ThrIle: 2.81 ± 0.436
ThrLys: 3.36 ± 0.501
ThrLeu: 4.948 ± 0.573
ThrMet: 1.344 ± 0.291
ThrAsn: 1.833 ± 0.317
ThrPro: 4.582 ± 0.633
ThrGln: 2.199 ± 0.32
ThrArg: 3.054 ± 0.48
ThrSer: 3.115 ± 0.586
ThrThr: 3.299 ± 0.523
ThrVal: 5.07 ± 0.638
ThrTrp: 1.222 ± 0.331
ThrTyr: 2.199 ± 0.359
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.292 ± 0.852
ValCys: 0.733 ± 0.177
ValAsp: 5.009 ± 0.535
ValGlu: 4.765 ± 0.513
ValPhe: 2.016 ± 0.38
ValGly: 5.376 ± 0.555
ValHis: 1.1 ± 0.218
ValIle: 3.177 ± 0.462
ValLys: 3.177 ± 0.345
ValLeu: 5.131 ± 0.681
ValMet: 1.405 ± 0.34
ValAsn: 2.81 ± 0.414
ValPro: 2.932 ± 0.51
ValGln: 2.505 ± 0.369
ValArg: 4.704 ± 0.552
ValSer: 4.398 ± 0.654
ValThr: 5.254 ± 0.699
ValVal: 5.009 ± 0.514
ValTrp: 1.344 ± 0.368
ValTyr: 1.833 ± 0.329
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.527 ± 0.338
TrpCys: 0.305 ± 0.165
TrpAsp: 1.1 ± 0.271
TrpGlu: 1.344 ± 0.294
TrpPhe: 0.55 ± 0.184
TrpGly: 1.344 ± 0.311
TrpHis: 0.367 ± 0.15
TrpIle: 1.1 ± 0.249
TrpLys: 0.55 ± 0.16
TrpLeu: 1.222 ± 0.273
TrpMet: 0.428 ± 0.16
TrpAsn: 0.916 ± 0.275
TrpPro: 0.977 ± 0.228
TrpGln: 1.405 ± 0.284
TrpArg: 1.161 ± 0.243
TrpSer: 1.038 ± 0.227
TrpThr: 1.344 ± 0.286
TrpVal: 1.344 ± 0.275
TrpTrp: 0.489 ± 0.188
TrpTyr: 0.672 ± 0.193
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.26 ± 0.356
TyrCys: 0.244 ± 0.13
TyrAsp: 2.077 ± 0.338
TyrGlu: 2.321 ± 0.389
TyrPhe: 0.733 ± 0.195
TyrGly: 2.505 ± 0.452
TyrHis: 0.489 ± 0.18
TyrIle: 1.405 ± 0.274
TyrLys: 0.489 ± 0.171
TyrLeu: 2.443 ± 0.391
TyrMet: 0.55 ± 0.167
TyrAsn: 0.916 ± 0.213
TyrPro: 1.71 ± 0.344
TyrGln: 0.916 ± 0.238
TyrArg: 2.566 ± 0.436
TyrSer: 1.772 ± 0.367
TyrThr: 1.833 ± 0.338
TyrVal: 2.077 ± 0.377
TyrTrp: 0.55 ± 0.205
TyrTyr: 0.672 ± 0.176
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (16371 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
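A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The helper names `pair_permille` and `bootstrap_error`, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are all assumptions for illustration; the page only states that 100 bootstrap replicates at the protein level were used.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (in per mille) among all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy, made-up sequences in one-letter code (not from the NicoleTera proteome).
toy = ["MAAGK", "AAKLV", "MKVLA", "GGAAL"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps the dipeptides of each protein together, so the estimate reflects variation between proteins.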

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski