Amino acid dipepetide frequency for Streptococcus phage phiD12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.297AlaAla: 4.297 ± 1.07
0.198AlaCys: 0.198 ± 0.099
4.099AlaAsp: 4.099 ± 0.435
4.562AlaGlu: 4.562 ± 0.526
2.777AlaPhe: 2.777 ± 0.358
5.355AlaGly: 5.355 ± 0.675
0.859AlaHis: 0.859 ± 0.274
6.214AlaIle: 6.214 ± 0.766
5.619AlaLys: 5.619 ± 0.574
4.892AlaLeu: 4.892 ± 0.676
1.388AlaMet: 1.388 ± 0.315
4.033AlaAsn: 4.033 ± 0.552
1.388AlaPro: 1.388 ± 0.282
3.504AlaGln: 3.504 ± 0.739
3.239AlaArg: 3.239 ± 0.489
4.892AlaSer: 4.892 ± 0.851
5.024AlaThr: 5.024 ± 0.587
3.834AlaVal: 3.834 ± 0.518
1.058AlaTrp: 1.058 ± 0.282
2.909AlaTyr: 2.909 ± 0.436
0.0AlaXaa: 0.0 ± 0.0
Cys
0.264CysAla: 0.264 ± 0.109
0.132CysCys: 0.132 ± 0.082
0.264CysAsp: 0.264 ± 0.154
0.661CysGlu: 0.661 ± 0.197
0.264CysPhe: 0.264 ± 0.15
0.661CysGly: 0.661 ± 0.189
0.264CysHis: 0.264 ± 0.135
0.331CysIle: 0.331 ± 0.168
0.331CysLys: 0.331 ± 0.249
0.529CysLeu: 0.529 ± 0.191
0.132CysMet: 0.132 ± 0.093
0.331CysAsn: 0.331 ± 0.137
0.264CysPro: 0.264 ± 0.159
0.661CysGln: 0.661 ± 0.207
0.463CysArg: 0.463 ± 0.227
0.529CysSer: 0.529 ± 0.227
0.132CysThr: 0.132 ± 0.098
0.463CysVal: 0.463 ± 0.178
0.0CysTrp: 0.0 ± 0.0
0.529CysTyr: 0.529 ± 0.228
0.0CysXaa: 0.0 ± 0.0
Asp
3.57AspAla: 3.57 ± 0.507
0.397AspCys: 0.397 ± 0.152
2.777AspAsp: 2.777 ± 0.473
3.967AspGlu: 3.967 ± 0.618
3.239AspPhe: 3.239 ± 0.371
4.496AspGly: 4.496 ± 0.576
1.19AspHis: 1.19 ± 0.291
4.099AspIle: 4.099 ± 0.462
4.562AspLys: 4.562 ± 0.431
4.826AspLeu: 4.826 ± 0.738
1.851AspMet: 1.851 ± 0.362
2.116AspAsn: 2.116 ± 0.359
1.322AspPro: 1.322 ± 0.291
1.917AspGln: 1.917 ± 0.328
2.975AspArg: 2.975 ± 0.557
4.231AspSer: 4.231 ± 0.539
2.843AspThr: 2.843 ± 0.416
2.975AspVal: 2.975 ± 0.476
0.859AspTrp: 0.859 ± 0.235
2.843AspTyr: 2.843 ± 0.528
0.0AspXaa: 0.0 ± 0.0
Glu
5.95GluAla: 5.95 ± 0.799
0.331GluCys: 0.331 ± 0.145
4.297GluAsp: 4.297 ± 0.584
6.214GluGlu: 6.214 ± 0.883
2.116GluPhe: 2.116 ± 0.428
4.165GluGly: 4.165 ± 0.467
0.793GluHis: 0.793 ± 0.269
4.363GluIle: 4.363 ± 0.534
6.214GluLys: 6.214 ± 0.885
7.404GluLeu: 7.404 ± 0.637
2.182GluMet: 2.182 ± 0.472
4.562GluAsn: 4.562 ± 0.574
1.521GluPro: 1.521 ± 0.413
4.099GluGln: 4.099 ± 0.473
3.041GluArg: 3.041 ± 0.422
3.041GluSer: 3.041 ± 0.436
4.496GluThr: 4.496 ± 0.583
4.033GluVal: 4.033 ± 0.728
0.661GluTrp: 0.661 ± 0.235
1.983GluTyr: 1.983 ± 0.463
0.0GluXaa: 0.0 ± 0.0
Phe
2.182PheAla: 2.182 ± 0.413
0.397PheCys: 0.397 ± 0.181
3.041PheAsp: 3.041 ± 0.444
3.107PheGlu: 3.107 ± 0.525
1.653PhePhe: 1.653 ± 0.351
2.446PheGly: 2.446 ± 0.424
0.992PheHis: 0.992 ± 0.233
1.983PheIle: 1.983 ± 0.452
2.975PheLys: 2.975 ± 0.679
3.636PheLeu: 3.636 ± 0.578
0.727PheMet: 0.727 ± 0.201
1.917PheAsn: 1.917 ± 0.397
0.992PhePro: 0.992 ± 0.35
1.058PheGln: 1.058 ± 0.289
1.983PheArg: 1.983 ± 0.299
2.38PheSer: 2.38 ± 0.413
2.049PheThr: 2.049 ± 0.413
1.983PheVal: 1.983 ± 0.302
0.661PheTrp: 0.661 ± 0.239
1.983PheTyr: 1.983 ± 0.318
0.0PheXaa: 0.0 ± 0.0
Gly
3.967GlyAla: 3.967 ± 0.562
0.264GlyCys: 0.264 ± 0.143
4.363GlyAsp: 4.363 ± 0.608
3.57GlyGlu: 3.57 ± 0.431
2.909GlyPhe: 2.909 ± 0.368
3.967GlyGly: 3.967 ± 0.841
1.851GlyHis: 1.851 ± 0.431
5.223GlyIle: 5.223 ± 0.755
4.892GlyLys: 4.892 ± 0.558
6.942GlyLeu: 6.942 ± 0.899
1.785GlyMet: 1.785 ± 0.344
3.438GlyAsn: 3.438 ± 0.438
0.926GlyPro: 0.926 ± 0.216
3.636GlyGln: 3.636 ± 0.608
3.372GlyArg: 3.372 ± 0.426
3.702GlySer: 3.702 ± 0.557
3.702GlyThr: 3.702 ± 0.635
3.901GlyVal: 3.901 ± 0.532
0.595GlyTrp: 0.595 ± 0.192
2.975GlyTyr: 2.975 ± 0.597
0.0GlyXaa: 0.0 ± 0.0
His
0.992HisAla: 0.992 ± 0.199
0.0HisCys: 0.0 ± 0.0
1.058HisAsp: 1.058 ± 0.274
1.388HisGlu: 1.388 ± 0.32
0.859HisPhe: 0.859 ± 0.266
1.322HisGly: 1.322 ± 0.25
0.595HisHis: 0.595 ± 0.223
1.587HisIle: 1.587 ± 0.227
1.058HisLys: 1.058 ± 0.281
2.116HisLeu: 2.116 ± 0.348
0.397HisMet: 0.397 ± 0.167
0.859HisAsn: 0.859 ± 0.202
1.322HisPro: 1.322 ± 0.345
1.124HisGln: 1.124 ± 0.319
1.256HisArg: 1.256 ± 0.291
0.727HisSer: 0.727 ± 0.239
0.926HisThr: 0.926 ± 0.266
1.256HisVal: 1.256 ± 0.306
0.331HisTrp: 0.331 ± 0.136
0.595HisTyr: 0.595 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
5.95IleAla: 5.95 ± 0.528
0.397IleCys: 0.397 ± 0.158
5.091IleAsp: 5.091 ± 0.592
4.231IleGlu: 4.231 ± 0.602
1.917IlePhe: 1.917 ± 0.363
4.826IleGly: 4.826 ± 0.56
1.256IleHis: 1.256 ± 0.308
3.901IleIle: 3.901 ± 0.568
4.363IleLys: 4.363 ± 0.521
6.413IleLeu: 6.413 ± 1.003
1.256IleMet: 1.256 ± 0.326
2.909IleAsn: 2.909 ± 0.54
2.644IlePro: 2.644 ± 0.387
3.041IleGln: 3.041 ± 0.288
2.843IleArg: 2.843 ± 0.398
4.826IleSer: 4.826 ± 0.743
5.157IleThr: 5.157 ± 0.805
4.892IleVal: 4.892 ± 0.698
1.058IleTrp: 1.058 ± 0.301
2.512IleTyr: 2.512 ± 0.452
0.0IleXaa: 0.0 ± 0.0
Lys
5.95LysAla: 5.95 ± 0.722
0.595LysCys: 0.595 ± 0.193
3.702LysAsp: 3.702 ± 0.591
5.289LysGlu: 5.289 ± 0.526
2.116LysPhe: 2.116 ± 0.353
4.826LysGly: 4.826 ± 0.512
1.587LysHis: 1.587 ± 0.276
5.157LysIle: 5.157 ± 0.46
5.091LysLys: 5.091 ± 0.55
5.553LysLeu: 5.553 ± 0.556
1.322LysMet: 1.322 ± 0.352
3.306LysAsn: 3.306 ± 0.495
1.653LysPro: 1.653 ± 0.235
4.231LysGln: 4.231 ± 0.523
3.967LysArg: 3.967 ± 0.532
4.694LysSer: 4.694 ± 0.489
4.297LysThr: 4.297 ± 0.512
5.157LysVal: 5.157 ± 0.643
0.793LysTrp: 0.793 ± 0.243
2.38LysTyr: 2.38 ± 0.631
0.0LysXaa: 0.0 ± 0.0
Leu
7.272LeuAla: 7.272 ± 0.951
0.331LeuCys: 0.331 ± 0.195
5.421LeuAsp: 5.421 ± 0.501
7.471LeuGlu: 7.471 ± 0.964
2.843LeuPhe: 2.843 ± 0.541
5.487LeuGly: 5.487 ± 0.674
1.587LeuHis: 1.587 ± 0.3
5.355LeuIle: 5.355 ± 0.6
6.942LeuLys: 6.942 ± 0.591
8.33LeuLeu: 8.33 ± 1.209
2.578LeuMet: 2.578 ± 0.444
4.496LeuAsn: 4.496 ± 0.507
3.504LeuPro: 3.504 ± 0.487
4.165LeuGln: 4.165 ± 0.602
3.173LeuArg: 3.173 ± 0.427
6.347LeuSer: 6.347 ± 0.589
6.479LeuThr: 6.479 ± 0.602
6.611LeuVal: 6.611 ± 0.736
0.463LeuTrp: 0.463 ± 0.188
4.231LeuTyr: 4.231 ± 0.655
0.0LeuXaa: 0.0 ± 0.0
Met
1.719MetAla: 1.719 ± 0.293
0.0MetCys: 0.0 ± 0.0
1.851MetAsp: 1.851 ± 0.387
1.785MetGlu: 1.785 ± 0.357
0.661MetPhe: 0.661 ± 0.226
1.19MetGly: 1.19 ± 0.32
0.264MetHis: 0.264 ± 0.141
1.587MetIle: 1.587 ± 0.364
2.116MetLys: 2.116 ± 0.39
1.521MetLeu: 1.521 ± 0.321
0.793MetMet: 0.793 ± 0.292
0.661MetAsn: 0.661 ± 0.227
0.264MetPro: 0.264 ± 0.144
0.595MetGln: 0.595 ± 0.204
1.454MetArg: 1.454 ± 0.279
2.049MetSer: 2.049 ± 0.411
1.454MetThr: 1.454 ± 0.327
1.785MetVal: 1.785 ± 0.357
0.198MetTrp: 0.198 ± 0.111
0.661MetTyr: 0.661 ± 0.196
0.0MetXaa: 0.0 ± 0.0
Asn
4.694AsnAla: 4.694 ± 0.663
0.198AsnCys: 0.198 ± 0.115
2.512AsnAsp: 2.512 ± 0.371
2.049AsnGlu: 2.049 ± 0.323
2.049AsnPhe: 2.049 ± 0.375
4.099AsnGly: 4.099 ± 0.495
1.124AsnHis: 1.124 ± 0.264
3.57AsnIle: 3.57 ± 0.538
3.041AsnLys: 3.041 ± 0.536
4.892AsnLeu: 4.892 ± 0.789
0.859AsnMet: 0.859 ± 0.24
2.711AsnAsn: 2.711 ± 0.519
2.049AsnPro: 2.049 ± 0.321
1.983AsnGln: 1.983 ± 0.277
2.248AsnArg: 2.248 ± 0.344
2.512AsnSer: 2.512 ± 0.459
2.644AsnThr: 2.644 ± 0.466
2.512AsnVal: 2.512 ± 0.395
0.727AsnTrp: 0.727 ± 0.247
1.058AsnTyr: 1.058 ± 0.219
0.0AsnXaa: 0.0 ± 0.0
Pro
1.388ProAla: 1.388 ± 0.361
0.529ProCys: 0.529 ± 0.202
1.587ProAsp: 1.587 ± 0.296
2.182ProGlu: 2.182 ± 0.383
0.992ProPhe: 0.992 ± 0.24
1.19ProGly: 1.19 ± 0.316
0.793ProHis: 0.793 ± 0.221
2.182ProIle: 2.182 ± 0.412
1.983ProLys: 1.983 ± 0.35
2.843ProLeu: 2.843 ± 0.372
0.331ProMet: 0.331 ± 0.137
1.587ProAsn: 1.587 ± 0.298
0.992ProPro: 0.992 ± 0.277
1.124ProGln: 1.124 ± 0.345
1.322ProArg: 1.322 ± 0.275
2.314ProSer: 2.314 ± 0.468
2.049ProThr: 2.049 ± 0.399
2.182ProVal: 2.182 ± 0.362
0.463ProTrp: 0.463 ± 0.171
1.454ProTyr: 1.454 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
3.834GlnAla: 3.834 ± 0.664
0.529GlnCys: 0.529 ± 0.224
1.983GlnAsp: 1.983 ± 0.379
4.099GlnGlu: 4.099 ± 0.598
1.719GlnPhe: 1.719 ± 0.344
3.107GlnGly: 3.107 ± 0.519
0.992GlnHis: 0.992 ± 0.298
3.107GlnIle: 3.107 ± 0.419
2.777GlnLys: 2.777 ± 0.471
5.223GlnLeu: 5.223 ± 0.631
1.256GlnMet: 1.256 ± 0.382
1.917GlnAsn: 1.917 ± 0.398
1.587GlnPro: 1.587 ± 0.323
2.314GlnGln: 2.314 ± 0.344
1.917GlnArg: 1.917 ± 0.398
2.777GlnSer: 2.777 ± 0.416
3.173GlnThr: 3.173 ± 0.707
3.901GlnVal: 3.901 ± 0.495
0.661GlnTrp: 0.661 ± 0.24
0.727GlnTyr: 0.727 ± 0.193
0.0GlnXaa: 0.0 ± 0.0
Arg
2.248ArgAla: 2.248 ± 0.421
0.661ArgCys: 0.661 ± 0.23
2.049ArgAsp: 2.049 ± 0.383
3.57ArgGlu: 3.57 ± 0.358
2.116ArgPhe: 2.116 ± 0.507
2.644ArgGly: 2.644 ± 0.416
0.926ArgHis: 0.926 ± 0.269
3.107ArgIle: 3.107 ± 0.499
3.107ArgLys: 3.107 ± 0.607
5.289ArgLeu: 5.289 ± 0.626
0.727ArgMet: 0.727 ± 0.203
2.711ArgAsn: 2.711 ± 0.48
1.124ArgPro: 1.124 ± 0.251
2.777ArgGln: 2.777 ± 0.378
2.314ArgArg: 2.314 ± 0.457
2.446ArgSer: 2.446 ± 0.432
3.173ArgThr: 3.173 ± 0.633
2.975ArgVal: 2.975 ± 0.502
0.793ArgTrp: 0.793 ± 0.22
1.653ArgTyr: 1.653 ± 0.426
0.0ArgXaa: 0.0 ± 0.0
Ser
3.173SerAla: 3.173 ± 0.524
0.727SerCys: 0.727 ± 0.278
4.033SerAsp: 4.033 ± 0.582
4.099SerGlu: 4.099 ± 0.48
2.512SerPhe: 2.512 ± 0.479
5.091SerGly: 5.091 ± 0.578
1.587SerHis: 1.587 ± 0.323
5.091SerIle: 5.091 ± 0.709
4.363SerLys: 4.363 ± 0.628
5.752SerLeu: 5.752 ± 0.622
1.058SerMet: 1.058 ± 0.201
2.446SerAsn: 2.446 ± 0.512
2.248SerPro: 2.248 ± 0.36
3.438SerGln: 3.438 ± 0.674
2.116SerArg: 2.116 ± 0.4
5.091SerSer: 5.091 ± 0.934
3.901SerThr: 3.901 ± 0.538
4.165SerVal: 4.165 ± 0.424
0.859SerTrp: 0.859 ± 0.193
2.116SerTyr: 2.116 ± 0.378
0.0SerXaa: 0.0 ± 0.0
Thr
4.826ThrAla: 4.826 ± 0.556
0.198ThrCys: 0.198 ± 0.129
3.041ThrAsp: 3.041 ± 0.487
4.496ThrGlu: 4.496 ± 0.512
2.644ThrPhe: 2.644 ± 0.483
4.363ThrGly: 4.363 ± 0.76
0.595ThrHis: 0.595 ± 0.198
5.289ThrIle: 5.289 ± 0.64
4.694ThrLys: 4.694 ± 0.504
6.214ThrLeu: 6.214 ± 0.518
1.454ThrMet: 1.454 ± 0.26
2.777ThrAsn: 2.777 ± 0.442
2.049ThrPro: 2.049 ± 0.39
2.446ThrGln: 2.446 ± 0.776
2.38ThrArg: 2.38 ± 0.472
4.231ThrSer: 4.231 ± 0.946
6.281ThrThr: 6.281 ± 0.73
5.091ThrVal: 5.091 ± 0.691
0.859ThrTrp: 0.859 ± 0.245
1.983ThrTyr: 1.983 ± 0.439
0.0ThrXaa: 0.0 ± 0.0
Val
4.76ValAla: 4.76 ± 0.535
0.463ValCys: 0.463 ± 0.141
3.372ValAsp: 3.372 ± 0.552
5.024ValGlu: 5.024 ± 0.672
2.248ValPhe: 2.248 ± 0.455
3.702ValGly: 3.702 ± 0.616
1.322ValHis: 1.322 ± 0.274
4.363ValIle: 4.363 ± 0.644
4.694ValLys: 4.694 ± 0.584
5.818ValLeu: 5.818 ± 0.572
1.587ValMet: 1.587 ± 0.376
1.917ValAsn: 1.917 ± 0.374
2.248ValPro: 2.248 ± 0.345
2.314ValGln: 2.314 ± 0.329
3.768ValArg: 3.768 ± 0.607
4.694ValSer: 4.694 ± 0.732
4.429ValThr: 4.429 ± 0.555
3.768ValVal: 3.768 ± 0.425
0.859ValTrp: 0.859 ± 0.255
2.38ValTyr: 2.38 ± 0.405
0.0ValXaa: 0.0 ± 0.0
Trp
0.727TrpAla: 0.727 ± 0.184
0.132TrpCys: 0.132 ± 0.092
0.397TrpAsp: 0.397 ± 0.159
0.926TrpGlu: 0.926 ± 0.28
0.793TrpPhe: 0.793 ± 0.248
0.727TrpGly: 0.727 ± 0.167
0.264TrpHis: 0.264 ± 0.151
0.727TrpIle: 0.727 ± 0.227
0.727TrpLys: 0.727 ± 0.215
1.058TrpLeu: 1.058 ± 0.271
0.331TrpMet: 0.331 ± 0.121
1.322TrpAsn: 1.322 ± 0.307
0.066TrpPro: 0.066 ± 0.063
0.793TrpGln: 0.793 ± 0.285
0.463TrpArg: 0.463 ± 0.211
0.661TrpSer: 0.661 ± 0.253
1.19TrpThr: 1.19 ± 0.339
0.727TrpVal: 0.727 ± 0.174
0.198TrpTrp: 0.198 ± 0.118
0.264TrpTyr: 0.264 ± 0.139
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.38TyrAla: 2.38 ± 0.462
0.727TyrCys: 0.727 ± 0.23
2.116TyrAsp: 2.116 ± 0.552
2.777TyrGlu: 2.777 ± 0.475
1.719TyrPhe: 1.719 ± 0.461
2.314TyrGly: 2.314 ± 0.511
0.926TyrHis: 0.926 ± 0.249
2.248TyrIle: 2.248 ± 0.445
2.116TyrLys: 2.116 ± 0.456
3.834TyrLeu: 3.834 ± 0.678
0.529TyrMet: 0.529 ± 0.246
1.587TyrAsn: 1.587 ± 0.461
1.322TyrPro: 1.322 ± 0.295
2.38TyrGln: 2.38 ± 0.391
2.116TyrArg: 2.116 ± 0.355
1.719TyrSer: 1.719 ± 0.394
2.512TyrThr: 2.512 ± 0.451
1.521TyrVal: 1.521 ± 0.3
0.397TyrTrp: 0.397 ± 0.172
1.124TyrTyr: 1.124 ± 0.315
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (15127 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski