Amino acid dipeptide frequency for Gordonia phage Obliviate

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
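As an illustration of the idea above, the following is a minimal sketch of how dipeptide frequencies could be counted and compared with the uniform 2.5‰ expectation. It is not the code used by Proteome-pI: the function names, the toy sequences, the use of overlapping residue pairs, and the mapping of any non-standard letter to `X` (Xaa) are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all residue pairs.

    Assumption: pairs are overlapping windows of length 2 and never span
    two proteins; letters outside the 20 standard ones become 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Hypothetical toy input, not sequences from the phage proteome.
toy_proteins = ["MAAAGL", "MAAK"]
freqs = dipeptide_frequencies(toy_proteins)
print(freqs["AA"], classify(freqs["AA"]))
```

With the values from the table, `classify(17.495)` (AlaAla) gives `"over"` and `classify(0.809)` (AlaCys) gives `"under"`. Whether the page's colour coding uses exactly this threshold is an assumption based on the description above.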

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
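The unit conversion can be sketched in a few lines. The helper names below are hypothetical, and the AlaAla value (17.495‰) is taken from the table on this page; the comparison with 2.5‰ assumes the uniform expectation for the 400 standard dipeptides.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


def fold_over_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / (1000.0 / n_dipeptides)


ala_ala = 17.495  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
print(fold_over_uniform(ala_ala))
```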

For more information, see the sequence space article on Wikipedia.

Values are frequency ± error in ‰. Rows give the first residue of the dipeptide, columns the second.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 17.495 ± 1.168 | 0.809 ± 0.231 | 8.343 ± 0.782 | 6.724 ± 0.64 | 3.051 ± 0.621 | 9.339 ± 1.054 | 2.055 ± 0.486 | 5.728 ± 0.557 | 3.237 ± 0.607 | 10.584 ± 0.87 | 2.864 ± 0.542 | 3.237 ± 0.591 | 5.23 ± 0.638 | 4.42 ± 0.589 | 7.596 ± 0.935 | 5.23 ± 0.686 | 7.658 ± 0.76 | 7.782 ± 0.953 | 1.681 ± 0.311 | 2.241 ± 0.383 | 0.0 ± 0.0 |
| Cys | 0.623 ± 0.185 | 0.187 ± 0.151 | 1.37 ± 0.414 | 0.56 ± 0.23 | 0.0 ± 0.0 | 0.872 ± 0.304 | 0.56 ± 0.194 | 0.125 ± 0.086 | 0.125 ± 0.094 | 0.374 ± 0.149 | 0.187 ± 0.122 | 0.311 ± 0.136 | 0.934 ± 0.276 | 0.311 ± 0.135 | 0.498 ± 0.172 | 0.436 ± 0.18 | 0.498 ± 0.168 | 0.436 ± 0.158 | 0.436 ± 0.192 | 0.062 ± 0.062 | 0.0 ± 0.0 |
| Asp | 7.409 ± 0.802 | 0.249 ± 0.115 | 5.417 ± 0.837 | 5.043 ± 0.645 | 1.868 ± 0.316 | 6.35 ± 0.808 | 1.93 ± 0.464 | 2.926 ± 0.457 | 1.432 ± 0.345 | 6.039 ± 0.638 | 1.37 ± 0.28 | 1.868 ± 0.38 | 3.798 ± 0.565 | 2.739 ± 0.328 | 5.417 ± 0.832 | 3.611 ± 0.387 | 3.985 ± 0.688 | 5.541 ± 0.553 | 1.121 ± 0.233 | 1.868 ± 0.46 | 0.0 ± 0.0 |
| Glu | 5.79 ± 0.619 | 0.374 ± 0.202 | 3.175 ± 0.47 | 1.868 ± 0.389 | 1.681 ± 0.317 | 4.669 ± 0.605 | 1.183 ± 0.228 | 2.553 ± 0.385 | 1.992 ± 0.281 | 5.541 ± 0.75 | 1.681 ± 0.262 | 1.307 ± 0.262 | 3.798 ± 0.644 | 3.175 ± 0.45 | 4.358 ± 0.562 | 2.304 ± 0.343 | 2.615 ± 0.318 | 4.483 ± 0.518 | 1.37 ± 0.318 | 1.681 ± 0.359 | 0.0 ± 0.0 |
| Phe | 2.988 ± 0.38 | 0.374 ± 0.153 | 2.117 ± 0.304 | 1.619 ± 0.335 | 0.809 ± 0.257 | 2.366 ± 0.321 | 0.498 ± 0.195 | 0.747 ± 0.242 | 0.809 ± 0.32 | 1.743 ± 0.364 | 0.56 ± 0.164 | 0.623 ± 0.239 | 1.432 ± 0.271 | 0.56 ± 0.162 | 2.117 ± 0.313 | 1.806 ± 0.358 | 2.304 ± 0.344 | 2.677 ± 0.366 | 0.374 ± 0.137 | 0.56 ± 0.21 | 0.0 ± 0.0 |
| Gly | 8.156 ± 1.11 | 0.374 ± 0.157 | 5.977 ± 0.518 | 4.732 ± 0.637 | 2.864 ± 0.474 | 7.658 ± 0.892 | 1.681 ± 0.308 | 3.798 ± 0.606 | 3.549 ± 0.587 | 8.778 ± 1.164 | 1.37 ± 0.248 | 2.677 ± 0.424 | 4.607 ± 0.592 | 3.486 ± 0.451 | 6.413 ± 0.673 | 4.171 ± 0.604 | 5.479 ± 0.677 | 5.728 ± 0.543 | 2.677 ± 0.408 | 2.366 ± 0.385 | 0.0 ± 0.0 |
| His | 2.428 ± 0.342 | 0.498 ± 0.178 | 1.37 ± 0.275 | 0.934 ± 0.205 | 0.436 ± 0.183 | 1.494 ± 0.264 | 0.809 ± 0.256 | 0.809 ± 0.242 | 0.436 ± 0.16 | 1.992 ± 0.371 | 0.374 ± 0.143 | 0.56 ± 0.205 | 1.681 ± 0.343 | 0.809 ± 0.191 | 2.241 ± 0.376 | 1.121 ± 0.276 | 1.307 ± 0.354 | 1.681 ± 0.337 | 0.498 ± 0.157 | 0.623 ± 0.196 | 0.0 ± 0.0 |
| Ile | 5.977 ± 0.603 | 0.187 ± 0.126 | 3.486 ± 0.417 | 2.304 ± 0.411 | 0.747 ± 0.245 | 4.981 ± 0.663 | 0.809 ± 0.233 | 1.432 ± 0.306 | 1.619 ± 0.42 | 2.428 ± 0.459 | 0.187 ± 0.091 | 1.183 ± 0.269 | 3.051 ± 0.476 | 1.058 ± 0.244 | 3.673 ± 0.517 | 2.739 ± 0.388 | 3.362 ± 0.388 | 3.922 ± 0.448 | 0.311 ± 0.127 | 0.996 ± 0.251 | 0.0 ± 0.0 |
| Lys | 3.237 ± 0.476 | 0.125 ± 0.099 | 2.179 ± 0.447 | 1.183 ± 0.273 | 0.996 ± 0.314 | 2.553 ± 0.471 | 0.623 ± 0.24 | 1.619 ± 0.379 | 1.93 ± 0.383 | 3.113 ± 0.424 | 0.56 ± 0.23 | 1.183 ± 0.321 | 2.428 ± 0.401 | 0.809 ± 0.213 | 1.806 ± 0.328 | 1.743 ± 0.309 | 2.304 ± 0.375 | 2.241 ± 0.35 | 0.934 ± 0.242 | 0.809 ± 0.261 | 0.0 ± 0.0 |
| Leu | 10.646 ± 0.81 | 0.747 ± 0.212 | 5.79 ± 0.728 | 3.549 ± 0.486 | 2.241 ± 0.325 | 6.973 ± 0.78 | 1.432 ± 0.27 | 3.175 ± 0.471 | 1.992 ± 0.358 | 5.417 ± 0.795 | 1.743 ± 0.401 | 2.553 ± 0.324 | 4.732 ± 0.519 | 1.806 ± 0.358 | 5.292 ± 0.567 | 4.607 ± 0.605 | 5.977 ± 0.606 | 6.413 ± 0.697 | 2.179 ± 0.369 | 1.37 ± 0.255 | 0.0 ± 0.0 |
| Met | 3.362 ± 0.514 | 0.249 ± 0.135 | 0.56 ± 0.186 | 0.809 ± 0.202 | 0.56 ± 0.168 | 1.681 ± 0.359 | 0.311 ± 0.141 | 0.623 ± 0.179 | 0.685 ± 0.201 | 1.556 ± 0.276 | 0.249 ± 0.13 | 0.56 ± 0.155 | 2.117 ± 0.333 | 0.56 ± 0.178 | 2.241 ± 0.574 | 1.93 ± 0.379 | 2.366 ± 0.375 | 0.56 ± 0.249 | 0.56 ± 0.279 | 0.125 ± 0.078 | 0.0 ± 0.0 |
| Asn | 2.615 ± 0.454 | 0.374 ± 0.146 | 1.992 ± 0.387 | 1.307 ± 0.245 | 0.747 ± 0.194 | 3.486 ± 0.498 | 0.685 ± 0.157 | 0.934 ± 0.26 | 0.747 ± 0.229 | 1.494 ± 0.262 | 0.685 ± 0.239 | 0.747 ± 0.259 | 3.113 ± 0.52 | 0.685 ± 0.194 | 2.179 ± 0.416 | 1.743 ± 0.353 | 2.428 ± 0.447 | 1.619 ± 0.36 | 0.436 ± 0.129 | 0.996 ± 0.238 | 0.0 ± 0.0 |
| Pro | 6.724 ± 0.854 | 0.934 ± 0.319 | 4.918 ± 0.659 | 4.234 ± 0.641 | 1.743 ± 0.332 | 5.728 ± 0.625 | 1.37 ± 0.295 | 3.486 ± 0.437 | 2.677 ± 0.303 | 3.237 ± 0.459 | 1.183 ± 0.41 | 2.055 ± 0.363 | 3.051 ± 0.559 | 1.619 ± 0.263 | 3.3 ± 0.493 | 2.988 ± 0.322 | 3.985 ± 0.631 | 3.673 ± 0.467 | 0.934 ± 0.245 | 0.996 ± 0.261 | 0.0 ± 0.0 |
| Gln | 3.549 ± 0.539 | 0.125 ± 0.094 | 1.058 ± 0.243 | 1.432 ± 0.308 | 0.996 ± 0.244 | 2.553 ± 0.385 | 1.37 ± 0.317 | 1.245 ± 0.27 | 1.183 ± 0.24 | 3.424 ± 0.395 | 0.872 ± 0.238 | 0.685 ± 0.21 | 2.366 ± 0.54 | 1.743 ± 0.389 | 2.988 ± 0.42 | 1.307 ± 0.248 | 1.992 ± 0.386 | 3.175 ± 0.479 | 0.934 ± 0.271 | 1.058 ± 0.194 | 0.0 ± 0.0 |
| Arg | 7.969 ± 0.879 | 0.872 ± 0.235 | 5.603 ± 0.578 | 4.545 ± 0.518 | 1.868 ± 0.322 | 6.164 ± 0.648 | 1.93 ± 0.333 | 3.549 ± 0.486 | 1.992 ± 0.271 | 4.918 ± 0.497 | 2.615 ± 0.41 | 2.241 ± 0.376 | 3.486 ± 0.51 | 2.988 ± 0.502 | 6.413 ± 0.839 | 4.296 ± 0.479 | 4.669 ± 0.55 | 4.918 ± 0.677 | 1.681 ± 0.336 | 1.619 ± 0.336 | 0.0 ± 0.0 |
| Ser | 5.728 ± 0.821 | 0.311 ± 0.144 | 2.988 ± 0.47 | 3.424 ± 0.533 | 1.432 ± 0.274 | 5.79 ± 0.603 | 0.685 ± 0.199 | 2.802 ± 0.44 | 1.432 ± 0.235 | 3.113 ± 0.415 | 1.121 ± 0.239 | 1.93 ± 0.425 | 2.802 ± 0.439 | 1.432 ± 0.309 | 3.736 ± 0.564 | 2.117 ± 0.382 | 4.607 ± 0.521 | 4.296 ± 0.499 | 1.183 ± 0.26 | 0.872 ± 0.248 | 0.0 ± 0.0 |
| Thr | 7.969 ± 0.605 | 0.436 ± 0.191 | 5.354 ± 0.659 | 4.483 ± 0.521 | 2.49 ± 0.472 | 5.852 ± 0.624 | 1.619 ± 0.383 | 3.175 ± 0.418 | 2.615 ± 0.39 | 5.79 ± 0.721 | 1.183 ± 0.236 | 1.93 ± 0.348 | 4.545 ± 0.579 | 1.992 ± 0.335 | 4.234 ± 0.445 | 3.362 ± 0.508 | 4.296 ± 0.526 | 5.728 ± 0.512 | 1.37 ± 0.272 | 1.183 ± 0.283 | 0.0 ± 0.0 |
| Val | 8.529 ± 0.787 | 1.307 ± 0.332 | 6.039 ± 0.719 | 4.296 ± 0.602 | 1.37 ± 0.27 | 5.167 ± 0.593 | 1.121 ± 0.255 | 4.234 ± 0.451 | 2.241 ± 0.417 | 5.105 ± 0.63 | 1.868 ± 0.341 | 1.619 ± 0.28 | 3.175 ± 0.478 | 2.428 ± 0.418 | 5.977 ± 0.678 | 3.798 ± 0.529 | 6.288 ± 0.711 | 7.471 ± 0.763 | 1.619 ± 0.351 | 2.179 ± 0.295 | 0.0 ± 0.0 |
| Trp | 1.681 ± 0.325 | 0.187 ± 0.101 | 1.058 ± 0.366 | 0.809 ± 0.207 | 0.623 ± 0.211 | 0.934 ± 0.249 | 0.56 ± 0.189 | 0.685 ± 0.174 | 0.809 ± 0.243 | 2.366 ± 0.363 | 0.436 ± 0.157 | 0.934 ± 0.378 | 1.307 ± 0.25 | 0.747 ± 0.192 | 1.93 ± 0.386 | 1.494 ± 0.331 | 1.743 ± 0.328 | 1.806 ± 0.279 | 0.436 ± 0.164 | 0.56 ± 0.156 | 0.0 ± 0.0 |
| Tyr | 2.428 ± 0.463 | 0.187 ± 0.124 | 1.183 ± 0.218 | 1.743 ± 0.405 | 0.56 ± 0.18 | 1.93 ± 0.317 | 0.872 ± 0.234 | 0.809 ± 0.318 | 0.872 ± 0.193 | 1.619 ± 0.385 | 0.498 ± 0.185 | 0.809 ± 0.204 | 1.245 ± 0.28 | 0.685 ± 0.271 | 1.93 ± 0.346 | 0.996 ± 0.261 | 1.743 ± 0.304 | 1.681 ± 0.347 | 0.374 ± 0.157 | 0.623 ± 0.23 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 80 proteins (16063 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
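For readers unfamiliar with protein-level bootstrapping, here is a minimal sketch of one way such an error could be estimated: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported. The exact procedure used by Proteome-pI is not described on this page, so the function names, the use of the sample standard deviation, the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy input, not sequences from the phage proteome.
toy_proteins = ["MAAAGL", "MAAK", "MKLAAV", "MGGS"]
point_estimate = dipeptide_per_mille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point_estimate:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than single residues keeps each protein's internal composition intact, which is presumably why the note specifies the protein level.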

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski