Amino acid dipepetide frequency for Actinoplanes phage phiAsp2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.586AlaAla: 15.586 ± 1.301
0.943AlaCys: 0.943 ± 0.359
6.711AlaAsp: 6.711 ± 0.623
8.708AlaGlu: 8.708 ± 0.884
2.441AlaPhe: 2.441 ± 0.304
10.982AlaGly: 10.982 ± 0.989
2.219AlaHis: 2.219 ± 0.348
4.604AlaIle: 4.604 ± 0.869
2.108AlaLys: 2.108 ± 0.31
11.315AlaLeu: 11.315 ± 0.935
2.496AlaMet: 2.496 ± 0.337
2.108AlaAsn: 2.108 ± 0.479
7.211AlaPro: 7.211 ± 0.674
3.328AlaGln: 3.328 ± 0.388
8.431AlaArg: 8.431 ± 0.744
7.266AlaSer: 7.266 ± 0.984
7.599AlaThr: 7.599 ± 0.644
8.375AlaVal: 8.375 ± 0.675
2.662AlaTrp: 2.662 ± 0.405
2.441AlaTyr: 2.441 ± 0.396
0.0AlaXaa: 0.0 ± 0.0
Cys
1.276CysAla: 1.276 ± 0.328
0.333CysCys: 0.333 ± 0.148
0.666CysAsp: 0.666 ± 0.191
0.555CysGlu: 0.555 ± 0.187
0.166CysPhe: 0.166 ± 0.093
0.721CysGly: 0.721 ± 0.235
0.277CysHis: 0.277 ± 0.127
0.111CysIle: 0.111 ± 0.078
0.222CysLys: 0.222 ± 0.12
0.499CysLeu: 0.499 ± 0.179
0.0CysMet: 0.0 ± 0.0
0.333CysAsn: 0.333 ± 0.152
0.499CysPro: 0.499 ± 0.167
0.333CysGln: 0.333 ± 0.138
0.444CysArg: 0.444 ± 0.135
0.61CysSer: 0.61 ± 0.225
0.333CysThr: 0.333 ± 0.147
0.388CysVal: 0.388 ± 0.124
0.388CysTrp: 0.388 ± 0.179
0.333CysTyr: 0.333 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
5.38AspAla: 5.38 ± 0.537
0.666AspCys: 0.666 ± 0.232
3.661AspAsp: 3.661 ± 0.642
4.104AspGlu: 4.104 ± 0.59
1.609AspPhe: 1.609 ± 0.249
6.822AspGly: 6.822 ± 0.925
1.109AspHis: 1.109 ± 0.256
1.553AspIle: 1.553 ± 0.237
1.886AspLys: 1.886 ± 0.404
6.379AspLeu: 6.379 ± 0.494
1.276AspMet: 1.276 ± 0.257
1.442AspAsn: 1.442 ± 0.269
4.493AspPro: 4.493 ± 0.616
2.385AspGln: 2.385 ± 0.445
5.38AspArg: 5.38 ± 0.683
2.995AspSer: 2.995 ± 0.412
3.162AspThr: 3.162 ± 0.391
3.994AspVal: 3.994 ± 0.361
1.83AspTrp: 1.83 ± 0.249
1.054AspTyr: 1.054 ± 0.277
0.0AspXaa: 0.0 ± 0.0
Glu
6.822GluAla: 6.822 ± 0.677
0.998GluCys: 0.998 ± 0.286
4.271GluAsp: 4.271 ± 0.503
3.494GluGlu: 3.494 ± 0.494
1.997GluPhe: 1.997 ± 0.253
5.99GluGly: 5.99 ± 0.66
1.83GluHis: 1.83 ± 0.339
2.607GluIle: 2.607 ± 0.438
2.163GluLys: 2.163 ± 0.409
5.713GluLeu: 5.713 ± 0.652
1.442GluMet: 1.442 ± 0.284
1.331GluAsn: 1.331 ± 0.287
4.104GluPro: 4.104 ± 0.482
0.555GluGln: 0.555 ± 0.254
5.99GluArg: 5.99 ± 0.667
1.997GluSer: 1.997 ± 0.309
3.439GluThr: 3.439 ± 0.48
6.379GluVal: 6.379 ± 0.564
1.109GluTrp: 1.109 ± 0.237
1.109GluTyr: 1.109 ± 0.206
0.0GluXaa: 0.0 ± 0.0
Phe
2.662PheAla: 2.662 ± 0.358
0.055PheCys: 0.055 ± 0.049
1.664PheAsp: 1.664 ± 0.48
1.941PheGlu: 1.941 ± 0.256
0.499PhePhe: 0.499 ± 0.159
2.773PheGly: 2.773 ± 0.323
0.444PheHis: 0.444 ± 0.154
0.777PheIle: 0.777 ± 0.297
0.333PheLys: 0.333 ± 0.124
1.553PheLeu: 1.553 ± 0.298
0.333PheMet: 0.333 ± 0.133
0.777PheAsn: 0.777 ± 0.186
1.276PhePro: 1.276 ± 0.296
0.887PheGln: 0.887 ± 0.186
1.387PheArg: 1.387 ± 0.331
1.83PheSer: 1.83 ± 0.301
1.941PheThr: 1.941 ± 0.41
1.997PheVal: 1.997 ± 0.329
0.777PheTrp: 0.777 ± 0.154
0.388PheTyr: 0.388 ± 0.127
0.0PheXaa: 0.0 ± 0.0
Gly
10.816GlyAla: 10.816 ± 0.838
0.666GlyCys: 0.666 ± 0.197
4.881GlyAsp: 4.881 ± 0.487
5.824GlyGlu: 5.824 ± 0.581
2.441GlyPhe: 2.441 ± 0.367
8.209GlyGly: 8.209 ± 0.956
1.775GlyHis: 1.775 ± 0.391
3.273GlyIle: 3.273 ± 0.535
2.884GlyLys: 2.884 ± 0.52
8.264GlyLeu: 8.264 ± 1.296
1.941GlyMet: 1.941 ± 0.42
2.496GlyAsn: 2.496 ± 0.405
5.491GlyPro: 5.491 ± 0.627
3.328GlyGln: 3.328 ± 0.451
9.152GlyArg: 9.152 ± 0.929
6.822GlySer: 6.822 ± 0.954
7.322GlyThr: 7.322 ± 1.017
7.1GlyVal: 7.1 ± 0.586
2.496GlyTrp: 2.496 ± 0.376
2.829GlyTyr: 2.829 ± 0.329
0.0GlyXaa: 0.0 ± 0.0
His
2.385HisAla: 2.385 ± 0.361
0.055HisCys: 0.055 ± 0.065
1.054HisAsp: 1.054 ± 0.251
1.165HisGlu: 1.165 ± 0.337
0.444HisPhe: 0.444 ± 0.15
2.052HisGly: 2.052 ± 0.278
0.61HisHis: 0.61 ± 0.226
0.388HisIle: 0.388 ± 0.169
0.444HisLys: 0.444 ± 0.146
1.83HisLeu: 1.83 ± 0.341
0.277HisMet: 0.277 ± 0.138
0.555HisAsn: 0.555 ± 0.159
1.442HisPro: 1.442 ± 0.331
0.61HisGln: 0.61 ± 0.172
1.331HisArg: 1.331 ± 0.304
0.887HisSer: 0.887 ± 0.434
2.607HisThr: 2.607 ± 0.794
1.276HisVal: 1.276 ± 0.238
0.444HisTrp: 0.444 ± 0.161
0.333HisTyr: 0.333 ± 0.129
0.0HisXaa: 0.0 ± 0.0
Ile
3.273IleAla: 3.273 ± 0.568
0.111IleCys: 0.111 ± 0.082
2.163IleAsp: 2.163 ± 0.39
2.718IleGlu: 2.718 ± 0.367
0.277IlePhe: 0.277 ± 0.132
4.049IleGly: 4.049 ± 0.647
0.61IleHis: 0.61 ± 0.188
2.551IleIle: 2.551 ± 0.888
1.331IleLys: 1.331 ± 0.266
2.551IleLeu: 2.551 ± 0.409
0.555IleMet: 0.555 ± 0.189
1.331IleAsn: 1.331 ± 0.498
2.884IlePro: 2.884 ± 0.491
1.941IleGln: 1.941 ± 0.516
3.217IleArg: 3.217 ± 0.399
2.607IleSer: 2.607 ± 0.405
3.273IleThr: 3.273 ± 0.342
3.383IleVal: 3.383 ± 0.481
0.333IleTrp: 0.333 ± 0.135
0.388IleTyr: 0.388 ± 0.16
0.0IleXaa: 0.0 ± 0.0
Lys
2.884LysAla: 2.884 ± 0.523
0.222LysCys: 0.222 ± 0.118
2.441LysAsp: 2.441 ± 0.404
0.943LysGlu: 0.943 ± 0.248
0.721LysPhe: 0.721 ± 0.182
2.219LysGly: 2.219 ± 0.329
0.832LysHis: 0.832 ± 0.231
1.553LysIle: 1.553 ± 0.362
1.83LysLys: 1.83 ± 0.425
2.108LysLeu: 2.108 ± 0.438
0.721LysMet: 0.721 ± 0.173
0.887LysAsn: 0.887 ± 0.248
1.719LysPro: 1.719 ± 0.273
0.277LysGln: 0.277 ± 0.163
1.997LysArg: 1.997 ± 0.335
1.054LysSer: 1.054 ± 0.246
1.886LysThr: 1.886 ± 0.375
2.163LysVal: 2.163 ± 0.341
0.333LysTrp: 0.333 ± 0.183
0.887LysTyr: 0.887 ± 0.297
0.0LysXaa: 0.0 ± 0.0
Leu
11.537LeuAla: 11.537 ± 0.785
0.555LeuCys: 0.555 ± 0.151
6.49LeuAsp: 6.49 ± 0.565
6.101LeuGlu: 6.101 ± 0.755
1.775LeuPhe: 1.775 ± 0.288
7.432LeuGly: 7.432 ± 0.629
1.276LeuHis: 1.276 ± 0.218
3.383LeuIle: 3.383 ± 0.583
1.997LeuLys: 1.997 ± 0.347
5.158LeuLeu: 5.158 ± 0.741
1.498LeuMet: 1.498 ± 0.333
1.941LeuAsn: 1.941 ± 0.4
4.493LeuPro: 4.493 ± 0.475
2.662LeuGln: 2.662 ± 0.477
4.271LeuArg: 4.271 ± 0.447
4.715LeuSer: 4.715 ± 0.43
6.767LeuThr: 6.767 ± 0.588
6.268LeuVal: 6.268 ± 0.902
0.832LeuTrp: 0.832 ± 0.222
1.331LeuTyr: 1.331 ± 0.329
0.0LeuXaa: 0.0 ± 0.0
Met
2.607MetAla: 2.607 ± 0.379
0.444MetCys: 0.444 ± 0.157
0.887MetAsp: 0.887 ± 0.231
0.943MetGlu: 0.943 ± 0.211
0.222MetPhe: 0.222 ± 0.093
1.664MetGly: 1.664 ± 0.27
0.277MetHis: 0.277 ± 0.124
1.165MetIle: 1.165 ± 0.234
0.832MetLys: 0.832 ± 0.252
2.108MetLeu: 2.108 ± 0.338
0.166MetMet: 0.166 ± 0.084
0.388MetAsn: 0.388 ± 0.131
0.721MetPro: 0.721 ± 0.212
0.055MetGln: 0.055 ± 0.055
1.22MetArg: 1.22 ± 0.314
1.498MetSer: 1.498 ± 0.327
1.775MetThr: 1.775 ± 0.307
1.664MetVal: 1.664 ± 0.311
0.388MetTrp: 0.388 ± 0.176
0.166MetTyr: 0.166 ± 0.088
0.0MetXaa: 0.0 ± 0.0
Asn
2.94AsnAla: 2.94 ± 0.46
0.222AsnCys: 0.222 ± 0.124
0.943AsnAsp: 0.943 ± 0.259
0.943AsnGlu: 0.943 ± 0.203
0.61AsnPhe: 0.61 ± 0.209
3.106AsnGly: 3.106 ± 0.478
0.666AsnHis: 0.666 ± 0.211
0.832AsnIle: 0.832 ± 0.197
0.499AsnLys: 0.499 ± 0.154
2.163AsnLeu: 2.163 ± 0.287
0.388AsnMet: 0.388 ± 0.132
0.444AsnAsn: 0.444 ± 0.249
1.775AsnPro: 1.775 ± 0.288
0.777AsnGln: 0.777 ± 0.173
1.886AsnArg: 1.886 ± 0.325
1.498AsnSer: 1.498 ± 0.339
1.997AsnThr: 1.997 ± 0.351
1.609AsnVal: 1.609 ± 0.297
0.499AsnTrp: 0.499 ± 0.18
0.388AsnTyr: 0.388 ± 0.157
0.0AsnXaa: 0.0 ± 0.0
Pro
8.043ProAla: 8.043 ± 0.817
0.555ProCys: 0.555 ± 0.161
4.715ProAsp: 4.715 ± 0.663
5.547ProGlu: 5.547 ± 0.668
1.498ProPhe: 1.498 ± 0.269
6.545ProGly: 6.545 ± 0.813
0.943ProHis: 0.943 ± 0.217
1.941ProIle: 1.941 ± 0.264
1.664ProLys: 1.664 ± 0.363
3.494ProLeu: 3.494 ± 0.463
1.442ProMet: 1.442 ± 0.206
1.553ProAsn: 1.553 ± 0.323
2.995ProPro: 2.995 ± 0.45
1.997ProGln: 1.997 ± 0.499
3.328ProArg: 3.328 ± 0.387
3.827ProSer: 3.827 ± 0.567
4.049ProThr: 4.049 ± 0.567
5.325ProVal: 5.325 ± 0.621
0.887ProTrp: 0.887 ± 0.232
1.387ProTyr: 1.387 ± 0.329
0.0ProXaa: 0.0 ± 0.0
Gln
3.938GlnAla: 3.938 ± 0.538
0.222GlnCys: 0.222 ± 0.124
1.553GlnAsp: 1.553 ± 0.311
1.941GlnGlu: 1.941 ± 0.276
0.721GlnPhe: 0.721 ± 0.176
3.55GlnGly: 3.55 ± 0.573
0.222GlnHis: 0.222 ± 0.1
1.609GlnIle: 1.609 ± 0.439
0.444GlnLys: 0.444 ± 0.204
2.219GlnLeu: 2.219 ± 0.443
0.61GlnMet: 0.61 ± 0.153
0.998GlnAsn: 0.998 ± 0.202
1.83GlnPro: 1.83 ± 0.359
0.277GlnGln: 0.277 ± 0.139
2.385GlnArg: 2.385 ± 0.376
1.331GlnSer: 1.331 ± 0.252
2.219GlnThr: 2.219 ± 0.346
2.829GlnVal: 2.829 ± 0.336
0.444GlnTrp: 0.444 ± 0.143
0.721GlnTyr: 0.721 ± 0.186
0.0GlnXaa: 0.0 ± 0.0
Arg
8.93ArgAla: 8.93 ± 0.946
0.832ArgCys: 0.832 ± 0.248
3.55ArgAsp: 3.55 ± 0.441
4.493ArgGlu: 4.493 ± 0.603
2.385ArgPhe: 2.385 ± 0.386
6.268ArgGly: 6.268 ± 0.727
1.719ArgHis: 1.719 ± 0.254
3.55ArgIle: 3.55 ± 0.454
2.441ArgLys: 2.441 ± 0.529
6.434ArgLeu: 6.434 ± 0.551
1.719ArgMet: 1.719 ± 0.301
1.498ArgAsn: 1.498 ± 0.275
5.491ArgPro: 5.491 ± 0.617
2.607ArgGln: 2.607 ± 0.398
6.933ArgArg: 6.933 ± 0.919
3.55ArgSer: 3.55 ± 0.385
4.715ArgThr: 4.715 ± 0.439
4.77ArgVal: 4.77 ± 0.469
1.886ArgTrp: 1.886 ± 0.316
1.664ArgTyr: 1.664 ± 0.249
0.0ArgXaa: 0.0 ± 0.0
Ser
7.155SerAla: 7.155 ± 0.875
0.222SerCys: 0.222 ± 0.133
3.439SerAsp: 3.439 ± 0.461
2.662SerGlu: 2.662 ± 0.415
1.775SerPhe: 1.775 ± 0.361
6.933SerGly: 6.933 ± 0.962
1.442SerHis: 1.442 ± 0.493
2.33SerIle: 2.33 ± 0.462
1.109SerLys: 1.109 ± 0.28
3.55SerLeu: 3.55 ± 0.49
1.331SerMet: 1.331 ± 0.336
1.442SerAsn: 1.442 ± 0.42
3.716SerPro: 3.716 ± 0.463
1.719SerGln: 1.719 ± 0.28
3.827SerArg: 3.827 ± 0.45
3.827SerSer: 3.827 ± 0.5
4.326SerThr: 4.326 ± 0.707
3.328SerVal: 3.328 ± 0.382
1.276SerTrp: 1.276 ± 0.248
0.943SerTyr: 0.943 ± 0.264
0.0SerXaa: 0.0 ± 0.0
Thr
7.432ThrAla: 7.432 ± 0.703
0.666ThrCys: 0.666 ± 0.202
3.328ThrAsp: 3.328 ± 0.425
4.77ThrGlu: 4.77 ± 0.61
1.83ThrPhe: 1.83 ± 0.27
9.374ThrGly: 9.374 ± 0.826
1.664ThrHis: 1.664 ± 0.591
3.328ThrIle: 3.328 ± 0.438
1.941ThrLys: 1.941 ± 0.414
4.826ThrLeu: 4.826 ± 0.514
1.054ThrMet: 1.054 ± 0.311
1.886ThrAsn: 1.886 ± 0.344
3.994ThrPro: 3.994 ± 0.458
2.163ThrGln: 2.163 ± 0.326
4.992ThrArg: 4.992 ± 0.531
3.938ThrSer: 3.938 ± 0.679
5.214ThrThr: 5.214 ± 0.848
7.1ThrVal: 7.1 ± 0.705
1.442ThrTrp: 1.442 ± 0.251
2.052ThrTyr: 2.052 ± 0.315
0.0ThrXaa: 0.0 ± 0.0
Val
8.875ValAla: 8.875 ± 0.673
0.444ValCys: 0.444 ± 0.165
5.824ValAsp: 5.824 ± 0.544
4.271ValGlu: 4.271 ± 0.449
1.719ValPhe: 1.719 ± 0.313
5.658ValGly: 5.658 ± 0.67
1.22ValHis: 1.22 ± 0.23
2.884ValIle: 2.884 ± 0.427
2.274ValLys: 2.274 ± 0.4
6.212ValLeu: 6.212 ± 0.486
1.22ValMet: 1.22 ± 0.267
1.775ValAsn: 1.775 ± 0.27
5.214ValPro: 5.214 ± 0.649
2.662ValGln: 2.662 ± 0.465
6.268ValArg: 6.268 ± 0.474
4.104ValSer: 4.104 ± 0.419
7.322ValThr: 7.322 ± 0.827
6.212ValVal: 6.212 ± 0.696
1.775ValTrp: 1.775 ± 0.325
1.886ValTyr: 1.886 ± 0.359
0.0ValXaa: 0.0 ± 0.0
Trp
2.662TrpAla: 2.662 ± 0.461
0.166TrpCys: 0.166 ± 0.102
1.22TrpAsp: 1.22 ± 0.269
1.054TrpGlu: 1.054 ± 0.229
0.887TrpPhe: 0.887 ± 0.23
1.054TrpGly: 1.054 ± 0.188
0.777TrpHis: 0.777 ± 0.236
0.61TrpIle: 0.61 ± 0.182
0.777TrpLys: 0.777 ± 0.191
1.941TrpLeu: 1.941 ± 0.295
0.444TrpMet: 0.444 ± 0.172
0.499TrpAsn: 0.499 ± 0.158
1.109TrpPro: 1.109 ± 0.244
0.832TrpGln: 0.832 ± 0.288
1.498TrpArg: 1.498 ± 0.34
1.22TrpSer: 1.22 ± 0.284
1.609TrpThr: 1.609 ± 0.272
1.331TrpVal: 1.331 ± 0.275
0.444TrpTrp: 0.444 ± 0.173
0.444TrpTyr: 0.444 ± 0.204
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.33TyrAla: 2.33 ± 0.316
0.111TyrCys: 0.111 ± 0.078
1.719TyrAsp: 1.719 ± 0.406
1.165TyrGlu: 1.165 ± 0.25
0.277TyrPhe: 0.277 ± 0.126
2.662TyrGly: 2.662 ± 0.404
0.277TyrHis: 0.277 ± 0.154
0.499TyrIle: 0.499 ± 0.173
0.499TyrLys: 0.499 ± 0.167
2.163TyrLeu: 2.163 ± 0.341
0.222TyrMet: 0.222 ± 0.09
0.499TyrAsn: 0.499 ± 0.175
1.276TyrPro: 1.276 ± 0.27
0.666TyrGln: 0.666 ± 0.17
1.498TyrArg: 1.498 ± 0.302
0.777TyrSer: 0.777 ± 0.175
1.442TyrThr: 1.442 ± 0.325
2.33TyrVal: 2.33 ± 0.34
0.333TyrTrp: 0.333 ± 0.112
0.166TyrTyr: 0.166 ± 0.087
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (18030 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski