Amino acid dipepetide frequency for Listeria phage P70

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.573AlaAla: 2.573 ± 0.715
0.643AlaCys: 0.643 ± 0.172
4.305AlaAsp: 4.305 ± 0.501
5.245AlaGlu: 5.245 ± 0.483
2.276AlaPhe: 2.276 ± 0.281
4.256AlaGly: 4.256 ± 0.651
0.693AlaHis: 0.693 ± 0.189
4.899AlaIle: 4.899 ± 0.502
5.146AlaLys: 5.146 ± 0.581
6.284AlaLeu: 6.284 ± 0.515
2.029AlaMet: 2.029 ± 0.312
4.206AlaAsn: 4.206 ± 0.585
1.831AlaPro: 1.831 ± 0.341
2.227AlaGln: 2.227 ± 0.336
2.573AlaArg: 2.573 ± 0.332
4.948AlaSer: 4.948 ± 0.551
4.849AlaThr: 4.849 ± 0.525
4.651AlaVal: 4.651 ± 0.448
0.792AlaTrp: 0.792 ± 0.168
2.177AlaTyr: 2.177 ± 0.266
0.0AlaXaa: 0.0 ± 0.0
Cys
0.544CysAla: 0.544 ± 0.185
0.099CysCys: 0.099 ± 0.072
0.495CysAsp: 0.495 ± 0.174
0.544CysGlu: 0.544 ± 0.166
0.346CysPhe: 0.346 ± 0.117
0.643CysGly: 0.643 ± 0.18
0.148CysHis: 0.148 ± 0.086
0.544CysIle: 0.544 ± 0.189
0.891CysLys: 0.891 ± 0.321
0.396CysLeu: 0.396 ± 0.132
0.247CysMet: 0.247 ± 0.107
0.346CysAsn: 0.346 ± 0.118
0.594CysPro: 0.594 ± 0.201
0.297CysGln: 0.297 ± 0.118
0.693CysArg: 0.693 ± 0.175
0.643CysSer: 0.643 ± 0.202
0.742CysThr: 0.742 ± 0.169
0.544CysVal: 0.544 ± 0.169
0.198CysTrp: 0.198 ± 0.115
0.297CysTyr: 0.297 ± 0.114
0.0CysXaa: 0.0 ± 0.0
Asp
3.959AspAla: 3.959 ± 0.503
0.495AspCys: 0.495 ± 0.15
3.464AspAsp: 3.464 ± 0.429
5.542AspGlu: 5.542 ± 0.572
2.771AspPhe: 2.771 ± 0.377
5.097AspGly: 5.097 ± 0.5
0.445AspHis: 0.445 ± 0.148
4.998AspIle: 4.998 ± 0.374
4.256AspLys: 4.256 ± 0.364
4.948AspLeu: 4.948 ± 0.573
1.831AspMet: 1.831 ± 0.354
3.167AspAsn: 3.167 ± 0.393
0.99AspPro: 0.99 ± 0.314
0.693AspGln: 0.693 ± 0.198
2.128AspArg: 2.128 ± 0.362
4.256AspSer: 4.256 ± 0.445
4.107AspThr: 4.107 ± 0.455
4.107AspVal: 4.107 ± 0.575
1.386AspTrp: 1.386 ± 0.295
2.722AspTyr: 2.722 ± 0.318
0.0AspXaa: 0.0 ± 0.0
Glu
4.651GluAla: 4.651 ± 0.479
0.396GluCys: 0.396 ± 0.129
5.047GluAsp: 5.047 ± 0.487
8.907GluGlu: 8.907 ± 0.997
4.008GluPhe: 4.008 ± 0.575
4.602GluGly: 4.602 ± 0.52
0.643GluHis: 0.643 ± 0.177
4.354GluIle: 4.354 ± 0.45
5.839GluLys: 5.839 ± 0.692
7.67GluLeu: 7.67 ± 0.715
2.87GluMet: 2.87 ± 0.326
3.761GluAsn: 3.761 ± 0.483
1.93GluPro: 1.93 ± 0.455
3.018GluGln: 3.018 ± 0.502
2.969GluArg: 2.969 ± 0.4
3.117GluSer: 3.117 ± 0.399
4.552GluThr: 4.552 ± 0.482
6.136GluVal: 6.136 ± 0.573
1.237GluTrp: 1.237 ± 0.261
3.959GluTyr: 3.959 ± 0.486
0.0GluXaa: 0.0 ± 0.0
Phe
2.177PheAla: 2.177 ± 0.319
0.297PheCys: 0.297 ± 0.13
2.375PheAsp: 2.375 ± 0.376
3.068PheGlu: 3.068 ± 0.428
1.237PhePhe: 1.237 ± 0.261
2.87PheGly: 2.87 ± 0.378
0.94PheHis: 0.94 ± 0.203
2.672PheIle: 2.672 ± 0.395
3.117PheLys: 3.117 ± 0.399
2.573PheLeu: 2.573 ± 0.359
1.435PheMet: 1.435 ± 0.311
1.979PheAsn: 1.979 ± 0.274
1.484PhePro: 1.484 ± 0.33
1.386PheGln: 1.386 ± 0.267
1.781PheArg: 1.781 ± 0.29
2.474PheSer: 2.474 ± 0.36
2.425PheThr: 2.425 ± 0.325
2.821PheVal: 2.821 ± 0.358
0.346PheTrp: 0.346 ± 0.108
1.781PheTyr: 1.781 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
3.315GlyAla: 3.315 ± 0.425
0.742GlyCys: 0.742 ± 0.202
3.414GlyAsp: 3.414 ± 0.494
4.75GlyGlu: 4.75 ± 0.502
2.672GlyPhe: 2.672 ± 0.322
3.909GlyGly: 3.909 ± 0.428
0.792GlyHis: 0.792 ± 0.16
4.256GlyIle: 4.256 ± 0.391
5.789GlyLys: 5.789 ± 0.499
4.8GlyLeu: 4.8 ± 0.58
2.029GlyMet: 2.029 ± 0.308
3.216GlyAsn: 3.216 ± 0.435
0.049GlyPro: 0.049 ± 0.05
1.386GlyGln: 1.386 ± 0.308
2.722GlyArg: 2.722 ± 0.411
3.216GlySer: 3.216 ± 0.411
4.899GlyThr: 4.899 ± 0.588
4.354GlyVal: 4.354 ± 0.521
1.039GlyTrp: 1.039 ± 0.361
2.375GlyTyr: 2.375 ± 0.351
0.0GlyXaa: 0.0 ± 0.0
His
0.594HisAla: 0.594 ± 0.171
0.297HisCys: 0.297 ± 0.141
1.188HisAsp: 1.188 ± 0.252
1.138HisGlu: 1.138 ± 0.191
0.198HisPhe: 0.198 ± 0.097
0.792HisGly: 0.792 ± 0.183
0.495HisHis: 0.495 ± 0.183
0.94HisIle: 0.94 ± 0.219
0.841HisLys: 0.841 ± 0.197
1.633HisLeu: 1.633 ± 0.287
0.396HisMet: 0.396 ± 0.116
0.841HisAsn: 0.841 ± 0.205
0.99HisPro: 0.99 ± 0.199
0.445HisGln: 0.445 ± 0.157
0.792HisArg: 0.792 ± 0.193
0.693HisSer: 0.693 ± 0.159
0.99HisThr: 0.99 ± 0.202
1.336HisVal: 1.336 ± 0.239
0.247HisTrp: 0.247 ± 0.098
0.792HisTyr: 0.792 ± 0.201
0.0HisXaa: 0.0 ± 0.0
Ile
5.047IleAla: 5.047 ± 0.554
0.693IleCys: 0.693 ± 0.166
4.651IleAsp: 4.651 ± 0.497
4.998IleGlu: 4.998 ± 0.471
2.276IlePhe: 2.276 ± 0.38
3.068IleGly: 3.068 ± 0.336
1.287IleHis: 1.287 ± 0.25
4.899IleIle: 4.899 ± 0.575
5.493IleLys: 5.493 ± 0.443
5.443IleLeu: 5.443 ± 0.567
1.287IleMet: 1.287 ± 0.232
4.849IleAsn: 4.849 ± 0.445
2.375IlePro: 2.375 ± 0.345
1.88IleGln: 1.88 ± 0.282
2.672IleArg: 2.672 ± 0.274
4.651IleSer: 4.651 ± 0.484
3.86IleThr: 3.86 ± 0.396
5.047IleVal: 5.047 ± 0.503
0.891IleTrp: 0.891 ± 0.216
2.87IleTyr: 2.87 ± 0.36
0.0IleXaa: 0.0 ± 0.0
Lys
5.542LysAla: 5.542 ± 0.544
0.544LysCys: 0.544 ± 0.243
5.047LysAsp: 5.047 ± 0.376
7.62LysGlu: 7.62 ± 0.774
2.969LysPhe: 2.969 ± 0.43
3.464LysGly: 3.464 ± 0.387
1.435LysHis: 1.435 ± 0.309
4.503LysIle: 4.503 ± 0.533
5.196LysLys: 5.196 ± 0.573
5.641LysLeu: 5.641 ± 0.598
2.524LysMet: 2.524 ± 0.326
3.959LysAsn: 3.959 ± 0.354
2.425LysPro: 2.425 ± 0.505
3.612LysGln: 3.612 ± 0.566
3.612LysArg: 3.612 ± 0.49
3.513LysSer: 3.513 ± 0.407
4.503LysThr: 4.503 ± 0.519
4.849LysVal: 4.849 ± 0.389
0.544LysTrp: 0.544 ± 0.142
3.563LysTyr: 3.563 ± 0.437
0.0LysXaa: 0.0 ± 0.0
Leu
6.136LeuAla: 6.136 ± 0.572
0.396LeuCys: 0.396 ± 0.14
5.196LeuAsp: 5.196 ± 0.504
7.323LeuGlu: 7.323 ± 0.654
2.573LeuPhe: 2.573 ± 0.407
3.81LeuGly: 3.81 ± 0.411
1.781LeuHis: 1.781 ± 0.391
4.998LeuIle: 4.998 ± 0.528
4.552LeuLys: 4.552 ± 0.398
5.888LeuLeu: 5.888 ± 0.534
1.484LeuMet: 1.484 ± 0.254
4.107LeuAsn: 4.107 ± 0.456
2.919LeuPro: 2.919 ± 0.349
3.068LeuGln: 3.068 ± 0.404
3.959LeuArg: 3.959 ± 0.443
4.503LeuSer: 4.503 ± 0.444
5.592LeuThr: 5.592 ± 0.454
5.394LeuVal: 5.394 ± 0.543
0.891LeuTrp: 0.891 ± 0.213
3.266LeuTyr: 3.266 ± 0.444
0.0LeuXaa: 0.0 ± 0.0
Met
2.474MetAla: 2.474 ± 0.393
0.346MetCys: 0.346 ± 0.134
1.583MetAsp: 1.583 ± 0.257
2.078MetGlu: 2.078 ± 0.333
1.039MetPhe: 1.039 ± 0.213
0.693MetGly: 0.693 ± 0.179
0.297MetHis: 0.297 ± 0.113
1.336MetIle: 1.336 ± 0.227
2.326MetLys: 2.326 ± 0.316
2.227MetLeu: 2.227 ± 0.291
0.495MetMet: 0.495 ± 0.145
1.484MetAsn: 1.484 ± 0.241
0.99MetPro: 0.99 ± 0.212
1.287MetGln: 1.287 ± 0.212
1.237MetArg: 1.237 ± 0.225
2.276MetSer: 2.276 ± 0.309
1.435MetThr: 1.435 ± 0.264
1.93MetVal: 1.93 ± 0.289
0.148MetTrp: 0.148 ± 0.081
0.99MetTyr: 0.99 ± 0.223
0.0MetXaa: 0.0 ± 0.0
Asn
4.008AsnAla: 4.008 ± 0.567
0.396AsnCys: 0.396 ± 0.145
3.018AsnAsp: 3.018 ± 0.378
2.821AsnGlu: 2.821 ± 0.391
2.177AsnPhe: 2.177 ± 0.341
4.107AsnGly: 4.107 ± 0.46
1.089AsnHis: 1.089 ± 0.2
4.107AsnIle: 4.107 ± 0.441
4.75AsnLys: 4.75 ± 0.51
3.761AsnLeu: 3.761 ± 0.545
1.979AsnMet: 1.979 ± 0.321
3.959AsnAsn: 3.959 ± 0.493
2.326AsnPro: 2.326 ± 0.427
2.227AsnGln: 2.227 ± 0.363
2.425AsnArg: 2.425 ± 0.294
3.612AsnSer: 3.612 ± 0.46
3.563AsnThr: 3.563 ± 0.353
3.117AsnVal: 3.117 ± 0.388
1.039AsnTrp: 1.039 ± 0.192
2.821AsnTyr: 2.821 ± 0.358
0.0AsnXaa: 0.0 ± 0.0
Pro
2.771ProAla: 2.771 ± 0.483
0.049ProCys: 0.049 ± 0.046
2.227ProAsp: 2.227 ± 0.457
1.682ProGlu: 1.682 ± 0.346
1.039ProPhe: 1.039 ± 0.214
0.445ProGly: 0.445 ± 0.125
0.445ProHis: 0.445 ± 0.138
2.425ProIle: 2.425 ± 0.342
2.276ProLys: 2.276 ± 0.322
1.633ProLeu: 1.633 ± 0.324
0.594ProMet: 0.594 ± 0.136
1.534ProAsn: 1.534 ± 0.307
1.386ProPro: 1.386 ± 0.347
1.237ProGln: 1.237 ± 0.271
1.386ProArg: 1.386 ± 0.26
2.672ProSer: 2.672 ± 0.401
2.425ProThr: 2.425 ± 0.311
2.474ProVal: 2.474 ± 0.363
0.297ProTrp: 0.297 ± 0.133
1.534ProTyr: 1.534 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
2.524GlnAla: 2.524 ± 0.419
0.198GlnCys: 0.198 ± 0.104
1.534GlnAsp: 1.534 ± 0.238
3.365GlnGlu: 3.365 ± 0.512
1.138GlnPhe: 1.138 ± 0.219
2.474GlnGly: 2.474 ± 0.333
0.396GlnHis: 0.396 ± 0.151
2.425GlnIle: 2.425 ± 0.362
2.623GlnLys: 2.623 ± 0.286
2.969GlnLeu: 2.969 ± 0.4
0.396GlnMet: 0.396 ± 0.115
2.474GlnAsn: 2.474 ± 0.355
0.693GlnPro: 0.693 ± 0.142
1.237GlnGln: 1.237 ± 0.302
1.88GlnArg: 1.88 ± 0.289
1.732GlnSer: 1.732 ± 0.415
1.682GlnThr: 1.682 ± 0.277
2.425GlnVal: 2.425 ± 0.293
0.198GlnTrp: 0.198 ± 0.093
1.435GlnTyr: 1.435 ± 0.221
0.0GlnXaa: 0.0 ± 0.0
Arg
2.821ArgAla: 2.821 ± 0.351
0.841ArgCys: 0.841 ± 0.245
2.276ArgAsp: 2.276 ± 0.316
3.959ArgGlu: 3.959 ± 0.451
1.732ArgPhe: 1.732 ± 0.333
2.969ArgGly: 2.969 ± 0.451
0.742ArgHis: 0.742 ± 0.181
3.315ArgIle: 3.315 ± 0.386
2.969ArgLys: 2.969 ± 0.401
3.959ArgLeu: 3.959 ± 0.533
0.99ArgMet: 0.99 ± 0.266
1.633ArgAsn: 1.633 ± 0.264
1.089ArgPro: 1.089 ± 0.231
1.188ArgGln: 1.188 ± 0.205
1.781ArgArg: 1.781 ± 0.319
2.029ArgSer: 2.029 ± 0.248
2.177ArgThr: 2.177 ± 0.262
3.365ArgVal: 3.365 ± 0.422
0.544ArgTrp: 0.544 ± 0.158
1.979ArgTyr: 1.979 ± 0.307
0.0ArgXaa: 0.0 ± 0.0
Ser
4.651SerAla: 4.651 ± 0.765
0.346SerCys: 0.346 ± 0.129
3.711SerAsp: 3.711 ± 0.416
3.612SerGlu: 3.612 ± 0.486
3.167SerPhe: 3.167 ± 0.375
3.86SerGly: 3.86 ± 0.49
0.891SerHis: 0.891 ± 0.206
4.651SerIle: 4.651 ± 0.458
4.701SerLys: 4.701 ± 0.555
5.097SerLeu: 5.097 ± 0.405
1.583SerMet: 1.583 ± 0.28
4.058SerAsn: 4.058 ± 0.482
2.177SerPro: 2.177 ± 0.288
2.524SerGln: 2.524 ± 0.375
2.227SerArg: 2.227 ± 0.322
4.305SerSer: 4.305 ± 0.425
4.206SerThr: 4.206 ± 0.453
4.453SerVal: 4.453 ± 0.481
0.594SerTrp: 0.594 ± 0.162
2.474SerTyr: 2.474 ± 0.32
0.0SerXaa: 0.0 ± 0.0
Thr
4.651ThrAla: 4.651 ± 0.596
0.643ThrCys: 0.643 ± 0.206
3.81ThrAsp: 3.81 ± 0.377
3.563ThrGlu: 3.563 ± 0.427
3.167ThrPhe: 3.167 ± 0.364
5.097ThrGly: 5.097 ± 0.464
1.336ThrHis: 1.336 ± 0.258
4.552ThrIle: 4.552 ± 0.498
4.503ThrLys: 4.503 ± 0.505
4.602ThrLeu: 4.602 ± 0.454
1.386ThrMet: 1.386 ± 0.253
4.651ThrAsn: 4.651 ± 0.467
2.276ThrPro: 2.276 ± 0.369
1.93ThrGln: 1.93 ± 0.332
2.474ThrArg: 2.474 ± 0.386
4.998ThrSer: 4.998 ± 0.458
4.651ThrThr: 4.651 ± 0.57
4.354ThrVal: 4.354 ± 0.483
1.138ThrTrp: 1.138 ± 0.217
2.474ThrTyr: 2.474 ± 0.35
0.0ThrXaa: 0.0 ± 0.0
Val
4.75ValAla: 4.75 ± 0.451
0.891ValCys: 0.891 ± 0.199
4.206ValAsp: 4.206 ± 0.48
5.097ValGlu: 5.097 ± 0.664
2.276ValPhe: 2.276 ± 0.3
3.761ValGly: 3.761 ± 0.464
1.138ValHis: 1.138 ± 0.229
4.354ValIle: 4.354 ± 0.473
6.136ValLys: 6.136 ± 0.516
4.998ValLeu: 4.998 ± 0.478
1.484ValMet: 1.484 ± 0.307
3.761ValAsn: 3.761 ± 0.398
2.672ValPro: 2.672 ± 0.396
2.029ValGln: 2.029 ± 0.257
2.623ValArg: 2.623 ± 0.406
5.443ValSer: 5.443 ± 0.6
5.592ValThr: 5.592 ± 0.567
4.998ValVal: 4.998 ± 0.53
1.633ValTrp: 1.633 ± 0.271
2.672ValTyr: 2.672 ± 0.323
0.0ValXaa: 0.0 ± 0.0
Trp
0.346TrpAla: 0.346 ± 0.164
0.346TrpCys: 0.346 ± 0.136
0.99TrpAsp: 0.99 ± 0.196
1.089TrpGlu: 1.089 ± 0.24
0.495TrpPhe: 0.495 ± 0.179
0.891TrpGly: 0.891 ± 0.188
0.148TrpHis: 0.148 ± 0.08
0.94TrpIle: 0.94 ± 0.232
1.089TrpLys: 1.089 ± 0.26
0.792TrpLeu: 0.792 ± 0.226
0.148TrpMet: 0.148 ± 0.082
1.089TrpAsn: 1.089 ± 0.249
0.0TrpPro: 0.0 ± 0.0
0.495TrpGln: 0.495 ± 0.159
0.792TrpArg: 0.792 ± 0.189
1.188TrpSer: 1.188 ± 0.262
1.089TrpThr: 1.089 ± 0.258
0.792TrpVal: 0.792 ± 0.208
0.297TrpTrp: 0.297 ± 0.119
0.841TrpTyr: 0.841 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.068TyrAla: 3.068 ± 0.408
0.594TyrCys: 0.594 ± 0.171
2.821TyrAsp: 2.821 ± 0.361
3.266TyrGlu: 3.266 ± 0.35
1.682TyrPhe: 1.682 ± 0.272
2.919TyrGly: 2.919 ± 0.33
0.495TyrHis: 0.495 ± 0.169
3.068TyrIle: 3.068 ± 0.38
2.573TyrLys: 2.573 ± 0.288
2.573TyrLeu: 2.573 ± 0.41
1.435TyrMet: 1.435 ± 0.211
2.227TyrAsn: 2.227 ± 0.357
1.336TyrPro: 1.336 ± 0.247
1.633TyrGln: 1.633 ± 0.329
1.633TyrArg: 1.633 ± 0.294
3.018TyrSer: 3.018 ± 0.44
2.919TyrThr: 2.919 ± 0.326
3.365TyrVal: 3.365 ± 0.358
0.396TyrTrp: 0.396 ± 0.125
1.781TyrTyr: 1.781 ± 0.277
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 119 proteins (20210 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski