Amino acid dipepetide frequency for Bacteriophage T5-like cott162

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.809AlaAla: 8.809 ± 1.647
0.639AlaCys: 0.639 ± 0.144
4.371AlaAsp: 4.371 ± 0.401
5.178AlaGlu: 5.178 ± 0.438
2.959AlaPhe: 2.959 ± 0.295
5.346AlaGly: 5.346 ± 0.583
1.311AlaHis: 1.311 ± 0.19
5.043AlaIle: 5.043 ± 0.458
6.926AlaLys: 6.926 ± 0.826
6.691AlaLeu: 6.691 ± 0.549
2.286AlaMet: 2.286 ± 0.331
3.295AlaAsn: 3.295 ± 0.306
2.421AlaPro: 2.421 ± 0.322
3.564AlaGln: 3.564 ± 0.422
3.429AlaArg: 3.429 ± 0.333
4.976AlaSer: 4.976 ± 0.748
4.674AlaThr: 4.674 ± 0.714
4.505AlaVal: 4.505 ± 0.426
0.874AlaTrp: 0.874 ± 0.178
2.387AlaTyr: 2.387 ± 0.231
0.0AlaXaa: 0.0 ± 0.0
Cys
0.572CysAla: 0.572 ± 0.141
0.168CysCys: 0.168 ± 0.082
0.572CysAsp: 0.572 ± 0.131
0.773CysGlu: 0.773 ± 0.146
0.538CysPhe: 0.538 ± 0.172
0.908CysGly: 0.908 ± 0.227
0.202CysHis: 0.202 ± 0.082
0.706CysIle: 0.706 ± 0.145
0.74CysLys: 0.74 ± 0.152
0.841CysLeu: 0.841 ± 0.191
0.202CysMet: 0.202 ± 0.08
0.504CysAsn: 0.504 ± 0.13
0.437CysPro: 0.437 ± 0.142
0.37CysGln: 0.37 ± 0.133
0.437CysArg: 0.437 ± 0.126
0.841CysSer: 0.841 ± 0.203
0.639CysThr: 0.639 ± 0.131
0.504CysVal: 0.504 ± 0.123
0.034CysTrp: 0.034 ± 0.034
0.303CysTyr: 0.303 ± 0.102
0.0CysXaa: 0.0 ± 0.0
Asp
4.774AspAla: 4.774 ± 0.414
0.471AspCys: 0.471 ± 0.126
3.228AspAsp: 3.228 ± 0.455
4.842AspGlu: 4.842 ± 0.469
2.925AspPhe: 2.925 ± 0.346
3.564AspGly: 3.564 ± 0.409
1.076AspHis: 1.076 ± 0.19
4.405AspIle: 4.405 ± 0.455
4.64AspLys: 4.64 ± 0.408
5.649AspLeu: 5.649 ± 0.564
2.286AspMet: 2.286 ± 0.259
2.623AspAsn: 2.623 ± 0.298
2.791AspPro: 2.791 ± 0.28
1.479AspGln: 1.479 ± 0.222
2.723AspArg: 2.723 ± 0.293
3.833AspSer: 3.833 ± 0.374
3.732AspThr: 3.732 ± 0.304
3.867AspVal: 3.867 ± 0.359
1.009AspTrp: 1.009 ± 0.175
2.992AspTyr: 2.992 ± 0.339
0.0AspXaa: 0.0 ± 0.0
Glu
6.052GluAla: 6.052 ± 0.444
0.841GluCys: 0.841 ± 0.208
3.967GluAsp: 3.967 ± 0.434
5.548GluGlu: 5.548 ± 0.51
2.959GluPhe: 2.959 ± 0.358
3.665GluGly: 3.665 ± 0.381
1.648GluHis: 1.648 ± 0.237
5.447GluIle: 5.447 ± 0.429
4.707GluLys: 4.707 ± 0.481
7.565GluLeu: 7.565 ± 0.464
2.118GluMet: 2.118 ± 0.265
2.387GluAsn: 2.387 ± 0.297
1.715GluPro: 1.715 ± 0.27
3.127GluGln: 3.127 ± 0.386
2.992GluArg: 2.992 ± 0.365
3.261GluSer: 3.261 ± 0.305
4.505GluThr: 4.505 ± 0.657
4.774GluVal: 4.774 ± 0.411
1.042GluTrp: 1.042 ± 0.196
2.757GluTyr: 2.757 ± 0.336
0.0GluXaa: 0.0 ± 0.0
Phe
2.757PheAla: 2.757 ± 0.316
0.437PheCys: 0.437 ± 0.138
2.824PheAsp: 2.824 ± 0.353
2.892PheGlu: 2.892 ± 0.343
1.681PhePhe: 1.681 ± 0.288
2.925PheGly: 2.925 ± 0.347
1.143PheHis: 1.143 ± 0.207
3.127PheIle: 3.127 ± 0.319
3.161PheLys: 3.161 ± 0.327
2.959PheLeu: 2.959 ± 0.273
0.941PheMet: 0.941 ± 0.173
2.892PheAsn: 2.892 ± 0.291
1.95PhePro: 1.95 ± 0.285
1.143PheGln: 1.143 ± 0.185
1.849PheArg: 1.849 ± 0.305
2.723PheSer: 2.723 ± 0.305
2.555PheThr: 2.555 ± 0.313
2.488PheVal: 2.488 ± 0.228
0.471PheTrp: 0.471 ± 0.124
1.648PheTyr: 1.648 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
4.068GlyAla: 4.068 ± 0.448
1.009GlyCys: 1.009 ± 0.255
3.967GlyAsp: 3.967 ± 0.403
4.909GlyGlu: 4.909 ± 0.412
3.093GlyPhe: 3.093 ± 0.275
3.396GlyGly: 3.396 ± 0.383
1.311GlyHis: 1.311 ± 0.228
4.505GlyIle: 4.505 ± 0.458
5.043GlyLys: 5.043 ± 0.462
4.405GlyLeu: 4.405 ± 0.377
2.051GlyMet: 2.051 ± 0.292
3.261GlyAsn: 3.261 ± 0.428
0.908GlyPro: 0.908 ± 0.173
2.555GlyGln: 2.555 ± 0.299
2.723GlyArg: 2.723 ± 0.336
3.967GlySer: 3.967 ± 0.442
3.598GlyThr: 3.598 ± 0.387
5.178GlyVal: 5.178 ± 0.474
0.874GlyTrp: 0.874 ± 0.194
3.161GlyTyr: 3.161 ± 0.322
0.0GlyXaa: 0.0 ± 0.0
His
1.177HisAla: 1.177 ± 0.197
0.202HisCys: 0.202 ± 0.08
1.547HisAsp: 1.547 ± 0.284
1.042HisGlu: 1.042 ± 0.201
0.773HisPhe: 0.773 ± 0.178
1.278HisGly: 1.278 ± 0.18
0.672HisHis: 0.672 ± 0.152
1.446HisIle: 1.446 ± 0.228
1.177HisLys: 1.177 ± 0.205
1.547HisLeu: 1.547 ± 0.242
0.235HisMet: 0.235 ± 0.094
0.841HisAsn: 0.841 ± 0.201
0.807HisPro: 0.807 ± 0.159
0.437HisGln: 0.437 ± 0.137
0.908HisArg: 0.908 ± 0.194
1.446HisSer: 1.446 ± 0.235
0.807HisThr: 0.807 ± 0.159
0.807HisVal: 0.807 ± 0.151
0.269HisTrp: 0.269 ± 0.085
0.706HisTyr: 0.706 ± 0.13
0.0HisXaa: 0.0 ± 0.0
Ile
5.01IleAla: 5.01 ± 0.449
0.773IleCys: 0.773 ± 0.196
4.976IleAsp: 4.976 ± 0.468
4.573IleGlu: 4.573 ± 0.356
2.219IlePhe: 2.219 ± 0.278
4.136IleGly: 4.136 ± 0.413
1.076IleHis: 1.076 ± 0.164
3.766IleIle: 3.766 ± 0.35
4.842IleLys: 4.842 ± 0.398
5.043IleLeu: 5.043 ± 0.446
2.017IleMet: 2.017 ± 0.32
4.001IleAsn: 4.001 ± 0.36
2.354IlePro: 2.354 ± 0.297
1.984IleGln: 1.984 ± 0.251
2.69IleArg: 2.69 ± 0.229
4.236IleSer: 4.236 ± 0.379
4.472IleThr: 4.472 ± 0.395
3.934IleVal: 3.934 ± 0.386
0.706IleTrp: 0.706 ± 0.174
2.421IleTyr: 2.421 ± 0.29
0.0IleXaa: 0.0 ± 0.0
Lys
6.489LysAla: 6.489 ± 0.571
0.437LysCys: 0.437 ± 0.139
5.01LysAsp: 5.01 ± 0.421
5.178LysGlu: 5.178 ± 0.474
3.329LysPhe: 3.329 ± 0.354
3.362LysGly: 3.362 ± 0.347
1.042LysHis: 1.042 ± 0.221
3.497LysIle: 3.497 ± 0.368
4.371LysLys: 4.371 ± 0.42
6.724LysLeu: 6.724 ± 0.593
2.723LysMet: 2.723 ± 0.349
3.799LysAsn: 3.799 ± 0.346
2.286LysPro: 2.286 ± 0.376
3.228LysGln: 3.228 ± 0.325
3.396LysArg: 3.396 ± 0.283
4.203LysSer: 4.203 ± 0.409
4.539LysThr: 4.539 ± 0.613
5.043LysVal: 5.043 ± 0.465
0.773LysTrp: 0.773 ± 0.185
3.228LysTyr: 3.228 ± 0.282
0.0LysXaa: 0.0 ± 0.0
Leu
7.431LeuAla: 7.431 ± 0.586
0.706LeuCys: 0.706 ± 0.143
6.456LeuAsp: 6.456 ± 0.512
6.96LeuGlu: 6.96 ± 0.519
2.959LeuPhe: 2.959 ± 0.385
5.514LeuGly: 5.514 ± 0.453
1.58LeuHis: 1.58 ± 0.244
4.573LeuIle: 4.573 ± 0.41
6.254LeuLys: 6.254 ± 0.446
5.817LeuLeu: 5.817 ± 0.524
2.118LeuMet: 2.118 ± 0.372
4.875LeuAsn: 4.875 ± 0.472
3.261LeuPro: 3.261 ± 0.294
2.992LeuGln: 2.992 ± 0.38
3.799LeuArg: 3.799 ± 0.305
4.808LeuSer: 4.808 ± 0.476
4.236LeuThr: 4.236 ± 0.383
4.976LeuVal: 4.976 ± 0.439
0.672LeuTrp: 0.672 ± 0.159
2.522LeuTyr: 2.522 ± 0.276
0.0LeuXaa: 0.0 ± 0.0
Met
1.816MetAla: 1.816 ± 0.249
0.336MetCys: 0.336 ± 0.11
1.278MetAsp: 1.278 ± 0.227
2.354MetGlu: 2.354 ± 0.363
0.874MetPhe: 0.874 ± 0.214
1.614MetGly: 1.614 ± 0.223
0.572MetHis: 0.572 ± 0.154
2.051MetIle: 2.051 ± 0.29
2.555MetLys: 2.555 ± 0.353
2.421MetLeu: 2.421 ± 0.337
0.639MetMet: 0.639 ± 0.176
1.177MetAsn: 1.177 ± 0.183
0.841MetPro: 0.841 ± 0.151
1.345MetGln: 1.345 ± 0.246
1.009MetArg: 1.009 ± 0.172
2.152MetSer: 2.152 ± 0.273
1.816MetThr: 1.816 ± 0.241
1.311MetVal: 1.311 ± 0.241
0.303MetTrp: 0.303 ± 0.106
1.11MetTyr: 1.11 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
4.472AsnAla: 4.472 ± 1.012
0.572AsnCys: 0.572 ± 0.169
2.69AsnAsp: 2.69 ± 0.32
2.421AsnGlu: 2.421 ± 0.282
2.152AsnPhe: 2.152 ± 0.333
4.438AsnGly: 4.438 ± 0.481
0.941AsnHis: 0.941 ± 0.235
3.53AsnIle: 3.53 ± 0.344
3.564AsnLys: 3.564 ± 0.392
4.371AsnLeu: 4.371 ± 0.334
1.11AsnMet: 1.11 ± 0.167
2.286AsnAsn: 2.286 ± 0.347
2.555AsnPro: 2.555 ± 0.316
1.379AsnGln: 1.379 ± 0.21
2.253AsnArg: 2.253 ± 0.323
3.497AsnSer: 3.497 ± 0.4
2.959AsnThr: 2.959 ± 0.313
3.698AsnVal: 3.698 ± 0.353
0.639AsnTrp: 0.639 ± 0.15
1.681AsnTyr: 1.681 ± 0.265
0.0AsnXaa: 0.0 ± 0.0
Pro
2.387ProAla: 2.387 ± 0.313
0.336ProCys: 0.336 ± 0.106
2.085ProAsp: 2.085 ± 0.292
3.497ProGlu: 3.497 ± 0.34
1.58ProPhe: 1.58 ± 0.229
2.017ProGly: 2.017 ± 0.299
0.437ProHis: 0.437 ± 0.138
1.816ProIle: 1.816 ± 0.324
2.051ProLys: 2.051 ± 0.259
2.118ProLeu: 2.118 ± 0.244
0.605ProMet: 0.605 ± 0.131
2.253ProAsn: 2.253 ± 0.341
1.311ProPro: 1.311 ± 0.269
1.042ProGln: 1.042 ± 0.171
1.648ProArg: 1.648 ± 0.263
1.916ProSer: 1.916 ± 0.263
1.916ProThr: 1.916 ± 0.196
2.623ProVal: 2.623 ± 0.3
0.538ProTrp: 0.538 ± 0.132
1.58ProTyr: 1.58 ± 0.199
0.0ProXaa: 0.0 ± 0.0
Gln
3.362GlnAla: 3.362 ± 0.569
0.572GlnCys: 0.572 ± 0.147
2.185GlnAsp: 2.185 ± 0.301
2.69GlnGlu: 2.69 ± 0.315
1.748GlnPhe: 1.748 ± 0.217
1.849GlnGly: 1.849 ± 0.192
0.471GlnHis: 0.471 ± 0.138
1.95GlnIle: 1.95 ± 0.252
2.892GlnLys: 2.892 ± 0.371
3.463GlnLeu: 3.463 ± 0.36
0.807GlnMet: 0.807 ± 0.174
1.681GlnAsn: 1.681 ± 0.181
0.504GlnPro: 0.504 ± 0.126
1.849GlnGln: 1.849 ± 0.28
1.547GlnArg: 1.547 ± 0.241
2.253GlnSer: 2.253 ± 0.315
1.95GlnThr: 1.95 ± 0.33
3.093GlnVal: 3.093 ± 0.334
0.504GlnTrp: 0.504 ± 0.15
1.244GlnTyr: 1.244 ± 0.202
0.0GlnXaa: 0.0 ± 0.0
Arg
3.161ArgAla: 3.161 ± 0.344
0.134ArgCys: 0.134 ± 0.076
2.892ArgAsp: 2.892 ± 0.272
2.959ArgGlu: 2.959 ± 0.317
1.816ArgPhe: 1.816 ± 0.253
3.429ArgGly: 3.429 ± 0.346
0.605ArgHis: 0.605 ± 0.164
2.925ArgIle: 2.925 ± 0.306
2.824ArgLys: 2.824 ± 0.396
3.698ArgLeu: 3.698 ± 0.321
1.513ArgMet: 1.513 ± 0.21
2.32ArgAsn: 2.32 ± 0.271
1.177ArgPro: 1.177 ± 0.18
1.547ArgGln: 1.547 ± 0.231
2.421ArgArg: 2.421 ± 0.353
2.286ArgSer: 2.286 ± 0.252
2.723ArgThr: 2.723 ± 0.295
3.127ArgVal: 3.127 ± 0.31
0.538ArgTrp: 0.538 ± 0.159
1.681ArgTyr: 1.681 ± 0.3
0.0ArgXaa: 0.0 ± 0.0
Ser
4.573SerAla: 4.573 ± 0.807
0.672SerCys: 0.672 ± 0.163
3.093SerAsp: 3.093 ± 0.308
4.27SerGlu: 4.27 ± 0.823
3.06SerPhe: 3.06 ± 0.243
4.707SerGly: 4.707 ± 0.41
0.706SerHis: 0.706 ± 0.172
4.808SerIle: 4.808 ± 0.481
4.438SerLys: 4.438 ± 0.416
5.649SerLeu: 5.649 ± 0.422
1.479SerMet: 1.479 ± 0.214
3.396SerAsn: 3.396 ± 0.358
1.916SerPro: 1.916 ± 0.269
1.816SerGln: 1.816 ± 0.238
2.623SerArg: 2.623 ± 0.328
4.405SerSer: 4.405 ± 0.349
3.698SerThr: 3.698 ± 0.355
3.934SerVal: 3.934 ± 0.383
1.177SerTrp: 1.177 ± 0.216
2.421SerTyr: 2.421 ± 0.272
0.0SerXaa: 0.0 ± 0.0
Thr
4.909ThrAla: 4.909 ± 0.638
0.504ThrCys: 0.504 ± 0.13
3.295ThrAsp: 3.295 ± 0.287
3.228ThrGlu: 3.228 ± 0.363
2.555ThrPhe: 2.555 ± 0.253
4.707ThrGly: 4.707 ± 0.465
1.076ThrHis: 1.076 ± 0.175
4.102ThrIle: 4.102 ± 0.387
4.001ThrLys: 4.001 ± 0.417
4.405ThrLeu: 4.405 ± 0.361
1.412ThrMet: 1.412 ± 0.195
3.665ThrAsn: 3.665 ± 0.661
2.421ThrPro: 2.421 ± 0.248
2.286ThrGln: 2.286 ± 0.319
2.387ThrArg: 2.387 ± 0.263
4.472ThrSer: 4.472 ± 0.857
2.959ThrThr: 2.959 ± 0.347
4.236ThrVal: 4.236 ± 0.387
0.672ThrTrp: 0.672 ± 0.14
2.017ThrTyr: 2.017 ± 0.259
0.0ThrXaa: 0.0 ± 0.0
Val
4.808ValAla: 4.808 ± 0.411
0.672ValCys: 0.672 ± 0.172
4.438ValAsp: 4.438 ± 0.419
4.674ValGlu: 4.674 ± 0.377
2.69ValPhe: 2.69 ± 0.272
4.169ValGly: 4.169 ± 0.399
1.009ValHis: 1.009 ± 0.203
4.371ValIle: 4.371 ± 0.434
4.774ValLys: 4.774 ± 0.35
5.01ValLeu: 5.01 ± 0.331
1.547ValMet: 1.547 ± 0.189
3.026ValAsn: 3.026 ± 0.303
2.623ValPro: 2.623 ± 0.299
2.421ValGln: 2.421 ± 0.298
2.791ValArg: 2.791 ± 0.311
4.337ValSer: 4.337 ± 0.419
4.27ValThr: 4.27 ± 0.438
4.203ValVal: 4.203 ± 0.48
0.639ValTrp: 0.639 ± 0.174
2.723ValTyr: 2.723 ± 0.324
0.0ValXaa: 0.0 ± 0.0
Trp
0.403TrpAla: 0.403 ± 0.129
0.202TrpCys: 0.202 ± 0.073
1.244TrpAsp: 1.244 ± 0.21
0.941TrpGlu: 0.941 ± 0.192
0.572TrpPhe: 0.572 ± 0.147
0.706TrpGly: 0.706 ± 0.142
0.134TrpHis: 0.134 ± 0.062
0.572TrpIle: 0.572 ± 0.165
1.042TrpLys: 1.042 ± 0.178
1.345TrpLeu: 1.345 ± 0.262
0.336TrpMet: 0.336 ± 0.117
0.605TrpAsn: 0.605 ± 0.143
0.269TrpPro: 0.269 ± 0.1
0.74TrpGln: 0.74 ± 0.192
0.572TrpArg: 0.572 ± 0.13
0.807TrpSer: 0.807 ± 0.199
0.672TrpThr: 0.672 ± 0.148
0.706TrpVal: 0.706 ± 0.148
0.235TrpTrp: 0.235 ± 0.111
0.303TrpTyr: 0.303 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.656TyrAla: 2.656 ± 0.293
0.605TyrCys: 0.605 ± 0.128
2.791TyrAsp: 2.791 ± 0.293
2.152TyrGlu: 2.152 ± 0.273
2.085TyrPhe: 2.085 ± 0.296
2.253TyrGly: 2.253 ± 0.316
1.11TyrHis: 1.11 ± 0.161
2.589TyrIle: 2.589 ± 0.292
2.656TyrLys: 2.656 ± 0.268
2.791TyrLeu: 2.791 ± 0.323
1.143TyrMet: 1.143 ± 0.197
2.32TyrAsn: 2.32 ± 0.219
1.21TyrPro: 1.21 ± 0.186
1.311TyrGln: 1.311 ± 0.184
1.58TyrArg: 1.58 ± 0.239
2.421TyrSer: 2.421 ± 0.26
2.589TyrThr: 2.589 ± 0.325
2.185TyrVal: 2.185 ± 0.277
0.437TyrTrp: 0.437 ± 0.117
1.412TyrTyr: 1.412 ± 0.258
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 153 proteins (29743 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski