Amino acid dipepetide frequency for Microbacterium phage Leaf

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.158AlaAla: 13.158 ± 1.04
0.396AlaCys: 0.396 ± 0.096
5.111AlaAsp: 5.111 ± 0.416
8.772AlaGlu: 8.772 ± 0.704
3.693AlaPhe: 3.693 ± 0.376
7.948AlaGly: 7.948 ± 0.782
1.946AlaHis: 1.946 ± 0.237
4.584AlaIle: 4.584 ± 0.391
3.825AlaLys: 3.825 ± 0.377
9.267AlaLeu: 9.267 ± 0.51
2.341AlaMet: 2.341 ± 0.27
2.506AlaAsn: 2.506 ± 0.28
5.474AlaPro: 5.474 ± 0.433
5.046AlaGln: 5.046 ± 0.417
5.936AlaArg: 5.936 ± 0.466
5.606AlaSer: 5.606 ± 0.515
6.529AlaThr: 6.529 ± 0.509
7.09AlaVal: 7.09 ± 0.544
2.473AlaTrp: 2.473 ± 0.262
2.473AlaTyr: 2.473 ± 0.269
0.0AlaXaa: 0.0 ± 0.0
Cys
0.791CysAla: 0.791 ± 0.18
0.099CysCys: 0.099 ± 0.053
0.594CysAsp: 0.594 ± 0.15
0.66CysGlu: 0.66 ± 0.152
0.33CysPhe: 0.33 ± 0.094
1.22CysGly: 1.22 ± 0.24
0.198CysHis: 0.198 ± 0.068
0.297CysIle: 0.297 ± 0.083
0.264CysLys: 0.264 ± 0.099
0.495CysLeu: 0.495 ± 0.127
0.165CysMet: 0.165 ± 0.066
0.198CysAsn: 0.198 ± 0.088
0.66CysPro: 0.66 ± 0.162
0.264CysGln: 0.264 ± 0.11
0.231CysArg: 0.231 ± 0.092
0.33CysSer: 0.33 ± 0.109
0.594CysThr: 0.594 ± 0.185
0.429CysVal: 0.429 ± 0.11
0.099CysTrp: 0.099 ± 0.053
0.132CysTyr: 0.132 ± 0.068
0.0CysXaa: 0.0 ± 0.0
Asp
5.672AspAla: 5.672 ± 0.418
0.758AspCys: 0.758 ± 0.169
5.144AspAsp: 5.144 ± 0.537
5.078AspGlu: 5.078 ± 0.522
1.979AspPhe: 1.979 ± 0.216
6.826AspGly: 6.826 ± 0.438
1.385AspHis: 1.385 ± 0.254
2.638AspIle: 2.638 ± 0.3
1.715AspLys: 1.715 ± 0.238
4.716AspLeu: 4.716 ± 0.423
1.385AspMet: 1.385 ± 0.177
1.484AspAsn: 1.484 ± 0.192
4.287AspPro: 4.287 ± 0.362
1.814AspGln: 1.814 ± 0.239
4.188AspArg: 4.188 ± 0.419
2.836AspSer: 2.836 ± 0.363
3.364AspThr: 3.364 ± 0.332
4.353AspVal: 4.353 ± 0.439
1.616AspTrp: 1.616 ± 0.249
1.913AspTyr: 1.913 ± 0.245
0.0AspXaa: 0.0 ± 0.0
Glu
8.409GluAla: 8.409 ± 0.634
0.594GluCys: 0.594 ± 0.157
4.485GluAsp: 4.485 ± 0.33
6.661GluGlu: 6.661 ± 0.717
2.012GluPhe: 2.012 ± 0.269
5.375GluGly: 5.375 ± 0.388
1.451GluHis: 1.451 ± 0.261
2.869GluIle: 2.869 ± 0.299
4.089GluLys: 4.089 ± 0.429
6.431GluLeu: 6.431 ± 0.478
2.308GluMet: 2.308 ± 0.269
2.473GluAsn: 2.473 ± 0.339
3.693GluPro: 3.693 ± 0.375
3.792GluGln: 3.792 ± 0.396
5.903GluArg: 5.903 ± 0.565
3.496GluSer: 3.496 ± 0.389
3.891GluThr: 3.891 ± 0.409
5.837GluVal: 5.837 ± 0.489
1.55GluTrp: 1.55 ± 0.227
1.979GluTyr: 1.979 ± 0.22
0.0GluXaa: 0.0 ± 0.0
Phe
2.737PheAla: 2.737 ± 0.348
0.231PheCys: 0.231 ± 0.071
2.045PheAsp: 2.045 ± 0.255
2.275PheGlu: 2.275 ± 0.273
0.791PhePhe: 0.791 ± 0.161
3.001PheGly: 3.001 ± 0.356
0.396PheHis: 0.396 ± 0.117
1.451PheIle: 1.451 ± 0.286
0.627PheLys: 0.627 ± 0.134
2.374PheLeu: 2.374 ± 0.32
0.594PheMet: 0.594 ± 0.122
0.693PheAsn: 0.693 ± 0.169
1.286PhePro: 1.286 ± 0.209
1.253PheGln: 1.253 ± 0.202
1.979PheArg: 1.979 ± 0.242
1.847PheSer: 1.847 ± 0.248
2.374PheThr: 2.374 ± 0.254
1.946PheVal: 1.946 ± 0.271
0.89PheTrp: 0.89 ± 0.2
0.824PheTyr: 0.824 ± 0.18
0.0PheXaa: 0.0 ± 0.0
Gly
7.354GlyAla: 7.354 ± 0.68
0.627GlyCys: 0.627 ± 0.186
4.485GlyAsp: 4.485 ± 0.36
6.332GlyGlu: 6.332 ± 0.492
2.968GlyPhe: 2.968 ± 0.405
7.552GlyGly: 7.552 ± 0.752
1.748GlyHis: 1.748 ± 0.231
3.298GlyIle: 3.298 ± 0.456
3.199GlyLys: 3.199 ± 0.357
6.068GlyLeu: 6.068 ± 0.545
2.308GlyMet: 2.308 ± 0.257
2.341GlyAsn: 2.341 ± 0.3
3.825GlyPro: 3.825 ± 0.309
3.034GlyGln: 3.034 ± 0.27
5.408GlyArg: 5.408 ± 0.438
5.21GlySer: 5.21 ± 0.576
5.837GlyThr: 5.837 ± 0.609
5.243GlyVal: 5.243 ± 0.467
2.242GlyTrp: 2.242 ± 0.296
2.77GlyTyr: 2.77 ± 0.295
0.0GlyXaa: 0.0 ± 0.0
His
1.847HisAla: 1.847 ± 0.307
0.066HisCys: 0.066 ± 0.046
1.22HisAsp: 1.22 ± 0.17
1.616HisGlu: 1.616 ± 0.237
0.462HisPhe: 0.462 ± 0.138
1.682HisGly: 1.682 ± 0.262
0.231HisHis: 0.231 ± 0.085
0.989HisIle: 0.989 ± 0.164
0.528HisLys: 0.528 ± 0.122
1.583HisLeu: 1.583 ± 0.218
0.231HisMet: 0.231 ± 0.097
0.528HisAsn: 0.528 ± 0.123
1.22HisPro: 1.22 ± 0.205
0.725HisGln: 0.725 ± 0.151
1.715HisArg: 1.715 ± 0.285
0.824HisSer: 0.824 ± 0.188
1.121HisThr: 1.121 ± 0.191
1.484HisVal: 1.484 ± 0.21
0.33HisTrp: 0.33 ± 0.094
0.857HisTyr: 0.857 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
4.749IleAla: 4.749 ± 0.503
0.231IleCys: 0.231 ± 0.08
4.023IleAsp: 4.023 ± 0.363
3.529IleGlu: 3.529 ± 0.338
0.956IlePhe: 0.956 ± 0.165
3.232IleGly: 3.232 ± 0.306
0.956IleHis: 0.956 ± 0.171
1.583IleIle: 1.583 ± 0.219
0.923IleLys: 0.923 ± 0.156
2.737IleLeu: 2.737 ± 0.298
0.956IleMet: 0.956 ± 0.174
0.989IleAsn: 0.989 ± 0.181
2.803IlePro: 2.803 ± 0.225
2.045IleGln: 2.045 ± 0.274
3.001IleArg: 3.001 ± 0.298
2.539IleSer: 2.539 ± 0.448
3.298IleThr: 3.298 ± 0.347
3.562IleVal: 3.562 ± 0.358
1.22IleTrp: 1.22 ± 0.217
1.187IleTyr: 1.187 ± 0.216
0.0IleXaa: 0.0 ± 0.0
Lys
5.144LysAla: 5.144 ± 0.438
0.297LysCys: 0.297 ± 0.1
1.847LysAsp: 1.847 ± 0.299
2.407LysGlu: 2.407 ± 0.277
0.824LysPhe: 0.824 ± 0.179
2.539LysGly: 2.539 ± 0.264
0.824LysHis: 0.824 ± 0.182
1.715LysIle: 1.715 ± 0.217
1.682LysLys: 1.682 ± 0.276
2.737LysLeu: 2.737 ± 0.312
0.956LysMet: 0.956 ± 0.158
1.055LysAsn: 1.055 ± 0.168
2.44LysPro: 2.44 ± 0.341
1.385LysGln: 1.385 ± 0.195
2.77LysArg: 2.77 ± 0.295
1.814LysSer: 1.814 ± 0.254
2.012LysThr: 2.012 ± 0.259
2.308LysVal: 2.308 ± 0.291
0.725LysTrp: 0.725 ± 0.126
0.824LysTyr: 0.824 ± 0.16
0.0LysXaa: 0.0 ± 0.0
Leu
7.189LeuAla: 7.189 ± 0.663
0.693LeuCys: 0.693 ± 0.167
5.705LeuAsp: 5.705 ± 0.539
6.595LeuGlu: 6.595 ± 0.519
1.352LeuPhe: 1.352 ± 0.219
5.606LeuGly: 5.606 ± 0.371
1.913LeuHis: 1.913 ± 0.275
3.496LeuIle: 3.496 ± 0.359
3.001LeuLys: 3.001 ± 0.346
5.903LeuLeu: 5.903 ± 0.478
2.111LeuMet: 2.111 ± 0.242
2.671LeuAsn: 2.671 ± 0.265
4.947LeuPro: 4.947 ± 0.455
2.473LeuGln: 2.473 ± 0.329
5.309LeuArg: 5.309 ± 0.464
5.111LeuSer: 5.111 ± 0.438
5.177LeuThr: 5.177 ± 0.47
5.474LeuVal: 5.474 ± 0.438
1.616LeuTrp: 1.616 ± 0.222
1.979LeuTyr: 1.979 ± 0.259
0.0LeuXaa: 0.0 ± 0.0
Met
3.001MetAla: 3.001 ± 0.267
0.132MetCys: 0.132 ± 0.073
1.418MetAsp: 1.418 ± 0.201
1.451MetGlu: 1.451 ± 0.239
0.824MetPhe: 0.824 ± 0.157
1.781MetGly: 1.781 ± 0.25
0.594MetHis: 0.594 ± 0.147
0.989MetIle: 0.989 ± 0.142
1.319MetLys: 1.319 ± 0.232
1.286MetLeu: 1.286 ± 0.195
0.923MetMet: 0.923 ± 0.168
1.22MetAsn: 1.22 ± 0.18
1.781MetPro: 1.781 ± 0.243
0.561MetGln: 0.561 ± 0.127
1.88MetArg: 1.88 ± 0.256
2.473MetSer: 2.473 ± 0.296
2.144MetThr: 2.144 ± 0.278
1.517MetVal: 1.517 ± 0.211
0.363MetTrp: 0.363 ± 0.108
0.462MetTyr: 0.462 ± 0.129
0.0MetXaa: 0.0 ± 0.0
Asn
2.902AsnAla: 2.902 ± 0.393
0.231AsnCys: 0.231 ± 0.086
1.979AsnAsp: 1.979 ± 0.258
2.572AsnGlu: 2.572 ± 0.318
0.627AsnPhe: 0.627 ± 0.156
3.001AsnGly: 3.001 ± 0.419
0.66AsnHis: 0.66 ± 0.155
1.319AsnIle: 1.319 ± 0.277
0.495AsnLys: 0.495 ± 0.122
2.209AsnLeu: 2.209 ± 0.301
0.66AsnMet: 0.66 ± 0.131
1.055AsnAsn: 1.055 ± 0.218
2.473AsnPro: 2.473 ± 0.334
1.022AsnGln: 1.022 ± 0.167
1.814AsnArg: 1.814 ± 0.23
1.253AsnSer: 1.253 ± 0.22
1.484AsnThr: 1.484 ± 0.27
2.671AsnVal: 2.671 ± 0.284
0.857AsnTrp: 0.857 ± 0.177
0.594AsnTyr: 0.594 ± 0.127
0.0AsnXaa: 0.0 ± 0.0
Pro
5.111ProAla: 5.111 ± 0.572
0.462ProCys: 0.462 ± 0.137
3.496ProAsp: 3.496 ± 0.387
5.078ProGlu: 5.078 ± 0.484
1.913ProPhe: 1.913 ± 0.225
4.584ProGly: 4.584 ± 0.511
1.055ProHis: 1.055 ± 0.192
3.001ProIle: 3.001 ± 0.292
2.144ProLys: 2.144 ± 0.229
3.496ProLeu: 3.496 ± 0.394
1.517ProMet: 1.517 ± 0.226
1.715ProAsn: 1.715 ± 0.23
2.968ProPro: 2.968 ± 0.444
1.649ProGln: 1.649 ± 0.251
3.067ProArg: 3.067 ± 0.334
3.232ProSer: 3.232 ± 0.355
4.386ProThr: 4.386 ± 0.378
4.452ProVal: 4.452 ± 0.331
1.088ProTrp: 1.088 ± 0.183
1.187ProTyr: 1.187 ± 0.231
0.0ProXaa: 0.0 ± 0.0
Gln
3.496GlnAla: 3.496 ± 0.365
0.363GlnCys: 0.363 ± 0.086
2.078GlnAsp: 2.078 ± 0.237
3.1GlnGlu: 3.1 ± 0.339
0.89GlnPhe: 0.89 ± 0.199
2.902GlnGly: 2.902 ± 0.345
0.791GlnHis: 0.791 ± 0.146
2.012GlnIle: 2.012 ± 0.248
1.484GlnLys: 1.484 ± 0.2
3.496GlnLeu: 3.496 ± 0.362
0.923GlnMet: 0.923 ± 0.216
0.989GlnAsn: 0.989 ± 0.161
1.781GlnPro: 1.781 ± 0.272
1.517GlnGln: 1.517 ± 0.244
2.935GlnArg: 2.935 ± 0.314
1.946GlnSer: 1.946 ± 0.287
2.506GlnThr: 2.506 ± 0.291
2.836GlnVal: 2.836 ± 0.253
0.89GlnTrp: 0.89 ± 0.15
0.725GlnTyr: 0.725 ± 0.147
0.0GlnXaa: 0.0 ± 0.0
Arg
8.31ArgAla: 8.31 ± 0.545
0.528ArgCys: 0.528 ± 0.155
3.858ArgAsp: 3.858 ± 0.393
5.013ArgGlu: 5.013 ± 0.386
2.176ArgPhe: 2.176 ± 0.258
5.078ArgGly: 5.078 ± 0.489
1.187ArgHis: 1.187 ± 0.233
3.1ArgIle: 3.1 ± 0.297
2.407ArgLys: 2.407 ± 0.312
6.035ArgLeu: 6.035 ± 0.504
2.176ArgMet: 2.176 ± 0.245
1.88ArgAsn: 1.88 ± 0.242
3.1ArgPro: 3.1 ± 0.351
2.671ArgGln: 2.671 ± 0.241
5.672ArgArg: 5.672 ± 0.617
3.001ArgSer: 3.001 ± 0.332
3.265ArgThr: 3.265 ± 0.348
4.617ArgVal: 4.617 ± 0.424
1.715ArgTrp: 1.715 ± 0.289
1.781ArgTyr: 1.781 ± 0.246
0.0ArgXaa: 0.0 ± 0.0
Ser
5.375SerAla: 5.375 ± 0.487
0.495SerCys: 0.495 ± 0.14
3.562SerAsp: 3.562 ± 0.369
3.858SerGlu: 3.858 ± 0.476
1.847SerPhe: 1.847 ± 0.222
5.078SerGly: 5.078 ± 0.497
0.693SerHis: 0.693 ± 0.152
2.473SerIle: 2.473 ± 0.253
1.814SerLys: 1.814 ± 0.261
4.683SerLeu: 4.683 ± 0.33
2.045SerMet: 2.045 ± 0.221
1.748SerAsn: 1.748 ± 0.285
2.275SerPro: 2.275 ± 0.32
1.814SerGln: 1.814 ± 0.22
3.529SerArg: 3.529 ± 0.298
2.737SerSer: 2.737 ± 0.36
3.265SerThr: 3.265 ± 0.328
4.155SerVal: 4.155 ± 0.367
1.418SerTrp: 1.418 ± 0.243
1.682SerTyr: 1.682 ± 0.227
0.0SerXaa: 0.0 ± 0.0
Thr
6.958ThrAla: 6.958 ± 0.61
0.857ThrCys: 0.857 ± 0.237
3.595ThrAsp: 3.595 ± 0.291
4.452ThrGlu: 4.452 ± 0.378
1.781ThrPhe: 1.781 ± 0.23
5.837ThrGly: 5.837 ± 0.485
0.956ThrHis: 0.956 ± 0.165
3.562ThrIle: 3.562 ± 0.324
1.979ThrLys: 1.979 ± 0.39
5.046ThrLeu: 5.046 ± 0.426
1.352ThrMet: 1.352 ± 0.201
1.682ThrAsn: 1.682 ± 0.234
4.122ThrPro: 4.122 ± 0.432
1.946ThrGln: 1.946 ± 0.271
3.463ThrArg: 3.463 ± 0.377
3.792ThrSer: 3.792 ± 0.365
4.782ThrThr: 4.782 ± 0.532
5.573ThrVal: 5.573 ± 0.397
1.352ThrTrp: 1.352 ± 0.222
1.649ThrTyr: 1.649 ± 0.224
0.0ThrXaa: 0.0 ± 0.0
Val
7.552ValAla: 7.552 ± 0.573
0.462ValCys: 0.462 ± 0.146
5.309ValAsp: 5.309 ± 0.42
4.749ValGlu: 4.749 ± 0.516
1.946ValPhe: 1.946 ± 0.241
4.815ValGly: 4.815 ± 0.56
1.121ValHis: 1.121 ± 0.218
3.331ValIle: 3.331 ± 0.367
3.199ValLys: 3.199 ± 0.306
5.936ValLeu: 5.936 ± 0.475
1.847ValMet: 1.847 ± 0.25
2.506ValAsn: 2.506 ± 0.282
4.287ValPro: 4.287 ± 0.444
2.539ValGln: 2.539 ± 0.244
5.078ValArg: 5.078 ± 0.434
3.693ValSer: 3.693 ± 0.314
5.936ValThr: 5.936 ± 0.414
5.408ValVal: 5.408 ± 0.414
1.649ValTrp: 1.649 ± 0.24
1.319ValTyr: 1.319 ± 0.216
0.0ValXaa: 0.0 ± 0.0
Trp
2.078TrpAla: 2.078 ± 0.287
0.231TrpCys: 0.231 ± 0.083
1.484TrpAsp: 1.484 ± 0.244
1.385TrpGlu: 1.385 ± 0.208
1.286TrpPhe: 1.286 ± 0.168
1.517TrpGly: 1.517 ± 0.227
0.627TrpHis: 0.627 ± 0.141
1.121TrpIle: 1.121 ± 0.213
0.758TrpLys: 0.758 ± 0.158
1.649TrpLeu: 1.649 ± 0.224
0.561TrpMet: 0.561 ± 0.129
1.088TrpAsn: 1.088 ± 0.354
1.088TrpPro: 1.088 ± 0.188
0.956TrpGln: 0.956 ± 0.169
1.484TrpArg: 1.484 ± 0.19
1.517TrpSer: 1.517 ± 0.237
1.319TrpThr: 1.319 ± 0.228
2.209TrpVal: 2.209 ± 0.297
0.462TrpTrp: 0.462 ± 0.136
0.429TrpTyr: 0.429 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.341TyrAla: 2.341 ± 0.264
0.297TyrCys: 0.297 ± 0.107
1.715TyrAsp: 1.715 ± 0.297
1.715TyrGlu: 1.715 ± 0.201
0.956TyrPhe: 0.956 ± 0.163
2.275TyrGly: 2.275 ± 0.264
0.528TyrHis: 0.528 ± 0.152
0.693TyrIle: 0.693 ± 0.127
0.857TyrLys: 0.857 ± 0.159
2.176TyrLeu: 2.176 ± 0.266
0.725TyrMet: 0.725 ± 0.161
1.055TyrAsn: 1.055 ± 0.233
1.088TyrPro: 1.088 ± 0.193
1.154TyrGln: 1.154 ± 0.184
2.242TyrArg: 2.242 ± 0.261
1.286TyrSer: 1.286 ± 0.188
1.451TyrThr: 1.451 ± 0.205
1.517TyrVal: 1.517 ± 0.258
0.693TyrTrp: 0.693 ± 0.17
0.66TyrTyr: 0.66 ± 0.17
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 156 proteins (30325 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski