Amino acid dipepetide frequency for Acinetobacter phage YMC13/03/R2096

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.477AlaAla: 2.477 ± 0.387
0.574AlaCys: 0.574 ± 0.134
3.733AlaAsp: 3.733 ± 0.399
4.056AlaGlu: 4.056 ± 0.326
2.513AlaPhe: 2.513 ± 0.341
3.195AlaGly: 3.195 ± 0.398
0.646AlaHis: 0.646 ± 0.152
4.271AlaIle: 4.271 ± 0.38
5.312AlaLys: 5.312 ± 0.44
5.994AlaLeu: 5.994 ± 0.484
1.687AlaMet: 1.687 ± 0.229
2.584AlaAsn: 2.584 ± 0.312
1.256AlaPro: 1.256 ± 0.186
2.369AlaGln: 2.369 ± 0.353
2.836AlaArg: 2.836 ± 0.31
3.374AlaSer: 3.374 ± 0.417
3.266AlaThr: 3.266 ± 0.505
3.41AlaVal: 3.41 ± 0.337
0.861AlaTrp: 0.861 ± 0.176
2.405AlaTyr: 2.405 ± 0.294
0.0AlaXaa: 0.0 ± 0.0
Cys
0.826CysAla: 0.826 ± 0.199
0.072CysCys: 0.072 ± 0.053
0.79CysAsp: 0.79 ± 0.167
0.718CysGlu: 0.718 ± 0.163
0.61CysPhe: 0.61 ± 0.14
0.826CysGly: 0.826 ± 0.177
0.323CysHis: 0.323 ± 0.12
0.754CysIle: 0.754 ± 0.162
0.897CysLys: 0.897 ± 0.179
0.969CysLeu: 0.969 ± 0.181
0.287CysMet: 0.287 ± 0.098
0.503CysAsn: 0.503 ± 0.116
0.287CysPro: 0.287 ± 0.091
0.538CysGln: 0.538 ± 0.172
0.646CysArg: 0.646 ± 0.142
0.61CysSer: 0.61 ± 0.19
0.79CysThr: 0.79 ± 0.139
0.646CysVal: 0.646 ± 0.22
0.395CysTrp: 0.395 ± 0.103
0.574CysTyr: 0.574 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
3.518AspAla: 3.518 ± 0.414
0.682AspCys: 0.682 ± 0.165
3.553AspAsp: 3.553 ± 0.322
4.774AspGlu: 4.774 ± 0.492
3.625AspPhe: 3.625 ± 0.374
4.953AspGly: 4.953 ± 0.372
1.077AspHis: 1.077 ± 0.191
4.128AspIle: 4.128 ± 0.404
4.128AspLys: 4.128 ± 0.44
6.174AspLeu: 6.174 ± 0.505
1.615AspMet: 1.615 ± 0.192
2.943AspAsn: 2.943 ± 0.312
1.902AspPro: 1.902 ± 0.239
2.046AspGln: 2.046 ± 0.217
2.513AspArg: 2.513 ± 0.261
3.302AspSer: 3.302 ± 0.324
3.087AspThr: 3.087 ± 0.346
5.456AspVal: 5.456 ± 0.407
1.113AspTrp: 1.113 ± 0.19
2.764AspTyr: 2.764 ± 0.327
0.0AspXaa: 0.0 ± 0.0
Glu
3.446GluAla: 3.446 ± 0.349
1.005GluCys: 1.005 ± 0.194
5.205GluAsp: 5.205 ± 0.496
5.851GluGlu: 5.851 ± 0.605
3.051GluPhe: 3.051 ± 0.267
4.559GluGly: 4.559 ± 0.368
1.184GluHis: 1.184 ± 0.164
5.779GluIle: 5.779 ± 0.512
5.815GluLys: 5.815 ± 0.54
5.24GluLeu: 5.24 ± 0.41
2.548GluMet: 2.548 ± 0.287
3.482GluAsn: 3.482 ± 0.327
1.436GluPro: 1.436 ± 0.217
2.907GluGln: 2.907 ± 0.333
2.513GluArg: 2.513 ± 0.298
3.948GluSer: 3.948 ± 0.33
3.015GluThr: 3.015 ± 0.321
6.604GluVal: 6.604 ± 0.56
1.113GluTrp: 1.113 ± 0.227
2.907GluTyr: 2.907 ± 0.299
0.0GluXaa: 0.0 ± 0.0
Phe
2.333PheAla: 2.333 ± 0.299
0.431PheCys: 0.431 ± 0.122
3.302PheAsp: 3.302 ± 0.307
3.446PheGlu: 3.446 ± 0.383
1.543PhePhe: 1.543 ± 0.297
2.907PheGly: 2.907 ± 0.303
0.646PheHis: 0.646 ± 0.157
2.979PheIle: 2.979 ± 0.379
3.625PheLys: 3.625 ± 0.405
3.159PheLeu: 3.159 ± 0.37
0.969PheMet: 0.969 ± 0.213
2.872PheAsn: 2.872 ± 0.364
0.861PhePro: 0.861 ± 0.151
1.149PheGln: 1.149 ± 0.218
1.579PheArg: 1.579 ± 0.239
3.015PheSer: 3.015 ± 0.366
2.728PheThr: 2.728 ± 0.305
2.907PheVal: 2.907 ± 0.282
0.395PheTrp: 0.395 ± 0.106
1.938PheTyr: 1.938 ± 0.284
0.0PheXaa: 0.0 ± 0.0
Gly
3.912GlyAla: 3.912 ± 0.357
1.041GlyCys: 1.041 ± 0.193
5.205GlyAsp: 5.205 ± 0.509
4.092GlyGlu: 4.092 ± 0.355
3.266GlyPhe: 3.266 ± 0.351
4.917GlyGly: 4.917 ± 0.542
1.472GlyHis: 1.472 ± 0.235
3.553GlyIle: 3.553 ± 0.322
5.958GlyLys: 5.958 ± 0.507
5.851GlyLeu: 5.851 ± 0.481
2.369GlyMet: 2.369 ± 0.251
3.553GlyAsn: 3.553 ± 0.33
0.574GlyPro: 0.574 ± 0.149
2.046GlyGln: 2.046 ± 0.278
3.159GlyArg: 3.159 ± 0.352
4.056GlySer: 4.056 ± 0.338
3.769GlyThr: 3.769 ± 0.376
4.953GlyVal: 4.953 ± 0.399
1.292GlyTrp: 1.292 ± 0.191
4.056GlyTyr: 4.056 ± 0.363
0.0GlyXaa: 0.0 ± 0.0
His
0.861HisAla: 0.861 ± 0.175
0.323HisCys: 0.323 ± 0.108
0.897HisAsp: 0.897 ± 0.169
1.22HisGlu: 1.22 ± 0.201
0.61HisPhe: 0.61 ± 0.138
0.861HisGly: 0.861 ± 0.216
0.323HisHis: 0.323 ± 0.117
1.113HisIle: 1.113 ± 0.234
1.902HisLys: 1.902 ± 0.253
1.938HisLeu: 1.938 ± 0.257
0.503HisMet: 0.503 ± 0.137
1.041HisAsn: 1.041 ± 0.226
0.826HisPro: 0.826 ± 0.191
0.682HisGln: 0.682 ± 0.166
0.826HisArg: 0.826 ± 0.183
1.615HisSer: 1.615 ± 0.22
1.005HisThr: 1.005 ± 0.18
1.292HisVal: 1.292 ± 0.219
0.359HisTrp: 0.359 ± 0.127
0.718HisTyr: 0.718 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
4.092IleAla: 4.092 ± 0.391
0.754IleCys: 0.754 ± 0.174
3.912IleAsp: 3.912 ± 0.365
4.846IleGlu: 4.846 ± 0.43
2.297IlePhe: 2.297 ± 0.282
4.379IleGly: 4.379 ± 0.443
1.149IleHis: 1.149 ± 0.183
3.446IleIle: 3.446 ± 0.365
5.061IleLys: 5.061 ± 0.407
4.917IleLeu: 4.917 ± 0.42
1.579IleMet: 1.579 ± 0.208
3.266IleAsn: 3.266 ± 0.491
2.441IlePro: 2.441 ± 0.352
2.62IleGln: 2.62 ± 0.326
3.087IleArg: 3.087 ± 0.305
4.989IleSer: 4.989 ± 0.516
3.697IleThr: 3.697 ± 0.443
5.133IleVal: 5.133 ± 0.427
0.754IleTrp: 0.754 ± 0.173
2.154IleTyr: 2.154 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
4.594LysAla: 4.594 ± 0.496
0.897LysCys: 0.897 ± 0.183
5.061LysAsp: 5.061 ± 0.407
4.738LysGlu: 4.738 ± 0.431
2.907LysPhe: 2.907 ± 0.313
4.594LysGly: 4.594 ± 0.426
1.974LysHis: 1.974 ± 0.298
4.738LysIle: 4.738 ± 0.401
4.666LysLys: 4.666 ± 0.495
7.215LysLeu: 7.215 ± 0.434
2.297LysMet: 2.297 ± 0.256
2.979LysAsn: 2.979 ± 0.33
2.979LysPro: 2.979 ± 0.306
2.8LysGln: 2.8 ± 0.323
3.733LysArg: 3.733 ± 0.44
4.594LysSer: 4.594 ± 0.373
3.984LysThr: 3.984 ± 0.439
7.322LysVal: 7.322 ± 0.583
1.041LysTrp: 1.041 ± 0.216
3.518LysTyr: 3.518 ± 0.375
0.0LysXaa: 0.0 ± 0.0
Leu
5.492LeuAla: 5.492 ± 0.486
1.005LeuCys: 1.005 ± 0.206
5.779LeuAsp: 5.779 ± 0.367
6.892LeuGlu: 6.892 ± 0.517
3.374LeuPhe: 3.374 ± 0.41
6.174LeuGly: 6.174 ± 0.435
1.4LeuHis: 1.4 ± 0.276
4.774LeuIle: 4.774 ± 0.474
6.533LeuLys: 6.533 ± 0.525
5.887LeuLeu: 5.887 ± 0.472
2.513LeuMet: 2.513 ± 0.299
4.271LeuAsn: 4.271 ± 0.442
2.477LeuPro: 2.477 ± 0.316
3.302LeuGln: 3.302 ± 0.379
2.943LeuArg: 2.943 ± 0.294
5.743LeuSer: 5.743 ± 0.506
4.594LeuThr: 4.594 ± 0.37
5.635LeuVal: 5.635 ± 0.427
0.969LeuTrp: 0.969 ± 0.202
2.8LeuTyr: 2.8 ± 0.359
0.0LeuXaa: 0.0 ± 0.0
Met
2.405MetAla: 2.405 ± 0.38
0.323MetCys: 0.323 ± 0.109
1.256MetAsp: 1.256 ± 0.246
2.118MetGlu: 2.118 ± 0.266
1.005MetPhe: 1.005 ± 0.187
1.436MetGly: 1.436 ± 0.218
0.503MetHis: 0.503 ± 0.136
1.651MetIle: 1.651 ± 0.25
2.154MetLys: 2.154 ± 0.297
2.118MetLeu: 2.118 ± 0.312
0.826MetMet: 0.826 ± 0.192
1.149MetAsn: 1.149 ± 0.215
1.4MetPro: 1.4 ± 0.223
0.897MetGln: 0.897 ± 0.187
1.256MetArg: 1.256 ± 0.256
1.902MetSer: 1.902 ± 0.235
1.436MetThr: 1.436 ± 0.265
1.579MetVal: 1.579 ± 0.214
0.431MetTrp: 0.431 ± 0.136
1.4MetTyr: 1.4 ± 0.23
0.0MetXaa: 0.0 ± 0.0
Asn
2.297AsnAla: 2.297 ± 0.382
0.467AsnCys: 0.467 ± 0.142
2.046AsnAsp: 2.046 ± 0.253
2.656AsnGlu: 2.656 ± 0.309
2.01AsnPhe: 2.01 ± 0.289
4.846AsnGly: 4.846 ± 0.402
0.61AsnHis: 0.61 ± 0.17
3.553AsnIle: 3.553 ± 0.401
3.195AsnLys: 3.195 ± 0.299
4.235AsnLeu: 4.235 ± 0.424
1.579AsnMet: 1.579 ± 0.225
2.692AsnAsn: 2.692 ± 0.383
3.159AsnPro: 3.159 ± 0.319
1.938AsnGln: 1.938 ± 0.331
1.687AsnArg: 1.687 ± 0.293
3.051AsnSer: 3.051 ± 0.416
3.769AsnThr: 3.769 ± 0.453
3.302AsnVal: 3.302 ± 0.39
0.538AsnTrp: 0.538 ± 0.14
1.508AsnTyr: 1.508 ± 0.272
0.0AsnXaa: 0.0 ± 0.0
Pro
1.256ProAla: 1.256 ± 0.217
0.395ProCys: 0.395 ± 0.128
2.225ProAsp: 2.225 ± 0.267
3.266ProGlu: 3.266 ± 0.354
1.077ProPhe: 1.077 ± 0.21
0.0ProGly: 0.0 ± 0.0
0.574ProHis: 0.574 ± 0.159
2.405ProIle: 2.405 ± 0.291
2.872ProLys: 2.872 ± 0.328
2.01ProLeu: 2.01 ± 0.232
0.646ProMet: 0.646 ± 0.184
1.795ProAsn: 1.795 ± 0.299
0.718ProPro: 0.718 ± 0.145
1.149ProGln: 1.149 ± 0.158
1.328ProArg: 1.328 ± 0.243
2.656ProSer: 2.656 ± 0.333
2.333ProThr: 2.333 ± 0.253
2.441ProVal: 2.441 ± 0.26
0.215ProTrp: 0.215 ± 0.076
1.22ProTyr: 1.22 ± 0.22
0.0ProXaa: 0.0 ± 0.0
Gln
2.8GlnAla: 2.8 ± 0.36
0.287GlnCys: 0.287 ± 0.1
2.369GlnAsp: 2.369 ± 0.291
2.907GlnGlu: 2.907 ± 0.275
1.328GlnPhe: 1.328 ± 0.197
2.261GlnGly: 2.261 ± 0.267
0.79GlnHis: 0.79 ± 0.204
2.297GlnIle: 2.297 ± 0.33
2.513GlnLys: 2.513 ± 0.297
3.23GlnLeu: 3.23 ± 0.378
0.861GlnMet: 0.861 ± 0.174
1.579GlnAsn: 1.579 ± 0.249
1.22GlnPro: 1.22 ± 0.227
1.723GlnGln: 1.723 ± 0.272
1.364GlnArg: 1.364 ± 0.217
1.974GlnSer: 1.974 ± 0.252
2.333GlnThr: 2.333 ± 0.26
2.297GlnVal: 2.297 ± 0.272
0.682GlnTrp: 0.682 ± 0.142
1.974GlnTyr: 1.974 ± 0.235
0.0GlnXaa: 0.0 ± 0.0
Arg
2.154ArgAla: 2.154 ± 0.255
0.359ArgCys: 0.359 ± 0.113
2.369ArgAsp: 2.369 ± 0.27
3.374ArgGlu: 3.374 ± 0.39
2.369ArgPhe: 2.369 ± 0.275
2.8ArgGly: 2.8 ± 0.364
1.077ArgHis: 1.077 ± 0.181
2.584ArgIle: 2.584 ± 0.268
3.266ArgLys: 3.266 ± 0.321
3.661ArgLeu: 3.661 ± 0.371
1.077ArgMet: 1.077 ± 0.16
2.513ArgAsn: 2.513 ± 0.306
1.184ArgPro: 1.184 ± 0.2
1.472ArgGln: 1.472 ± 0.215
1.723ArgArg: 1.723 ± 0.25
2.728ArgSer: 2.728 ± 0.31
2.261ArgThr: 2.261 ± 0.264
3.518ArgVal: 3.518 ± 0.294
0.754ArgTrp: 0.754 ± 0.14
2.19ArgTyr: 2.19 ± 0.33
0.0ArgXaa: 0.0 ± 0.0
Ser
3.733SerAla: 3.733 ± 0.452
1.005SerCys: 1.005 ± 0.213
3.625SerAsp: 3.625 ± 0.445
4.774SerGlu: 4.774 ± 0.468
3.266SerPhe: 3.266 ± 0.381
4.81SerGly: 4.81 ± 0.389
1.256SerHis: 1.256 ± 0.239
3.877SerIle: 3.877 ± 0.38
4.81SerLys: 4.81 ± 0.432
5.42SerLeu: 5.42 ± 0.37
1.292SerMet: 1.292 ± 0.209
3.087SerAsn: 3.087 ± 0.359
2.225SerPro: 2.225 ± 0.309
2.333SerGln: 2.333 ± 0.256
3.302SerArg: 3.302 ± 0.306
4.702SerSer: 4.702 ± 0.468
3.984SerThr: 3.984 ± 0.458
4.128SerVal: 4.128 ± 0.324
0.754SerTrp: 0.754 ± 0.187
2.872SerTyr: 2.872 ± 0.315
0.0SerXaa: 0.0 ± 0.0
Thr
3.661ThrAla: 3.661 ± 0.458
0.574ThrCys: 0.574 ± 0.142
3.769ThrAsp: 3.769 ± 0.434
3.159ThrGlu: 3.159 ± 0.332
2.441ThrPhe: 2.441 ± 0.271
5.205ThrGly: 5.205 ± 0.451
1.113ThrHis: 1.113 ± 0.224
4.164ThrIle: 4.164 ± 0.373
3.841ThrLys: 3.841 ± 0.349
4.415ThrLeu: 4.415 ± 0.418
1.292ThrMet: 1.292 ± 0.224
2.01ThrAsn: 2.01 ± 0.284
2.082ThrPro: 2.082 ± 0.273
2.441ThrGln: 2.441 ± 0.305
2.261ThrArg: 2.261 ± 0.254
4.307ThrSer: 4.307 ± 0.425
3.446ThrThr: 3.446 ± 0.372
3.984ThrVal: 3.984 ± 0.398
0.538ThrTrp: 0.538 ± 0.122
2.441ThrTyr: 2.441 ± 0.271
0.0ThrXaa: 0.0 ± 0.0
Val
4.02ValAla: 4.02 ± 0.382
0.646ValCys: 0.646 ± 0.13
4.451ValAsp: 4.451 ± 0.361
4.882ValGlu: 4.882 ± 0.459
3.41ValPhe: 3.41 ± 0.335
5.958ValGly: 5.958 ± 0.439
1.508ValHis: 1.508 ± 0.208
5.456ValIle: 5.456 ± 0.358
5.456ValLys: 5.456 ± 0.415
5.851ValLeu: 5.851 ± 0.481
1.795ValMet: 1.795 ± 0.273
3.446ValAsn: 3.446 ± 0.461
2.154ValPro: 2.154 ± 0.259
2.333ValGln: 2.333 ± 0.218
3.338ValArg: 3.338 ± 0.395
5.133ValSer: 5.133 ± 0.469
4.559ValThr: 4.559 ± 0.338
5.887ValVal: 5.887 ± 0.43
0.969ValTrp: 0.969 ± 0.185
3.374ValTyr: 3.374 ± 0.335
0.0ValXaa: 0.0 ± 0.0
Trp
0.61TrpAla: 0.61 ± 0.146
0.287TrpCys: 0.287 ± 0.103
1.005TrpAsp: 1.005 ± 0.182
1.077TrpGlu: 1.077 ± 0.196
0.718TrpPhe: 0.718 ± 0.198
0.933TrpGly: 0.933 ± 0.175
0.323TrpHis: 0.323 ± 0.123
0.574TrpIle: 0.574 ± 0.148
1.041TrpLys: 1.041 ± 0.192
1.256TrpLeu: 1.256 ± 0.246
0.359TrpMet: 0.359 ± 0.117
0.79TrpAsn: 0.79 ± 0.192
0.072TrpPro: 0.072 ± 0.048
0.538TrpGln: 0.538 ± 0.177
0.754TrpArg: 0.754 ± 0.165
1.113TrpSer: 1.113 ± 0.176
0.467TrpThr: 0.467 ± 0.155
1.113TrpVal: 1.113 ± 0.198
0.323TrpTrp: 0.323 ± 0.101
0.682TrpTyr: 0.682 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.333TyrAla: 2.333 ± 0.257
0.933TyrCys: 0.933 ± 0.196
2.8TyrAsp: 2.8 ± 0.294
2.764TyrGlu: 2.764 ± 0.261
1.364TyrPhe: 1.364 ± 0.223
3.482TyrGly: 3.482 ± 0.38
0.969TyrHis: 0.969 ± 0.178
2.477TyrIle: 2.477 ± 0.279
3.625TyrLys: 3.625 ± 0.401
3.015TyrLeu: 3.015 ± 0.333
1.113TyrMet: 1.113 ± 0.199
2.584TyrAsn: 2.584 ± 0.28
1.328TyrPro: 1.328 ± 0.203
1.508TyrGln: 1.508 ± 0.212
2.584TyrArg: 2.584 ± 0.31
2.441TyrSer: 2.441 ± 0.259
2.62TyrThr: 2.62 ± 0.303
3.015TyrVal: 3.015 ± 0.313
0.538TyrTrp: 0.538 ± 0.115
1.866TyrTyr: 1.866 ± 0.306
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 162 proteins (27861 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski