Amino acid dipepetide frequency for Agrobacterium phage Milano

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.175AlaAla: 10.175 ± 0.849
1.115AlaCys: 1.115 ± 0.254
5.25AlaAsp: 5.25 ± 0.463
6.133AlaGlu: 6.133 ± 0.462
4.6AlaPhe: 4.6 ± 0.524
6.969AlaGly: 6.969 ± 0.589
1.301AlaHis: 1.301 ± 0.25
5.436AlaIle: 5.436 ± 0.364
5.157AlaLys: 5.157 ± 0.552
6.458AlaLeu: 6.458 ± 0.75
2.509AlaMet: 2.509 ± 0.382
3.02AlaAsn: 3.02 ± 0.429
4.042AlaPro: 4.042 ± 0.589
2.973AlaGln: 2.973 ± 0.363
5.064AlaArg: 5.064 ± 0.534
3.856AlaSer: 3.856 ± 0.402
5.203AlaThr: 5.203 ± 0.513
6.319AlaVal: 6.319 ± 0.666
1.301AlaTrp: 1.301 ± 0.251
2.416AlaTyr: 2.416 ± 0.242
0.0AlaXaa: 0.0 ± 0.0
Cys
1.022CysAla: 1.022 ± 0.258
0.511CysCys: 0.511 ± 0.173
1.301CysAsp: 1.301 ± 0.24
0.929CysGlu: 0.929 ± 0.212
0.558CysPhe: 0.558 ± 0.174
1.533CysGly: 1.533 ± 0.337
0.418CysHis: 0.418 ± 0.115
0.372CysIle: 0.372 ± 0.132
0.697CysLys: 0.697 ± 0.187
0.836CysLeu: 0.836 ± 0.234
0.232CysMet: 0.232 ± 0.119
0.558CysAsn: 0.558 ± 0.162
0.65CysPro: 0.65 ± 0.17
0.232CysGln: 0.232 ± 0.102
0.976CysArg: 0.976 ± 0.232
0.697CysSer: 0.697 ± 0.241
0.743CysThr: 0.743 ± 0.215
1.301CysVal: 1.301 ± 0.226
0.325CysTrp: 0.325 ± 0.13
0.232CysTyr: 0.232 ± 0.11
0.0CysXaa: 0.0 ± 0.0
Asp
5.575AspAla: 5.575 ± 0.587
1.022AspCys: 1.022 ± 0.234
4.042AspAsp: 4.042 ± 0.559
4.46AspGlu: 4.46 ± 0.52
2.416AspPhe: 2.416 ± 0.326
6.133AspGly: 6.133 ± 0.625
1.115AspHis: 1.115 ± 0.258
3.066AspIle: 3.066 ± 0.348
2.323AspLys: 2.323 ± 0.323
3.903AspLeu: 3.903 ± 0.371
1.208AspMet: 1.208 ± 0.266
2.369AspAsn: 2.369 ± 0.355
3.02AspPro: 3.02 ± 0.408
1.254AspGln: 1.254 ± 0.25
3.81AspArg: 3.81 ± 0.421
2.184AspSer: 2.184 ± 0.362
3.438AspThr: 3.438 ± 0.336
5.064AspVal: 5.064 ± 0.438
1.069AspTrp: 1.069 ± 0.267
2.184AspTyr: 2.184 ± 0.415
0.0AspXaa: 0.0 ± 0.0
Glu
4.832GluAla: 4.832 ± 0.401
1.301GluCys: 1.301 ± 0.296
2.555GluAsp: 2.555 ± 0.284
3.949GluGlu: 3.949 ± 0.494
2.927GluPhe: 2.927 ± 0.412
4.6GluGly: 4.6 ± 0.452
1.347GluHis: 1.347 ± 0.232
4.507GluIle: 4.507 ± 0.425
3.438GluLys: 3.438 ± 0.477
5.854GluLeu: 5.854 ± 0.492
1.812GluMet: 1.812 ± 0.271
2.881GluAsn: 2.881 ± 0.412
2.695GluPro: 2.695 ± 0.416
2.369GluGln: 2.369 ± 0.342
4.832GluArg: 4.832 ± 0.521
3.484GluSer: 3.484 ± 0.407
3.392GluThr: 3.392 ± 0.475
3.531GluVal: 3.531 ± 0.451
1.161GluTrp: 1.161 ± 0.224
2.741GluTyr: 2.741 ± 0.29
0.0GluXaa: 0.0 ± 0.0
Phe
3.392PheAla: 3.392 ± 0.438
0.65PheCys: 0.65 ± 0.183
3.392PheAsp: 3.392 ± 0.431
3.159PheGlu: 3.159 ± 0.359
1.998PhePhe: 1.998 ± 0.318
3.903PheGly: 3.903 ± 0.39
0.79PheHis: 0.79 ± 0.188
3.206PheIle: 3.206 ± 0.394
2.23PheLys: 2.23 ± 0.27
2.881PheLeu: 2.881 ± 0.345
0.65PheMet: 0.65 ± 0.157
1.905PheAsn: 1.905 ± 0.286
1.673PhePro: 1.673 ± 0.294
1.301PheGln: 1.301 ± 0.225
2.648PheArg: 2.648 ± 0.36
2.555PheSer: 2.555 ± 0.393
2.509PheThr: 2.509 ± 0.394
3.392PheVal: 3.392 ± 0.381
0.279PheTrp: 0.279 ± 0.135
1.858PheTyr: 1.858 ± 0.288
0.0PheXaa: 0.0 ± 0.0
Gly
5.9GlyAla: 5.9 ± 0.517
1.208GlyCys: 1.208 ± 0.251
5.111GlyAsp: 5.111 ± 0.619
4.692GlyGlu: 4.692 ± 0.487
3.299GlyPhe: 3.299 ± 0.371
5.296GlyGly: 5.296 ± 0.888
1.301GlyHis: 1.301 ± 0.241
4.042GlyIle: 4.042 ± 0.457
4.971GlyLys: 4.971 ± 0.541
5.157GlyLeu: 5.157 ± 0.542
2.044GlyMet: 2.044 ± 0.353
3.856GlyAsn: 3.856 ± 0.425
2.927GlyPro: 2.927 ± 0.387
2.509GlyGln: 2.509 ± 0.383
4.181GlyArg: 4.181 ± 0.366
4.46GlySer: 4.46 ± 0.411
5.25GlyThr: 5.25 ± 0.498
6.644GlyVal: 6.644 ± 0.614
1.487GlyTrp: 1.487 ± 0.268
2.927GlyTyr: 2.927 ± 0.358
0.0GlyXaa: 0.0 ± 0.0
His
1.58HisAla: 1.58 ± 0.334
0.139HisCys: 0.139 ± 0.093
1.208HisAsp: 1.208 ± 0.253
1.115HisGlu: 1.115 ± 0.239
0.836HisPhe: 0.836 ± 0.16
1.719HisGly: 1.719 ± 0.316
0.558HisHis: 0.558 ± 0.148
0.976HisIle: 0.976 ± 0.194
0.697HisLys: 0.697 ± 0.201
0.79HisLeu: 0.79 ± 0.175
0.604HisMet: 0.604 ± 0.16
0.836HisAsn: 0.836 ± 0.22
0.929HisPro: 0.929 ± 0.198
0.279HisGln: 0.279 ± 0.122
0.976HisArg: 0.976 ± 0.184
1.161HisSer: 1.161 ± 0.288
0.604HisThr: 0.604 ± 0.161
1.812HisVal: 1.812 ± 0.32
0.232HisTrp: 0.232 ± 0.1
0.558HisTyr: 0.558 ± 0.147
0.0HisXaa: 0.0 ± 0.0
Ile
5.575IleAla: 5.575 ± 0.599
0.65IleCys: 0.65 ± 0.155
3.81IleAsp: 3.81 ± 0.375
4.228IleGlu: 4.228 ± 0.543
2.23IlePhe: 2.23 ± 0.353
3.903IleGly: 3.903 ± 0.447
1.069IleHis: 1.069 ± 0.258
2.927IleIle: 2.927 ± 0.35
2.23IleLys: 2.23 ± 0.31
3.113IleLeu: 3.113 ± 0.386
1.022IleMet: 1.022 ± 0.205
2.788IleAsn: 2.788 ± 0.395
3.066IlePro: 3.066 ± 0.368
1.347IleGln: 1.347 ± 0.294
3.949IleArg: 3.949 ± 0.441
3.438IleSer: 3.438 ± 0.41
3.531IleThr: 3.531 ± 0.4
4.321IleVal: 4.321 ± 0.405
0.976IleTrp: 0.976 ± 0.216
1.626IleTyr: 1.626 ± 0.298
0.0IleXaa: 0.0 ± 0.0
Lys
5.203LysAla: 5.203 ± 0.466
0.418LysCys: 0.418 ± 0.146
2.555LysAsp: 2.555 ± 0.354
3.113LysGlu: 3.113 ± 0.427
2.509LysPhe: 2.509 ± 0.339
3.996LysGly: 3.996 ± 0.387
1.208LysHis: 1.208 ± 0.255
2.602LysIle: 2.602 ± 0.342
2.509LysLys: 2.509 ± 0.409
4.228LysLeu: 4.228 ± 0.451
1.533LysMet: 1.533 ± 0.288
2.137LysAsn: 2.137 ± 0.298
2.462LysPro: 2.462 ± 0.368
1.719LysGln: 1.719 ± 0.326
3.763LysArg: 3.763 ± 0.371
3.02LysSer: 3.02 ± 0.374
3.392LysThr: 3.392 ± 0.387
3.438LysVal: 3.438 ± 0.425
0.883LysTrp: 0.883 ± 0.25
1.44LysTyr: 1.44 ± 0.262
0.0LysXaa: 0.0 ± 0.0
Leu
6.272LeuAla: 6.272 ± 0.456
1.069LeuCys: 1.069 ± 0.239
3.392LeuAsp: 3.392 ± 0.337
4.321LeuGlu: 4.321 ± 0.43
2.184LeuPhe: 2.184 ± 0.348
4.646LeuGly: 4.646 ± 0.562
1.254LeuHis: 1.254 ± 0.346
3.438LeuIle: 3.438 ± 0.389
5.018LeuLys: 5.018 ± 0.545
4.181LeuLeu: 4.181 ± 0.485
2.044LeuMet: 2.044 ± 0.3
3.159LeuAsn: 3.159 ± 0.429
3.577LeuPro: 3.577 ± 0.42
3.345LeuGln: 3.345 ± 0.373
4.367LeuArg: 4.367 ± 0.466
4.739LeuSer: 4.739 ± 0.502
5.668LeuThr: 5.668 ± 0.489
4.367LeuVal: 4.367 ± 0.414
0.697LeuTrp: 0.697 ± 0.176
2.137LeuTyr: 2.137 ± 0.335
0.0LeuXaa: 0.0 ± 0.0
Met
2.973MetAla: 2.973 ± 0.324
0.372MetCys: 0.372 ± 0.127
1.208MetAsp: 1.208 ± 0.246
1.347MetGlu: 1.347 ± 0.252
1.022MetPhe: 1.022 ± 0.207
1.533MetGly: 1.533 ± 0.319
0.325MetHis: 0.325 ± 0.143
1.858MetIle: 1.858 ± 0.232
1.533MetLys: 1.533 ± 0.295
1.301MetLeu: 1.301 ± 0.221
0.372MetMet: 0.372 ± 0.15
1.44MetAsn: 1.44 ± 0.268
0.79MetPro: 0.79 ± 0.168
0.883MetGln: 0.883 ± 0.212
1.487MetArg: 1.487 ± 0.271
1.626MetSer: 1.626 ± 0.297
3.299MetThr: 3.299 ± 0.433
1.394MetVal: 1.394 ± 0.261
0.279MetTrp: 0.279 ± 0.094
0.418MetTyr: 0.418 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
4.274AsnAla: 4.274 ± 0.552
0.79AsnCys: 0.79 ± 0.205
2.695AsnAsp: 2.695 ± 0.315
2.416AsnGlu: 2.416 ± 0.34
1.998AsnPhe: 1.998 ± 0.372
4.878AsnGly: 4.878 ± 0.562
0.743AsnHis: 0.743 ± 0.193
2.648AsnIle: 2.648 ± 0.313
1.44AsnLys: 1.44 ± 0.255
3.02AsnLeu: 3.02 ± 0.365
1.254AsnMet: 1.254 ± 0.234
2.23AsnAsn: 2.23 ± 0.446
2.834AsnPro: 2.834 ± 0.384
1.161AsnGln: 1.161 ± 0.254
2.602AsnArg: 2.602 ± 0.279
2.044AsnSer: 2.044 ± 0.255
1.858AsnThr: 1.858 ± 0.302
3.252AsnVal: 3.252 ± 0.46
0.697AsnTrp: 0.697 ± 0.156
1.673AsnTyr: 1.673 ± 0.31
0.0AsnXaa: 0.0 ± 0.0
Pro
5.296ProAla: 5.296 ± 0.666
0.65ProCys: 0.65 ± 0.158
2.788ProAsp: 2.788 ± 0.375
3.392ProGlu: 3.392 ± 0.351
2.137ProPhe: 2.137 ± 0.317
4.367ProGly: 4.367 ± 0.519
0.418ProHis: 0.418 ± 0.158
2.462ProIle: 2.462 ± 0.395
2.277ProLys: 2.277 ± 0.324
3.066ProLeu: 3.066 ± 0.351
1.069ProMet: 1.069 ± 0.239
2.277ProAsn: 2.277 ± 0.43
3.206ProPro: 3.206 ± 0.518
1.951ProGln: 1.951 ± 0.625
2.369ProArg: 2.369 ± 0.391
3.252ProSer: 3.252 ± 0.31
2.741ProThr: 2.741 ± 0.311
4.321ProVal: 4.321 ± 0.442
0.372ProTrp: 0.372 ± 0.124
1.58ProTyr: 1.58 ± 0.292
0.0ProXaa: 0.0 ± 0.0
Gln
2.927GlnAla: 2.927 ± 0.394
0.372GlnCys: 0.372 ± 0.136
1.254GlnAsp: 1.254 ± 0.22
2.23GlnGlu: 2.23 ± 0.317
1.347GlnPhe: 1.347 ± 0.253
2.137GlnGly: 2.137 ± 0.333
0.186GlnHis: 0.186 ± 0.11
1.812GlnIle: 1.812 ± 0.276
1.394GlnLys: 1.394 ± 0.235
2.462GlnLeu: 2.462 ± 0.322
0.929GlnMet: 0.929 ± 0.189
1.533GlnAsn: 1.533 ± 0.251
1.998GlnPro: 1.998 ± 0.415
1.347GlnGln: 1.347 ± 0.303
2.648GlnArg: 2.648 ± 0.349
1.487GlnSer: 1.487 ± 0.298
1.998GlnThr: 1.998 ± 0.305
2.555GlnVal: 2.555 ± 0.57
0.65GlnTrp: 0.65 ± 0.162
1.347GlnTyr: 1.347 ± 0.214
0.0GlnXaa: 0.0 ± 0.0
Arg
4.042ArgAla: 4.042 ± 0.471
0.558ArgCys: 0.558 ± 0.139
3.949ArgAsp: 3.949 ± 0.404
4.321ArgGlu: 4.321 ± 0.534
3.717ArgPhe: 3.717 ± 0.499
3.717ArgGly: 3.717 ± 0.508
1.58ArgHis: 1.58 ± 0.316
3.949ArgIle: 3.949 ± 0.402
3.531ArgLys: 3.531 ± 0.408
5.018ArgLeu: 5.018 ± 0.514
1.44ArgMet: 1.44 ± 0.222
3.299ArgAsn: 3.299 ± 0.348
2.741ArgPro: 2.741 ± 0.399
2.834ArgGln: 2.834 ± 0.397
4.507ArgArg: 4.507 ± 0.576
3.438ArgSer: 3.438 ± 0.434
3.113ArgThr: 3.113 ± 0.511
3.996ArgVal: 3.996 ± 0.413
0.836ArgTrp: 0.836 ± 0.209
2.137ArgTyr: 2.137 ± 0.313
0.0ArgXaa: 0.0 ± 0.0
Ser
5.482SerAla: 5.482 ± 0.548
0.836SerCys: 0.836 ± 0.289
3.577SerAsp: 3.577 ± 0.44
2.881SerGlu: 2.881 ± 0.413
2.927SerPhe: 2.927 ± 0.408
4.274SerGly: 4.274 ± 0.468
0.929SerHis: 0.929 ± 0.229
2.973SerIle: 2.973 ± 0.424
3.113SerLys: 3.113 ± 0.367
3.252SerLeu: 3.252 ± 0.417
1.533SerMet: 1.533 ± 0.278
2.044SerAsn: 2.044 ± 0.397
2.555SerPro: 2.555 ± 0.359
1.347SerGln: 1.347 ± 0.243
3.763SerArg: 3.763 ± 0.435
2.741SerSer: 2.741 ± 0.339
3.577SerThr: 3.577 ± 0.458
5.25SerVal: 5.25 ± 0.426
1.069SerTrp: 1.069 ± 0.218
1.394SerTyr: 1.394 ± 0.273
0.0SerXaa: 0.0 ± 0.0
Thr
4.785ThrAla: 4.785 ± 0.492
0.976ThrCys: 0.976 ± 0.241
3.438ThrAsp: 3.438 ± 0.448
3.996ThrGlu: 3.996 ± 0.385
2.602ThrPhe: 2.602 ± 0.334
5.111ThrGly: 5.111 ± 0.505
0.929ThrHis: 0.929 ± 0.199
4.181ThrIle: 4.181 ± 0.425
2.973ThrLys: 2.973 ± 0.426
5.018ThrLeu: 5.018 ± 0.498
1.765ThrMet: 1.765 ± 0.25
2.602ThrAsn: 2.602 ± 0.416
4.228ThrPro: 4.228 ± 0.464
1.347ThrGln: 1.347 ± 0.234
3.113ThrArg: 3.113 ± 0.39
3.206ThrSer: 3.206 ± 0.321
4.088ThrThr: 4.088 ± 0.658
5.25ThrVal: 5.25 ± 0.587
0.79ThrTrp: 0.79 ± 0.204
2.369ThrTyr: 2.369 ± 0.278
0.0ThrXaa: 0.0 ± 0.0
Val
5.947ValAla: 5.947 ± 0.543
0.697ValCys: 0.697 ± 0.18
5.296ValAsp: 5.296 ± 0.59
5.018ValGlu: 5.018 ± 0.427
3.113ValPhe: 3.113 ± 0.414
5.064ValGly: 5.064 ± 0.457
1.115ValHis: 1.115 ± 0.217
3.299ValIle: 3.299 ± 0.381
4.6ValLys: 4.6 ± 0.466
5.807ValLeu: 5.807 ± 0.531
2.323ValMet: 2.323 ± 0.274
3.159ValAsn: 3.159 ± 0.424
3.763ValPro: 3.763 ± 0.564
2.602ValGln: 2.602 ± 0.312
4.042ValArg: 4.042 ± 0.478
5.203ValSer: 5.203 ± 0.479
5.715ValThr: 5.715 ± 0.542
5.203ValVal: 5.203 ± 0.563
0.743ValTrp: 0.743 ± 0.227
2.462ValTyr: 2.462 ± 0.359
0.0ValXaa: 0.0 ± 0.0
Trp
1.022TrpAla: 1.022 ± 0.238
0.279TrpCys: 0.279 ± 0.102
0.976TrpAsp: 0.976 ± 0.261
0.883TrpGlu: 0.883 ± 0.187
0.883TrpPhe: 0.883 ± 0.236
0.697TrpGly: 0.697 ± 0.162
0.372TrpHis: 0.372 ± 0.129
0.604TrpIle: 0.604 ± 0.172
0.929TrpLys: 0.929 ± 0.202
1.301TrpLeu: 1.301 ± 0.254
0.418TrpMet: 0.418 ± 0.133
0.883TrpAsn: 0.883 ± 0.191
0.743TrpPro: 0.743 ± 0.21
0.511TrpGln: 0.511 ± 0.162
1.254TrpArg: 1.254 ± 0.193
0.743TrpSer: 0.743 ± 0.164
0.883TrpThr: 0.883 ± 0.185
0.697TrpVal: 0.697 ± 0.166
0.186TrpTrp: 0.186 ± 0.114
0.232TrpTyr: 0.232 ± 0.107
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.02TyrAla: 3.02 ± 0.418
0.511TyrCys: 0.511 ± 0.182
1.858TyrAsp: 1.858 ± 0.262
1.673TyrGlu: 1.673 ± 0.283
1.069TyrPhe: 1.069 ± 0.24
2.788TyrGly: 2.788 ± 0.364
0.65TyrHis: 0.65 ± 0.153
1.44TyrIle: 1.44 ± 0.293
1.069TyrLys: 1.069 ± 0.189
2.23TyrLeu: 2.23 ± 0.339
0.65TyrMet: 0.65 ± 0.164
1.533TyrAsn: 1.533 ± 0.251
2.091TyrPro: 2.091 ± 0.559
1.301TyrGln: 1.301 ± 0.232
2.369TyrArg: 2.369 ± 0.376
2.184TyrSer: 2.184 ± 0.38
1.765TyrThr: 1.765 ± 0.292
3.066TyrVal: 3.066 ± 0.329
0.465TyrTrp: 0.465 ± 0.158
0.697TyrTyr: 0.697 ± 0.169
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 127 proteins (21525 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski