Amino acid dipepetide frequency for Gordonia phage Nymphadora

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.696AlaAla: 18.696 ± 1.998
1.204AlaCys: 1.204 ± 0.339
8.66AlaAsp: 8.66 ± 0.688
7.685AlaGlu: 7.685 ± 0.736
2.581AlaPhe: 2.581 ± 0.422
9.233AlaGly: 9.233 ± 0.81
1.606AlaHis: 1.606 ± 0.33
4.76AlaIle: 4.76 ± 0.56
4.129AlaLys: 4.129 ± 0.552
8.316AlaLeu: 8.316 ± 0.721
3.498AlaMet: 3.498 ± 0.515
3.384AlaAsn: 3.384 ± 0.437
5.104AlaPro: 5.104 ± 0.447
4.473AlaGln: 4.473 ± 0.673
8.029AlaArg: 8.029 ± 0.572
6.71AlaSer: 6.71 ± 0.819
8.258AlaThr: 8.258 ± 1.001
9.119AlaVal: 9.119 ± 0.724
2.007AlaTrp: 2.007 ± 0.329
2.638AlaTyr: 2.638 ± 0.333
0.0AlaXaa: 0.0 ± 0.0
Cys
0.918CysAla: 0.918 ± 0.221
0.115CysCys: 0.115 ± 0.093
0.803CysAsp: 0.803 ± 0.285
0.516CysGlu: 0.516 ± 0.16
0.229CysPhe: 0.229 ± 0.135
1.147CysGly: 1.147 ± 0.391
0.115CysHis: 0.115 ± 0.093
0.115CysIle: 0.115 ± 0.072
0.057CysLys: 0.057 ± 0.058
0.229CysLeu: 0.229 ± 0.124
0.172CysMet: 0.172 ± 0.09
0.344CysAsn: 0.344 ± 0.155
0.401CysPro: 0.401 ± 0.169
0.287CysGln: 0.287 ± 0.172
0.803CysArg: 0.803 ± 0.255
0.516CysSer: 0.516 ± 0.176
0.516CysThr: 0.516 ± 0.181
0.344CysVal: 0.344 ± 0.15
0.287CysTrp: 0.287 ± 0.119
0.229CysTyr: 0.229 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
7.054AspAla: 7.054 ± 0.7
0.516AspCys: 0.516 ± 0.192
6.251AspAsp: 6.251 ± 0.907
4.359AspGlu: 4.359 ± 0.449
1.95AspPhe: 1.95 ± 0.361
6.366AspGly: 6.366 ± 0.851
2.065AspHis: 2.065 ± 0.387
2.523AspIle: 2.523 ± 0.364
1.72AspLys: 1.72 ± 0.306
6.825AspLeu: 6.825 ± 0.663
1.09AspMet: 1.09 ± 0.24
2.409AspAsn: 2.409 ± 0.401
5.333AspPro: 5.333 ± 0.58
2.581AspGln: 2.581 ± 0.381
5.391AspArg: 5.391 ± 0.697
2.695AspSer: 2.695 ± 0.395
3.384AspThr: 3.384 ± 0.554
4.531AspVal: 4.531 ± 0.514
1.434AspTrp: 1.434 ± 0.289
1.376AspTyr: 1.376 ± 0.279
0.0AspXaa: 0.0 ± 0.0
Glu
7.054GluAla: 7.054 ± 0.761
0.573GluCys: 0.573 ± 0.206
2.925GluAsp: 2.925 ± 0.373
3.04GluGlu: 3.04 ± 0.449
2.007GluPhe: 2.007 ± 0.298
3.9GluGly: 3.9 ± 0.494
1.147GluHis: 1.147 ± 0.257
2.982GluIle: 2.982 ± 0.434
2.179GluLys: 2.179 ± 0.332
4.703GluLeu: 4.703 ± 0.498
0.86GluMet: 0.86 ± 0.206
1.434GluAsn: 1.434 ± 0.311
2.867GluPro: 2.867 ± 0.532
2.753GluGln: 2.753 ± 0.365
4.875GluArg: 4.875 ± 0.637
2.925GluSer: 2.925 ± 0.406
3.785GluThr: 3.785 ± 0.513
4.645GluVal: 4.645 ± 0.486
1.09GluTrp: 1.09 ± 0.248
0.918GluTyr: 0.918 ± 0.203
0.0GluXaa: 0.0 ± 0.0
Phe
2.982PheAla: 2.982 ± 0.369
0.172PheCys: 0.172 ± 0.105
2.409PheAsp: 2.409 ± 0.428
1.606PheGlu: 1.606 ± 0.302
0.746PhePhe: 0.746 ± 0.174
2.867PheGly: 2.867 ± 0.426
0.516PheHis: 0.516 ± 0.153
0.86PheIle: 0.86 ± 0.222
1.09PheLys: 1.09 ± 0.272
1.032PheLeu: 1.032 ± 0.224
0.401PheMet: 0.401 ± 0.115
1.032PheAsn: 1.032 ± 0.225
1.491PhePro: 1.491 ± 0.327
0.688PheGln: 0.688 ± 0.204
2.179PheArg: 2.179 ± 0.363
1.434PheSer: 1.434 ± 0.266
2.122PheThr: 2.122 ± 0.341
1.434PheVal: 1.434 ± 0.279
1.032PheTrp: 1.032 ± 0.224
0.459PheTyr: 0.459 ± 0.196
0.0PheXaa: 0.0 ± 0.0
Gly
8.029GlyAla: 8.029 ± 0.751
0.803GlyCys: 0.803 ± 0.359
4.817GlyAsp: 4.817 ± 0.423
4.588GlyGlu: 4.588 ± 0.515
2.122GlyPhe: 2.122 ± 0.355
7.857GlyGly: 7.857 ± 1.113
1.262GlyHis: 1.262 ± 0.225
3.785GlyIle: 3.785 ± 0.345
3.441GlyLys: 3.441 ± 0.495
6.308GlyLeu: 6.308 ± 0.668
2.179GlyMet: 2.179 ± 0.388
3.212GlyAsn: 3.212 ± 0.447
3.785GlyPro: 3.785 ± 0.492
3.9GlyGln: 3.9 ± 0.514
6.423GlyArg: 6.423 ± 0.672
4.76GlySer: 4.76 ± 0.59
6.997GlyThr: 6.997 ± 1.106
5.964GlyVal: 5.964 ± 0.656
1.778GlyTrp: 1.778 ± 0.393
1.893GlyTyr: 1.893 ± 0.272
0.0GlyXaa: 0.0 ± 0.0
His
2.007HisAla: 2.007 ± 0.363
0.115HisCys: 0.115 ± 0.079
1.434HisAsp: 1.434 ± 0.324
1.319HisGlu: 1.319 ± 0.343
0.516HisPhe: 0.516 ± 0.154
1.376HisGly: 1.376 ± 0.232
0.746HisHis: 0.746 ± 0.236
0.918HisIle: 0.918 ± 0.237
0.573HisLys: 0.573 ± 0.201
1.72HisLeu: 1.72 ± 0.385
0.115HisMet: 0.115 ± 0.091
0.287HisAsn: 0.287 ± 0.136
1.491HisPro: 1.491 ± 0.352
0.86HisGln: 0.86 ± 0.211
1.548HisArg: 1.548 ± 0.308
0.631HisSer: 0.631 ± 0.169
1.262HisThr: 1.262 ± 0.241
1.319HisVal: 1.319 ± 0.329
0.459HisTrp: 0.459 ± 0.166
1.032HisTyr: 1.032 ± 0.239
0.0HisXaa: 0.0 ± 0.0
Ile
5.506IleAla: 5.506 ± 0.616
0.401IleCys: 0.401 ± 0.168
4.473IleAsp: 4.473 ± 0.457
3.842IleGlu: 3.842 ± 0.527
0.631IlePhe: 0.631 ± 0.176
4.244IleGly: 4.244 ± 0.505
1.147IleHis: 1.147 ± 0.239
1.548IleIle: 1.548 ± 0.274
0.975IleLys: 0.975 ± 0.247
1.893IleLeu: 1.893 ± 0.39
0.344IleMet: 0.344 ± 0.133
1.262IleAsn: 1.262 ± 0.305
2.695IlePro: 2.695 ± 0.357
2.122IleGln: 2.122 ± 0.427
3.842IleArg: 3.842 ± 0.415
2.351IleSer: 2.351 ± 0.321
2.638IleThr: 2.638 ± 0.443
3.498IleVal: 3.498 ± 0.407
0.401IleTrp: 0.401 ± 0.163
0.975IleTyr: 0.975 ± 0.194
0.0IleXaa: 0.0 ± 0.0
Lys
3.728LysAla: 3.728 ± 0.669
0.115LysCys: 0.115 ± 0.087
1.434LysAsp: 1.434 ± 0.287
1.548LysGlu: 1.548 ± 0.338
0.918LysPhe: 0.918 ± 0.199
2.638LysGly: 2.638 ± 0.478
0.746LysHis: 0.746 ± 0.199
1.835LysIle: 1.835 ± 0.349
1.147LysLys: 1.147 ± 0.306
2.81LysLeu: 2.81 ± 0.393
0.287LysMet: 0.287 ± 0.155
0.86LysAsn: 0.86 ± 0.252
2.466LysPro: 2.466 ± 0.378
1.204LysGln: 1.204 ± 0.279
2.466LysArg: 2.466 ± 0.402
1.72LysSer: 1.72 ± 0.295
2.351LysThr: 2.351 ± 0.451
2.294LysVal: 2.294 ± 0.308
0.975LysTrp: 0.975 ± 0.265
0.86LysTyr: 0.86 ± 0.216
0.0LysXaa: 0.0 ± 0.0
Leu
10.782LeuAla: 10.782 ± 0.859
0.746LeuCys: 0.746 ± 0.227
6.48LeuAsp: 6.48 ± 0.671
4.129LeuGlu: 4.129 ± 0.579
2.523LeuPhe: 2.523 ± 0.462
5.563LeuGly: 5.563 ± 0.419
1.376LeuHis: 1.376 ± 0.371
3.842LeuIle: 3.842 ± 0.551
2.237LeuLys: 2.237 ± 0.376
4.588LeuLeu: 4.588 ± 0.55
1.319LeuMet: 1.319 ± 0.305
2.179LeuAsn: 2.179 ± 0.415
4.359LeuPro: 4.359 ± 0.495
2.351LeuGln: 2.351 ± 0.297
6.308LeuArg: 6.308 ± 0.672
4.129LeuSer: 4.129 ± 0.499
5.219LeuThr: 5.219 ± 0.55
5.448LeuVal: 5.448 ± 0.756
1.434LeuTrp: 1.434 ± 0.283
1.147LeuTyr: 1.147 ± 0.265
0.0LeuXaa: 0.0 ± 0.0
Met
2.294MetAla: 2.294 ± 0.451
0.229MetCys: 0.229 ± 0.106
0.344MetAsp: 0.344 ± 0.159
0.573MetGlu: 0.573 ± 0.183
0.287MetPhe: 0.287 ± 0.112
1.72MetGly: 1.72 ± 0.371
0.516MetHis: 0.516 ± 0.155
0.86MetIle: 0.86 ± 0.227
0.86MetLys: 0.86 ± 0.218
1.262MetLeu: 1.262 ± 0.307
0.344MetMet: 0.344 ± 0.164
0.746MetAsn: 0.746 ± 0.207
1.204MetPro: 1.204 ± 0.235
0.746MetGln: 0.746 ± 0.191
2.065MetArg: 2.065 ± 0.306
1.548MetSer: 1.548 ± 0.305
2.982MetThr: 2.982 ± 0.391
1.262MetVal: 1.262 ± 0.268
0.516MetTrp: 0.516 ± 0.148
0.287MetTyr: 0.287 ± 0.132
0.0MetXaa: 0.0 ± 0.0
Asn
3.842AsnAla: 3.842 ± 0.403
0.057AsnCys: 0.057 ± 0.046
2.237AsnAsp: 2.237 ± 0.339
0.746AsnGlu: 0.746 ± 0.216
0.344AsnPhe: 0.344 ± 0.127
2.695AsnGly: 2.695 ± 0.388
1.032AsnHis: 1.032 ± 0.277
0.803AsnIle: 0.803 ± 0.193
0.688AsnLys: 0.688 ± 0.208
2.237AsnLeu: 2.237 ± 0.302
0.401AsnMet: 0.401 ± 0.124
0.86AsnAsn: 0.86 ± 0.221
2.409AsnPro: 2.409 ± 0.327
0.803AsnGln: 0.803 ± 0.203
2.581AsnArg: 2.581 ± 0.348
1.893AsnSer: 1.893 ± 0.251
2.466AsnThr: 2.466 ± 0.515
2.007AsnVal: 2.007 ± 0.337
0.516AsnTrp: 0.516 ± 0.151
0.918AsnTyr: 0.918 ± 0.249
0.0AsnXaa: 0.0 ± 0.0
Pro
6.767ProAla: 6.767 ± 0.593
0.516ProCys: 0.516 ± 0.201
4.875ProAsp: 4.875 ± 0.686
3.728ProGlu: 3.728 ± 0.463
1.548ProPhe: 1.548 ± 0.279
6.538ProGly: 6.538 ± 0.612
0.918ProHis: 0.918 ± 0.262
2.581ProIle: 2.581 ± 0.351
2.007ProLys: 2.007 ± 0.384
3.728ProLeu: 3.728 ± 0.454
1.548ProMet: 1.548 ± 0.31
1.893ProAsn: 1.893 ± 0.309
2.695ProPro: 2.695 ± 0.52
1.376ProGln: 1.376 ± 0.268
3.67ProArg: 3.67 ± 0.521
2.81ProSer: 2.81 ± 0.435
3.842ProThr: 3.842 ± 0.474
3.785ProVal: 3.785 ± 0.446
1.204ProTrp: 1.204 ± 0.219
0.918ProTyr: 0.918 ± 0.212
0.0ProXaa: 0.0 ± 0.0
Gln
4.301GlnAla: 4.301 ± 0.559
0.344GlnCys: 0.344 ± 0.134
1.434GlnAsp: 1.434 ± 0.243
1.548GlnGlu: 1.548 ± 0.29
0.918GlnPhe: 0.918 ± 0.174
2.466GlnGly: 2.466 ± 0.587
0.975GlnHis: 0.975 ± 0.256
2.351GlnIle: 2.351 ± 0.363
1.376GlnLys: 1.376 ± 0.297
4.588GlnLeu: 4.588 ± 0.607
1.434GlnMet: 1.434 ± 0.246
0.918GlnAsn: 0.918 ± 0.223
2.122GlnPro: 2.122 ± 0.286
1.548GlnGln: 1.548 ± 0.32
3.556GlnArg: 3.556 ± 0.464
1.319GlnSer: 1.319 ± 0.324
1.72GlnThr: 1.72 ± 0.351
2.695GlnVal: 2.695 ± 0.434
0.86GlnTrp: 0.86 ± 0.2
0.401GlnTyr: 0.401 ± 0.109
0.0GlnXaa: 0.0 ± 0.0
Arg
8.201ArgAla: 8.201 ± 0.766
0.688ArgCys: 0.688 ± 0.245
4.875ArgAsp: 4.875 ± 0.516
4.989ArgGlu: 4.989 ± 0.547
2.122ArgPhe: 2.122 ± 0.341
4.703ArgGly: 4.703 ± 0.469
1.262ArgHis: 1.262 ± 0.272
3.785ArgIle: 3.785 ± 0.495
3.498ArgLys: 3.498 ± 0.438
7.627ArgLeu: 7.627 ± 0.702
1.893ArgMet: 1.893 ± 0.332
1.95ArgAsn: 1.95 ± 0.35
4.531ArgPro: 4.531 ± 0.673
2.753ArgGln: 2.753 ± 0.521
6.71ArgArg: 6.71 ± 0.974
4.359ArgSer: 4.359 ± 0.538
5.104ArgThr: 5.104 ± 0.548
5.792ArgVal: 5.792 ± 0.622
1.491ArgTrp: 1.491 ± 0.325
2.466ArgTyr: 2.466 ± 0.356
0.0ArgXaa: 0.0 ± 0.0
Ser
5.62SerAla: 5.62 ± 0.672
0.229SerCys: 0.229 ± 0.121
3.785SerAsp: 3.785 ± 0.491
3.212SerGlu: 3.212 ± 0.341
1.376SerPhe: 1.376 ± 0.282
5.563SerGly: 5.563 ± 0.636
0.688SerHis: 0.688 ± 0.189
2.753SerIle: 2.753 ± 0.485
1.893SerLys: 1.893 ± 0.278
4.072SerLeu: 4.072 ± 0.409
1.204SerMet: 1.204 ± 0.318
2.122SerAsn: 2.122 ± 0.318
2.409SerPro: 2.409 ± 0.508
2.237SerGln: 2.237 ± 0.465
3.957SerArg: 3.957 ± 0.499
2.351SerSer: 2.351 ± 0.315
3.728SerThr: 3.728 ± 0.594
4.244SerVal: 4.244 ± 0.526
1.376SerTrp: 1.376 ± 0.239
1.262SerTyr: 1.262 ± 0.33
0.0SerXaa: 0.0 ± 0.0
Thr
9.864ThrAla: 9.864 ± 1.31
0.401ThrCys: 0.401 ± 0.143
4.645ThrAsp: 4.645 ± 0.431
2.753ThrGlu: 2.753 ± 0.428
1.72ThrPhe: 1.72 ± 0.302
6.538ThrGly: 6.538 ± 0.869
1.147ThrHis: 1.147 ± 0.228
3.842ThrIle: 3.842 ± 0.514
1.606ThrLys: 1.606 ± 0.296
5.104ThrLeu: 5.104 ± 0.495
1.548ThrMet: 1.548 ± 0.273
1.376ThrAsn: 1.376 ± 0.271
4.932ThrPro: 4.932 ± 0.591
1.893ThrGln: 1.893 ± 0.321
4.875ThrArg: 4.875 ± 0.432
5.047ThrSer: 5.047 ± 0.791
5.735ThrThr: 5.735 ± 0.639
5.907ThrVal: 5.907 ± 0.518
1.09ThrTrp: 1.09 ± 0.222
1.376ThrTyr: 1.376 ± 0.287
0.0ThrXaa: 0.0 ± 0.0
Val
8.316ValAla: 8.316 ± 0.648
0.631ValCys: 0.631 ± 0.225
5.735ValAsp: 5.735 ± 0.487
4.129ValGlu: 4.129 ± 0.466
2.638ValPhe: 2.638 ± 0.427
5.104ValGly: 5.104 ± 0.621
1.319ValHis: 1.319 ± 0.265
3.67ValIle: 3.67 ± 0.37
1.835ValLys: 1.835 ± 0.388
5.907ValLeu: 5.907 ± 0.7
1.032ValMet: 1.032 ± 0.221
2.179ValAsn: 2.179 ± 0.361
4.244ValPro: 4.244 ± 0.475
2.638ValGln: 2.638 ± 0.525
5.219ValArg: 5.219 ± 0.564
4.473ValSer: 4.473 ± 0.573
6.366ValThr: 6.366 ± 0.652
5.448ValVal: 5.448 ± 0.645
0.975ValTrp: 0.975 ± 0.282
1.032ValTyr: 1.032 ± 0.187
0.0ValXaa: 0.0 ± 0.0
Trp
1.893TrpAla: 1.893 ± 0.314
0.115TrpCys: 0.115 ± 0.086
0.975TrpAsp: 0.975 ± 0.249
1.262TrpGlu: 1.262 ± 0.328
0.86TrpPhe: 0.86 ± 0.242
1.032TrpGly: 1.032 ± 0.233
0.746TrpHis: 0.746 ± 0.217
0.516TrpIle: 0.516 ± 0.147
0.459TrpLys: 0.459 ± 0.152
1.72TrpLeu: 1.72 ± 0.289
0.459TrpMet: 0.459 ± 0.164
0.631TrpAsn: 0.631 ± 0.165
1.09TrpPro: 1.09 ± 0.206
0.918TrpGln: 0.918 ± 0.25
2.523TrpArg: 2.523 ± 0.44
0.86TrpSer: 0.86 ± 0.229
1.376TrpThr: 1.376 ± 0.245
1.434TrpVal: 1.434 ± 0.316
0.229TrpTrp: 0.229 ± 0.113
0.401TrpTyr: 0.401 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.237TyrAla: 2.237 ± 0.363
0.172TyrCys: 0.172 ± 0.099
1.262TyrAsp: 1.262 ± 0.282
1.09TyrGlu: 1.09 ± 0.31
0.516TyrPhe: 0.516 ± 0.156
2.351TyrGly: 2.351 ± 0.317
0.401TyrHis: 0.401 ± 0.132
0.459TyrIle: 0.459 ± 0.165
0.631TyrLys: 0.631 ± 0.218
1.491TyrLeu: 1.491 ± 0.333
0.459TyrMet: 0.459 ± 0.155
0.459TyrAsn: 0.459 ± 0.146
1.319TyrPro: 1.319 ± 0.288
0.746TyrGln: 0.746 ± 0.194
1.835TyrArg: 1.835 ± 0.353
1.548TyrSer: 1.548 ± 0.258
1.548TyrThr: 1.548 ± 0.269
1.72TyrVal: 1.72 ± 0.323
0.344TyrTrp: 0.344 ± 0.126
0.459TyrTyr: 0.459 ± 0.15
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (17438 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski