Amino acid dipepetide frequency for Gordonia phage Luker

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.663AlaAla: 13.663 ± 4.601
0.571AlaCys: 0.571 ± 0.182
4.964AlaAsp: 4.964 ± 0.442
6.985AlaGlu: 6.985 ± 0.737
2.856AlaPhe: 2.856 ± 0.373
7.161AlaGly: 7.161 ± 0.825
1.142AlaHis: 1.142 ± 0.271
5.184AlaIle: 5.184 ± 0.824
4.218AlaLys: 4.218 ± 0.521
8.347AlaLeu: 8.347 ± 0.941
3.119AlaMet: 3.119 ± 0.454
3.075AlaAsn: 3.075 ± 0.515
4.613AlaPro: 4.613 ± 0.524
3.866AlaGln: 3.866 ± 0.703
5.579AlaArg: 5.579 ± 0.57
5.799AlaSer: 5.799 ± 0.811
6.019AlaThr: 6.019 ± 0.566
6.458AlaVal: 6.458 ± 0.84
1.977AlaTrp: 1.977 ± 0.603
2.636AlaTyr: 2.636 ± 0.292
0.0AlaXaa: 0.0 ± 0.0
Cys
0.747CysAla: 0.747 ± 0.183
0.132CysCys: 0.132 ± 0.087
0.571CysAsp: 0.571 ± 0.192
0.351CysGlu: 0.351 ± 0.111
0.308CysPhe: 0.308 ± 0.127
1.01CysGly: 1.01 ± 0.296
0.264CysHis: 0.264 ± 0.13
0.615CysIle: 0.615 ± 0.178
0.439CysLys: 0.439 ± 0.172
0.308CysLeu: 0.308 ± 0.142
0.176CysMet: 0.176 ± 0.086
0.351CysAsn: 0.351 ± 0.124
0.615CysPro: 0.615 ± 0.21
0.264CysGln: 0.264 ± 0.103
0.703CysArg: 0.703 ± 0.227
0.747CysSer: 0.747 ± 0.221
0.659CysThr: 0.659 ± 0.214
0.615CysVal: 0.615 ± 0.212
0.088CysTrp: 0.088 ± 0.072
0.351CysTyr: 0.351 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
5.052AspAla: 5.052 ± 0.502
0.571AspCys: 0.571 ± 0.224
4.042AspAsp: 4.042 ± 0.761
5.096AspGlu: 5.096 ± 0.772
2.328AspPhe: 2.328 ± 0.325
4.657AspGly: 4.657 ± 0.611
0.967AspHis: 0.967 ± 0.195
3.383AspIle: 3.383 ± 0.414
2.768AspLys: 2.768 ± 0.383
4.305AspLeu: 4.305 ± 0.473
1.318AspMet: 1.318 ± 0.255
2.856AspAsn: 2.856 ± 0.348
3.998AspPro: 3.998 ± 0.635
2.065AspGln: 2.065 ± 0.242
2.46AspArg: 2.46 ± 0.47
3.427AspSer: 3.427 ± 0.441
3.559AspThr: 3.559 ± 0.397
4.218AspVal: 4.218 ± 0.634
1.274AspTrp: 1.274 ± 0.255
2.241AspTyr: 2.241 ± 0.427
0.0AspXaa: 0.0 ± 0.0
Glu
6.766GluAla: 6.766 ± 0.525
0.923GluCys: 0.923 ± 0.264
4.174GluAsp: 4.174 ± 0.617
5.887GluGlu: 5.887 ± 0.708
2.372GluPhe: 2.372 ± 0.467
4.657GluGly: 4.657 ± 0.524
0.923GluHis: 0.923 ± 0.214
3.778GluIle: 3.778 ± 0.4
2.856GluLys: 2.856 ± 0.37
6.107GluLeu: 6.107 ± 0.817
1.538GluMet: 1.538 ± 0.224
1.801GluAsn: 1.801 ± 0.234
2.504GluPro: 2.504 ± 0.527
3.031GluGln: 3.031 ± 0.413
5.008GluArg: 5.008 ± 0.82
4.13GluSer: 4.13 ± 0.347
3.646GluThr: 3.646 ± 0.503
5.184GluVal: 5.184 ± 0.403
1.186GluTrp: 1.186 ± 0.277
2.416GluTyr: 2.416 ± 0.407
0.0GluXaa: 0.0 ± 0.0
Phe
3.471PheAla: 3.471 ± 0.337
0.351PheCys: 0.351 ± 0.137
2.812PheAsp: 2.812 ± 0.37
2.768PheGlu: 2.768 ± 0.458
1.098PhePhe: 1.098 ± 0.212
3.427PheGly: 3.427 ± 0.446
0.703PheHis: 0.703 ± 0.221
1.757PheIle: 1.757 ± 0.352
2.109PheLys: 2.109 ± 0.296
1.933PheLeu: 1.933 ± 0.312
1.01PheMet: 1.01 ± 0.201
1.669PheAsn: 1.669 ± 0.218
1.626PhePro: 1.626 ± 0.392
1.098PheGln: 1.098 ± 0.177
1.626PheArg: 1.626 ± 0.312
2.416PheSer: 2.416 ± 0.377
2.109PheThr: 2.109 ± 0.301
1.933PheVal: 1.933 ± 0.333
0.351PheTrp: 0.351 ± 0.117
0.967PheTyr: 0.967 ± 0.25
0.0PheXaa: 0.0 ± 0.0
Gly
6.238GlyAla: 6.238 ± 0.947
0.835GlyCys: 0.835 ± 0.262
4.833GlyAsp: 4.833 ± 0.508
4.569GlyGlu: 4.569 ± 0.454
2.944GlyPhe: 2.944 ± 0.363
8.128GlyGly: 8.128 ± 1.139
1.186GlyHis: 1.186 ± 0.229
3.954GlyIle: 3.954 ± 0.663
4.92GlyLys: 4.92 ± 0.524
5.316GlyLeu: 5.316 ± 0.679
1.845GlyMet: 1.845 ± 0.294
3.602GlyAsn: 3.602 ± 0.438
3.866GlyPro: 3.866 ± 0.412
2.241GlyGln: 2.241 ± 0.38
4.218GlyArg: 4.218 ± 0.579
5.536GlySer: 5.536 ± 0.577
5.755GlyThr: 5.755 ± 0.584
5.316GlyVal: 5.316 ± 0.448
1.626GlyTrp: 1.626 ± 0.323
2.592GlyTyr: 2.592 ± 0.346
0.0GlyXaa: 0.0 ± 0.0
His
1.274HisAla: 1.274 ± 0.247
0.308HisCys: 0.308 ± 0.122
0.747HisAsp: 0.747 ± 0.226
0.923HisGlu: 0.923 ± 0.239
0.395HisPhe: 0.395 ± 0.121
1.054HisGly: 1.054 ± 0.239
0.308HisHis: 0.308 ± 0.138
1.098HisIle: 1.098 ± 0.233
0.967HisLys: 0.967 ± 0.251
1.538HisLeu: 1.538 ± 0.254
0.439HisMet: 0.439 ± 0.135
0.659HisAsn: 0.659 ± 0.146
1.142HisPro: 1.142 ± 0.248
0.791HisGln: 0.791 ± 0.234
1.054HisArg: 1.054 ± 0.232
1.054HisSer: 1.054 ± 0.228
0.747HisThr: 0.747 ± 0.181
1.01HisVal: 1.01 ± 0.226
0.264HisTrp: 0.264 ± 0.135
0.571HisTyr: 0.571 ± 0.169
0.0HisXaa: 0.0 ± 0.0
Ile
5.316IleAla: 5.316 ± 0.842
0.308IleCys: 0.308 ± 0.104
4.042IleAsp: 4.042 ± 0.454
3.954IleGlu: 3.954 ± 0.481
1.845IlePhe: 1.845 ± 0.305
4.789IleGly: 4.789 ± 0.659
0.703IleHis: 0.703 ± 0.205
3.119IleIle: 3.119 ± 0.385
3.075IleLys: 3.075 ± 0.446
2.9IleLeu: 2.9 ± 0.391
0.923IleMet: 0.923 ± 0.173
2.197IleAsn: 2.197 ± 0.258
2.768IlePro: 2.768 ± 0.416
2.68IleGln: 2.68 ± 0.371
3.031IleArg: 3.031 ± 0.377
3.163IleSer: 3.163 ± 0.313
2.636IleThr: 2.636 ± 0.279
3.559IleVal: 3.559 ± 0.432
0.879IleTrp: 0.879 ± 0.189
0.967IleTyr: 0.967 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
5.008LysAla: 5.008 ± 0.787
0.351LysCys: 0.351 ± 0.12
3.207LysAsp: 3.207 ± 0.601
2.68LysGlu: 2.68 ± 0.381
1.977LysPhe: 1.977 ± 0.316
3.339LysGly: 3.339 ± 0.538
0.791LysHis: 0.791 ± 0.224
2.592LysIle: 2.592 ± 0.339
3.207LysLys: 3.207 ± 0.405
4.701LysLeu: 4.701 ± 0.495
1.318LysMet: 1.318 ± 0.368
2.372LysAsn: 2.372 ± 0.353
2.416LysPro: 2.416 ± 0.464
1.933LysGln: 1.933 ± 0.337
3.339LysArg: 3.339 ± 0.52
2.68LysSer: 2.68 ± 0.576
2.856LysThr: 2.856 ± 0.288
3.866LysVal: 3.866 ± 0.414
0.615LysTrp: 0.615 ± 0.19
1.538LysTyr: 1.538 ± 0.218
0.0LysXaa: 0.0 ± 0.0
Leu
6.854LeuAla: 6.854 ± 0.73
0.747LeuCys: 0.747 ± 0.234
4.437LeuAsp: 4.437 ± 0.429
6.326LeuGlu: 6.326 ± 0.697
2.724LeuPhe: 2.724 ± 0.337
5.14LeuGly: 5.14 ± 0.473
1.318LeuHis: 1.318 ± 0.241
3.602LeuIle: 3.602 ± 0.392
3.69LeuLys: 3.69 ± 0.469
5.667LeuLeu: 5.667 ± 0.477
2.065LeuMet: 2.065 ± 0.295
3.031LeuAsn: 3.031 ± 0.433
3.866LeuPro: 3.866 ± 0.388
3.119LeuGln: 3.119 ± 0.575
4.305LeuArg: 4.305 ± 0.491
4.305LeuSer: 4.305 ± 0.476
4.701LeuThr: 4.701 ± 0.469
4.657LeuVal: 4.657 ± 0.493
1.318LeuTrp: 1.318 ± 0.228
1.626LeuTyr: 1.626 ± 0.4
0.0LeuXaa: 0.0 ± 0.0
Met
2.46MetAla: 2.46 ± 0.358
0.22MetCys: 0.22 ± 0.109
1.142MetAsp: 1.142 ± 0.209
1.318MetGlu: 1.318 ± 0.26
0.659MetPhe: 0.659 ± 0.207
1.626MetGly: 1.626 ± 0.285
0.395MetHis: 0.395 ± 0.166
0.967MetIle: 0.967 ± 0.191
1.626MetLys: 1.626 ± 0.25
1.889MetLeu: 1.889 ± 0.287
0.703MetMet: 0.703 ± 0.217
1.274MetAsn: 1.274 ± 0.227
1.45MetPro: 1.45 ± 0.316
0.967MetGln: 0.967 ± 0.231
1.362MetArg: 1.362 ± 0.239
1.933MetSer: 1.933 ± 0.288
2.065MetThr: 2.065 ± 0.299
1.713MetVal: 1.713 ± 0.409
0.176MetTrp: 0.176 ± 0.071
1.054MetTyr: 1.054 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
4.261AsnAla: 4.261 ± 0.749
0.308AsnCys: 0.308 ± 0.148
2.416AsnAsp: 2.416 ± 0.33
1.757AsnGlu: 1.757 ± 0.244
1.23AsnPhe: 1.23 ± 0.253
4.174AsnGly: 4.174 ± 0.467
0.527AsnHis: 0.527 ± 0.166
2.46AsnIle: 2.46 ± 0.285
2.021AsnLys: 2.021 ± 0.35
3.031AsnLeu: 3.031 ± 0.353
1.098AsnMet: 1.098 ± 0.213
1.801AsnAsn: 1.801 ± 0.275
2.636AsnPro: 2.636 ± 0.3
1.757AsnGln: 1.757 ± 0.255
2.021AsnArg: 2.021 ± 0.39
2.372AsnSer: 2.372 ± 0.353
2.109AsnThr: 2.109 ± 0.276
2.724AsnVal: 2.724 ± 0.34
0.571AsnTrp: 0.571 ± 0.144
1.318AsnTyr: 1.318 ± 0.251
0.0AsnXaa: 0.0 ± 0.0
Pro
4.92ProAla: 4.92 ± 0.523
0.264ProCys: 0.264 ± 0.119
3.646ProAsp: 3.646 ± 0.492
4.13ProGlu: 4.13 ± 0.767
1.845ProPhe: 1.845 ± 0.301
4.877ProGly: 4.877 ± 0.555
0.879ProHis: 0.879 ± 0.234
2.372ProIle: 2.372 ± 0.397
2.724ProLys: 2.724 ± 0.449
2.856ProLeu: 2.856 ± 0.411
1.318ProMet: 1.318 ± 0.194
2.285ProAsn: 2.285 ± 0.338
2.592ProPro: 2.592 ± 0.49
1.889ProGln: 1.889 ± 0.407
1.494ProArg: 1.494 ± 0.332
3.427ProSer: 3.427 ± 0.581
3.427ProThr: 3.427 ± 0.579
4.481ProVal: 4.481 ± 0.395
1.098ProTrp: 1.098 ± 0.252
1.757ProTyr: 1.757 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
4.042GlnAla: 4.042 ± 0.717
0.176GlnCys: 0.176 ± 0.081
1.582GlnAsp: 1.582 ± 0.359
2.636GlnGlu: 2.636 ± 0.417
1.494GlnPhe: 1.494 ± 0.272
2.241GlnGly: 2.241 ± 0.311
0.615GlnHis: 0.615 ± 0.17
2.197GlnIle: 2.197 ± 0.475
1.626GlnLys: 1.626 ± 0.299
2.987GlnLeu: 2.987 ± 0.398
0.791GlnMet: 0.791 ± 0.175
1.362GlnAsn: 1.362 ± 0.348
1.801GlnPro: 1.801 ± 0.345
1.626GlnGln: 1.626 ± 0.282
2.592GlnArg: 2.592 ± 0.483
2.021GlnSer: 2.021 ± 0.314
1.977GlnThr: 1.977 ± 0.33
3.383GlnVal: 3.383 ± 0.298
0.703GlnTrp: 0.703 ± 0.214
1.186GlnTyr: 1.186 ± 0.242
0.0GlnXaa: 0.0 ± 0.0
Arg
4.745ArgAla: 4.745 ± 0.52
0.791ArgCys: 0.791 ± 0.227
2.812ArgAsp: 2.812 ± 0.467
3.954ArgGlu: 3.954 ± 0.482
2.021ArgPhe: 2.021 ± 0.308
3.778ArgGly: 3.778 ± 0.371
1.362ArgHis: 1.362 ± 0.294
3.295ArgIle: 3.295 ± 0.458
2.944ArgLys: 2.944 ± 0.433
4.833ArgLeu: 4.833 ± 0.571
1.713ArgMet: 1.713 ± 0.252
2.46ArgAsn: 2.46 ± 0.371
3.295ArgPro: 3.295 ± 0.579
2.328ArgGln: 2.328 ± 0.342
4.481ArgArg: 4.481 ± 0.727
2.548ArgSer: 2.548 ± 0.332
2.812ArgThr: 2.812 ± 0.332
4.349ArgVal: 4.349 ± 0.624
0.659ArgTrp: 0.659 ± 0.21
2.153ArgTyr: 2.153 ± 0.314
0.0ArgXaa: 0.0 ± 0.0
Ser
5.843SerAla: 5.843 ± 0.514
0.308SerCys: 0.308 ± 0.124
4.13SerAsp: 4.13 ± 0.538
3.339SerGlu: 3.339 ± 0.387
2.46SerPhe: 2.46 ± 0.455
4.833SerGly: 4.833 ± 0.499
1.098SerHis: 1.098 ± 0.264
4.218SerIle: 4.218 ± 0.512
3.163SerLys: 3.163 ± 0.327
3.954SerLeu: 3.954 ± 0.481
1.713SerMet: 1.713 ± 0.212
2.46SerAsn: 2.46 ± 0.381
3.075SerPro: 3.075 ± 0.364
1.845SerGln: 1.845 ± 0.306
3.602SerArg: 3.602 ± 0.334
5.14SerSer: 5.14 ± 0.595
3.822SerThr: 3.822 ± 0.362
4.525SerVal: 4.525 ± 0.474
0.923SerTrp: 0.923 ± 0.197
1.845SerTyr: 1.845 ± 0.257
0.0SerXaa: 0.0 ± 0.0
Thr
6.414ThrAla: 6.414 ± 0.923
0.571ThrCys: 0.571 ± 0.219
3.646ThrAsp: 3.646 ± 0.504
3.471ThrGlu: 3.471 ± 0.466
1.933ThrPhe: 1.933 ± 0.294
5.623ThrGly: 5.623 ± 0.482
0.879ThrHis: 0.879 ± 0.247
2.724ThrIle: 2.724 ± 0.342
2.46ThrLys: 2.46 ± 0.287
4.437ThrLeu: 4.437 ± 0.413
1.054ThrMet: 1.054 ± 0.275
2.372ThrAsn: 2.372 ± 0.319
3.998ThrPro: 3.998 ± 0.638
1.45ThrGln: 1.45 ± 0.273
2.812ThrArg: 2.812 ± 0.405
4.174ThrSer: 4.174 ± 0.578
4.964ThrThr: 4.964 ± 0.684
5.404ThrVal: 5.404 ± 0.555
1.23ThrTrp: 1.23 ± 0.207
1.801ThrTyr: 1.801 ± 0.365
0.0ThrXaa: 0.0 ± 0.0
Val
6.897ValAla: 6.897 ± 0.868
0.703ValCys: 0.703 ± 0.236
4.349ValAsp: 4.349 ± 0.418
5.579ValGlu: 5.579 ± 0.421
2.9ValPhe: 2.9 ± 0.356
5.052ValGly: 5.052 ± 0.648
1.538ValHis: 1.538 ± 0.282
3.515ValIle: 3.515 ± 0.474
3.559ValLys: 3.559 ± 0.398
4.964ValLeu: 4.964 ± 0.464
1.669ValMet: 1.669 ± 0.26
2.768ValAsn: 2.768 ± 0.299
3.866ValPro: 3.866 ± 0.467
2.285ValGln: 2.285 ± 0.353
4.218ValArg: 4.218 ± 0.338
5.008ValSer: 5.008 ± 0.635
4.525ValThr: 4.525 ± 0.463
4.964ValVal: 4.964 ± 0.524
1.406ValTrp: 1.406 ± 0.246
1.713ValTyr: 1.713 ± 0.316
0.0ValXaa: 0.0 ± 0.0
Trp
1.669TrpAla: 1.669 ± 0.536
0.395TrpCys: 0.395 ± 0.152
1.186TrpAsp: 1.186 ± 0.254
1.362TrpGlu: 1.362 ± 0.235
0.703TrpPhe: 0.703 ± 0.165
1.318TrpGly: 1.318 ± 0.238
0.351TrpHis: 0.351 ± 0.152
0.703TrpIle: 0.703 ± 0.204
1.01TrpLys: 1.01 ± 0.196
1.054TrpLeu: 1.054 ± 0.21
0.264TrpMet: 0.264 ± 0.107
0.703TrpAsn: 0.703 ± 0.238
0.791TrpPro: 0.791 ± 0.168
0.703TrpGln: 0.703 ± 0.242
1.23TrpArg: 1.23 ± 0.244
0.967TrpSer: 0.967 ± 0.242
1.142TrpThr: 1.142 ± 0.254
0.879TrpVal: 0.879 ± 0.206
0.176TrpTrp: 0.176 ± 0.087
0.527TrpTyr: 0.527 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.592TyrAla: 2.592 ± 0.416
0.483TyrCys: 0.483 ± 0.158
1.889TyrAsp: 1.889 ± 0.338
1.801TyrGlu: 1.801 ± 0.377
1.142TyrPhe: 1.142 ± 0.248
2.636TyrGly: 2.636 ± 0.365
0.395TyrHis: 0.395 ± 0.119
1.318TyrIle: 1.318 ± 0.295
1.406TyrLys: 1.406 ± 0.205
2.372TyrLeu: 2.372 ± 0.368
0.879TyrMet: 0.879 ± 0.227
1.494TyrAsn: 1.494 ± 0.283
1.362TyrPro: 1.362 ± 0.248
1.098TyrGln: 1.098 ± 0.221
2.197TyrArg: 2.197 ± 0.352
1.45TyrSer: 1.45 ± 0.31
1.889TyrThr: 1.889 ± 0.34
2.241TyrVal: 2.241 ± 0.443
0.615TyrTrp: 0.615 ± 0.173
0.923TyrTyr: 0.923 ± 0.247
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (22763 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski