Amino acid dipepetide frequency for Gordonia phage Leonard

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.1AlaAla: 19.1 ± 2.194
0.994AlaCys: 0.994 ± 0.231
9.262AlaAsp: 9.262 ± 0.803
8.059AlaGlu: 8.059 ± 0.815
2.773AlaPhe: 2.773 ± 0.493
9.262AlaGly: 9.262 ± 0.84
2.041AlaHis: 2.041 ± 0.34
5.181AlaIle: 5.181 ± 0.671
2.878AlaLys: 2.878 ± 0.384
10.518AlaLeu: 10.518 ± 0.773
3.14AlaMet: 3.14 ± 0.414
2.826AlaAsn: 2.826 ± 0.433
7.274AlaPro: 7.274 ± 0.544
5.181AlaGln: 5.181 ± 0.576
9.995AlaArg: 9.995 ± 0.775
5.338AlaSer: 5.338 ± 0.596
7.431AlaThr: 7.431 ± 0.802
8.216AlaVal: 8.216 ± 0.684
2.512AlaTrp: 2.512 ± 0.373
2.564AlaTyr: 2.564 ± 0.349
0.0AlaXaa: 0.0 ± 0.0
Cys
0.785CysAla: 0.785 ± 0.218
0.157CysCys: 0.157 ± 0.108
0.837CysAsp: 0.837 ± 0.2
0.628CysGlu: 0.628 ± 0.212
0.209CysPhe: 0.209 ± 0.088
1.256CysGly: 1.256 ± 0.336
0.209CysHis: 0.209 ± 0.112
0.471CysIle: 0.471 ± 0.163
0.105CysLys: 0.105 ± 0.073
0.262CysLeu: 0.262 ± 0.158
0.157CysMet: 0.157 ± 0.086
0.471CysAsn: 0.471 ± 0.151
1.151CysPro: 1.151 ± 0.266
0.262CysGln: 0.262 ± 0.111
0.733CysArg: 0.733 ± 0.198
0.471CysSer: 0.471 ± 0.147
0.576CysThr: 0.576 ± 0.176
0.314CysVal: 0.314 ± 0.11
0.105CysTrp: 0.105 ± 0.077
0.052CysTyr: 0.052 ± 0.05
0.0CysXaa: 0.0 ± 0.0
Asp
9.524AspAla: 9.524 ± 0.692
0.837AspCys: 0.837 ± 0.21
7.64AspAsp: 7.64 ± 0.927
6.227AspGlu: 6.227 ± 0.938
0.994AspPhe: 0.994 ± 0.196
6.803AspGly: 6.803 ± 0.538
1.518AspHis: 1.518 ± 0.346
2.093AspIle: 2.093 ± 0.23
1.465AspLys: 1.465 ± 0.217
5.233AspLeu: 5.233 ± 0.596
1.57AspMet: 1.57 ± 0.294
1.832AspAsn: 1.832 ± 0.35
6.018AspPro: 6.018 ± 0.664
2.25AspGln: 2.25 ± 0.365
4.762AspArg: 4.762 ± 0.446
2.878AspSer: 2.878 ± 0.417
4.919AspThr: 4.919 ± 0.502
5.181AspVal: 5.181 ± 0.444
1.779AspTrp: 1.779 ± 0.286
1.308AspTyr: 1.308 ± 0.273
0.0AspXaa: 0.0 ± 0.0
Glu
6.279GluAla: 6.279 ± 0.662
0.471GluCys: 0.471 ± 0.17
2.564GluAsp: 2.564 ± 0.391
1.361GluGlu: 1.361 ± 0.252
2.093GluPhe: 2.093 ± 0.281
3.297GluGly: 3.297 ± 0.423
1.988GluHis: 1.988 ± 0.369
2.041GluIle: 2.041 ± 0.331
0.68GluLys: 0.68 ± 0.222
5.495GluLeu: 5.495 ± 0.722
1.361GluMet: 1.361 ± 0.266
1.675GluAsn: 1.675 ± 0.353
3.454GluPro: 3.454 ± 0.465
4.186GluGln: 4.186 ± 0.615
4.448GluArg: 4.448 ± 0.705
2.459GluSer: 2.459 ± 0.405
3.297GluThr: 3.297 ± 0.417
4.396GluVal: 4.396 ± 0.561
1.936GluTrp: 1.936 ± 0.367
1.727GluTyr: 1.727 ± 0.249
0.0GluXaa: 0.0 ± 0.0
Phe
2.826PheAla: 2.826 ± 0.4
0.262PheCys: 0.262 ± 0.144
2.355PheAsp: 2.355 ± 0.383
1.361PheGlu: 1.361 ± 0.264
0.628PhePhe: 0.628 ± 0.261
2.25PheGly: 2.25 ± 0.305
0.733PheHis: 0.733 ± 0.202
0.733PheIle: 0.733 ± 0.248
0.68PheLys: 0.68 ± 0.209
1.413PheLeu: 1.413 ± 0.26
0.471PheMet: 0.471 ± 0.161
0.733PheAsn: 0.733 ± 0.197
1.256PhePro: 1.256 ± 0.284
0.628PheGln: 0.628 ± 0.175
1.308PheArg: 1.308 ± 0.28
0.785PheSer: 0.785 ± 0.165
1.832PheThr: 1.832 ± 0.27
1.936PheVal: 1.936 ± 0.279
0.523PheTrp: 0.523 ± 0.164
0.314PheTyr: 0.314 ± 0.14
0.0PheXaa: 0.0 ± 0.0
Gly
9.367GlyAla: 9.367 ± 0.936
0.471GlyCys: 0.471 ± 0.179
6.279GlyAsp: 6.279 ± 0.681
4.029GlyGlu: 4.029 ± 0.435
1.675GlyPhe: 1.675 ± 0.354
8.634GlyGly: 8.634 ± 1.279
1.988GlyHis: 1.988 ± 0.312
4.448GlyIle: 4.448 ± 0.535
2.459GlyLys: 2.459 ± 0.429
4.919GlyLeu: 4.919 ± 0.618
1.884GlyMet: 1.884 ± 0.36
2.93GlyAsn: 2.93 ± 0.52
3.611GlyPro: 3.611 ± 0.52
3.087GlyGln: 3.087 ± 0.363
5.39GlyArg: 5.39 ± 0.565
5.024GlySer: 5.024 ± 0.654
6.018GlyThr: 6.018 ± 0.771
6.646GlyVal: 6.646 ± 0.617
1.413GlyTrp: 1.413 ± 0.247
2.25GlyTyr: 2.25 ± 0.409
0.0GlyXaa: 0.0 ± 0.0
His
2.198HisAla: 2.198 ± 0.388
0.262HisCys: 0.262 ± 0.12
1.727HisAsp: 1.727 ± 0.336
1.361HisGlu: 1.361 ± 0.245
0.314HisPhe: 0.314 ± 0.128
1.675HisGly: 1.675 ± 0.261
0.628HisHis: 0.628 ± 0.228
0.994HisIle: 0.994 ± 0.246
0.209HisLys: 0.209 ± 0.09
2.041HisLeu: 2.041 ± 0.389
0.576HisMet: 0.576 ± 0.167
0.471HisAsn: 0.471 ± 0.161
1.465HisPro: 1.465 ± 0.326
0.89HisGln: 0.89 ± 0.182
2.407HisArg: 2.407 ± 0.428
0.785HisSer: 0.785 ± 0.222
1.622HisThr: 1.622 ± 0.304
1.57HisVal: 1.57 ± 0.308
0.262HisTrp: 0.262 ± 0.106
0.576HisTyr: 0.576 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
5.913IleAla: 5.913 ± 0.754
0.314IleCys: 0.314 ± 0.149
4.134IleAsp: 4.134 ± 0.447
2.983IleGlu: 2.983 ± 0.297
0.523IlePhe: 0.523 ± 0.16
4.239IleGly: 4.239 ± 0.764
0.628IleHis: 0.628 ± 0.217
1.465IleIle: 1.465 ± 0.311
1.465IleLys: 1.465 ± 0.408
2.25IleLeu: 2.25 ± 0.335
0.471IleMet: 0.471 ± 0.134
1.779IleAsn: 1.779 ± 0.225
2.564IlePro: 2.564 ± 0.344
0.994IleGln: 0.994 ± 0.258
2.616IleArg: 2.616 ± 0.323
1.518IleSer: 1.518 ± 0.306
3.401IleThr: 3.401 ± 0.367
3.558IleVal: 3.558 ± 0.419
0.733IleTrp: 0.733 ± 0.187
0.733IleTyr: 0.733 ± 0.16
0.0IleXaa: 0.0 ± 0.0
Lys
4.029LysAla: 4.029 ± 0.37
0.262LysCys: 0.262 ± 0.125
1.099LysAsp: 1.099 ± 0.236
0.471LysGlu: 0.471 ± 0.139
0.837LysPhe: 0.837 ± 0.212
1.57LysGly: 1.57 ± 0.275
0.471LysHis: 0.471 ± 0.13
1.256LysIle: 1.256 ± 0.252
0.314LysLys: 0.314 ± 0.11
2.459LysLeu: 2.459 ± 0.352
0.733LysMet: 0.733 ± 0.212
0.942LysAsn: 0.942 ± 0.24
1.465LysPro: 1.465 ± 0.243
0.994LysGln: 0.994 ± 0.274
2.302LysArg: 2.302 ± 0.332
1.413LysSer: 1.413 ± 0.36
1.518LysThr: 1.518 ± 0.291
2.093LysVal: 2.093 ± 0.296
0.314LysTrp: 0.314 ± 0.126
1.047LysTyr: 1.047 ± 0.211
0.0LysXaa: 0.0 ± 0.0
Leu
10.204LeuAla: 10.204 ± 0.724
0.628LeuCys: 0.628 ± 0.191
5.547LeuAsp: 5.547 ± 0.813
5.128LeuGlu: 5.128 ± 0.521
1.675LeuPhe: 1.675 ± 0.33
7.378LeuGly: 7.378 ± 0.966
1.361LeuHis: 1.361 ± 0.288
3.454LeuIle: 3.454 ± 0.42
2.041LeuLys: 2.041 ± 0.293
5.547LeuLeu: 5.547 ± 0.543
1.622LeuMet: 1.622 ± 0.223
2.041LeuAsn: 2.041 ± 0.341
5.076LeuPro: 5.076 ± 0.489
1.832LeuGln: 1.832 ± 0.455
5.024LeuArg: 5.024 ± 0.506
3.035LeuSer: 3.035 ± 0.424
5.495LeuThr: 5.495 ± 0.423
6.122LeuVal: 6.122 ± 0.57
1.518LeuTrp: 1.518 ± 0.319
1.465LeuTyr: 1.465 ± 0.276
0.0LeuXaa: 0.0 ± 0.0
Met
2.407MetAla: 2.407 ± 0.406
0.157MetCys: 0.157 ± 0.093
1.047MetAsp: 1.047 ± 0.236
0.419MetGlu: 0.419 ± 0.104
0.366MetPhe: 0.366 ± 0.126
1.308MetGly: 1.308 ± 0.255
0.366MetHis: 0.366 ± 0.137
1.151MetIle: 1.151 ± 0.249
0.523MetLys: 0.523 ± 0.136
1.779MetLeu: 1.779 ± 0.323
0.733MetMet: 0.733 ± 0.167
0.785MetAsn: 0.785 ± 0.18
2.25MetPro: 2.25 ± 0.355
0.576MetGln: 0.576 ± 0.189
2.25MetArg: 2.25 ± 0.343
1.518MetSer: 1.518 ± 0.305
3.349MetThr: 3.349 ± 0.35
1.518MetVal: 1.518 ± 0.32
0.523MetTrp: 0.523 ± 0.162
0.262MetTyr: 0.262 ± 0.121
0.0MetXaa: 0.0 ± 0.0
Asn
3.087AsnAla: 3.087 ± 0.524
0.157AsnCys: 0.157 ± 0.09
2.198AsnAsp: 2.198 ± 0.361
1.361AsnGlu: 1.361 ± 0.297
0.523AsnPhe: 0.523 ± 0.146
3.035AsnGly: 3.035 ± 0.41
0.68AsnHis: 0.68 ± 0.168
1.256AsnIle: 1.256 ± 0.303
0.68AsnLys: 0.68 ± 0.152
2.459AsnLeu: 2.459 ± 0.254
0.366AsnMet: 0.366 ± 0.139
0.837AsnAsn: 0.837 ± 0.233
2.302AsnPro: 2.302 ± 0.37
0.628AsnGln: 0.628 ± 0.163
1.57AsnArg: 1.57 ± 0.326
1.57AsnSer: 1.57 ± 0.293
1.779AsnThr: 1.779 ± 0.284
2.093AsnVal: 2.093 ± 0.42
0.68AsnTrp: 0.68 ± 0.179
0.419AsnTyr: 0.419 ± 0.125
0.0AsnXaa: 0.0 ± 0.0
Pro
7.797ProAla: 7.797 ± 0.647
0.733ProCys: 0.733 ± 0.208
6.227ProAsp: 6.227 ± 0.583
3.506ProGlu: 3.506 ± 0.577
1.57ProPhe: 1.57 ± 0.276
5.233ProGly: 5.233 ± 0.569
1.675ProHis: 1.675 ± 0.331
2.459ProIle: 2.459 ± 0.342
1.936ProLys: 1.936 ± 0.335
3.087ProLeu: 3.087 ± 0.417
1.465ProMet: 1.465 ± 0.257
2.041ProAsn: 2.041 ± 0.345
5.39ProPro: 5.39 ± 0.691
1.779ProGln: 1.779 ± 0.287
4.971ProArg: 4.971 ± 0.79
2.407ProSer: 2.407 ± 0.358
5.181ProThr: 5.181 ± 0.684
5.024ProVal: 5.024 ± 0.461
1.622ProTrp: 1.622 ± 0.333
1.413ProTyr: 1.413 ± 0.278
0.0ProXaa: 0.0 ± 0.0
Gln
4.71GlnAla: 4.71 ± 0.591
0.471GlnCys: 0.471 ± 0.133
1.256GlnAsp: 1.256 ± 0.252
1.099GlnGlu: 1.099 ± 0.246
1.151GlnPhe: 1.151 ± 0.213
1.779GlnGly: 1.779 ± 0.381
0.628GlnHis: 0.628 ± 0.178
1.832GlnIle: 1.832 ± 0.304
0.994GlnLys: 0.994 ± 0.246
4.029GlnLeu: 4.029 ± 0.463
1.413GlnMet: 1.413 ± 0.227
0.89GlnAsn: 0.89 ± 0.211
2.145GlnPro: 2.145 ± 0.287
2.512GlnGln: 2.512 ± 0.437
3.349GlnArg: 3.349 ± 0.448
1.779GlnSer: 1.779 ± 0.316
1.832GlnThr: 1.832 ± 0.288
3.663GlnVal: 3.663 ± 0.371
0.89GlnTrp: 0.89 ± 0.217
0.523GlnTyr: 0.523 ± 0.202
0.0GlnXaa: 0.0 ± 0.0
Arg
8.268ArgAla: 8.268 ± 0.867
0.994ArgCys: 0.994 ± 0.227
4.971ArgAsp: 4.971 ± 0.482
4.396ArgGlu: 4.396 ± 0.587
1.361ArgPhe: 1.361 ± 0.252
4.919ArgGly: 4.919 ± 0.577
1.936ArgHis: 1.936 ± 0.354
3.82ArgIle: 3.82 ± 0.435
2.93ArgLys: 2.93 ± 0.415
5.861ArgLeu: 5.861 ± 0.596
2.041ArgMet: 2.041 ± 0.342
2.25ArgAsn: 2.25 ± 0.345
3.663ArgPro: 3.663 ± 0.632
3.087ArgGln: 3.087 ± 0.354
8.059ArgArg: 8.059 ± 0.999
3.087ArgSer: 3.087 ± 0.363
4.553ArgThr: 4.553 ± 0.502
5.756ArgVal: 5.756 ± 0.631
1.465ArgTrp: 1.465 ± 0.325
1.936ArgTyr: 1.936 ± 0.328
0.0ArgXaa: 0.0 ± 0.0
Ser
6.122SerAla: 6.122 ± 0.78
0.262SerCys: 0.262 ± 0.152
2.616SerAsp: 2.616 ± 0.374
1.779SerGlu: 1.779 ± 0.324
0.837SerPhe: 0.837 ± 0.201
4.971SerGly: 4.971 ± 0.516
0.785SerHis: 0.785 ± 0.209
1.832SerIle: 1.832 ± 0.395
1.308SerLys: 1.308 ± 0.238
3.82SerLeu: 3.82 ± 0.423
1.308SerMet: 1.308 ± 0.229
0.942SerAsn: 0.942 ± 0.198
2.407SerPro: 2.407 ± 0.367
1.413SerGln: 1.413 ± 0.294
2.721SerArg: 2.721 ± 0.414
3.454SerSer: 3.454 ± 0.604
4.029SerThr: 4.029 ± 0.381
3.925SerVal: 3.925 ± 0.466
1.047SerTrp: 1.047 ± 0.221
0.942SerTyr: 0.942 ± 0.23
0.0SerXaa: 0.0 ± 0.0
Thr
8.32ThrAla: 8.32 ± 0.936
0.68ThrCys: 0.68 ± 0.143
5.338ThrAsp: 5.338 ± 0.608
4.5ThrGlu: 4.5 ± 0.614
2.041ThrPhe: 2.041 ± 0.3
6.227ThrGly: 6.227 ± 0.629
1.518ThrHis: 1.518 ± 0.288
3.558ThrIle: 3.558 ± 0.446
1.832ThrLys: 1.832 ± 0.227
4.657ThrLeu: 4.657 ± 0.504
1.57ThrMet: 1.57 ± 0.25
1.727ThrAsn: 1.727 ± 0.333
6.332ThrPro: 6.332 ± 0.68
1.413ThrGln: 1.413 ± 0.239
4.762ThrArg: 4.762 ± 0.477
3.349ThrSer: 3.349 ± 0.531
5.076ThrThr: 5.076 ± 0.492
5.076ThrVal: 5.076 ± 0.584
1.204ThrTrp: 1.204 ± 0.26
1.256ThrTyr: 1.256 ± 0.236
0.0ThrXaa: 0.0 ± 0.0
Val
9.419ValAla: 9.419 ± 0.701
0.576ValCys: 0.576 ± 0.2
6.332ValAsp: 6.332 ± 0.625
4.553ValGlu: 4.553 ± 0.463
1.936ValPhe: 1.936 ± 0.285
5.547ValGly: 5.547 ± 0.545
1.832ValHis: 1.832 ± 0.294
2.93ValIle: 2.93 ± 0.394
2.093ValLys: 2.093 ± 0.302
6.227ValLeu: 6.227 ± 0.545
1.361ValMet: 1.361 ± 0.267
1.518ValAsn: 1.518 ± 0.282
4.71ValPro: 4.71 ± 0.414
3.349ValGln: 3.349 ± 0.374
5.181ValArg: 5.181 ± 0.483
3.035ValSer: 3.035 ± 0.389
5.808ValThr: 5.808 ± 0.57
6.122ValVal: 6.122 ± 0.697
1.57ValTrp: 1.57 ± 0.324
1.988ValTyr: 1.988 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
2.355TrpAla: 2.355 ± 0.337
0.419TrpCys: 0.419 ± 0.134
1.308TrpAsp: 1.308 ± 0.347
0.68TrpGlu: 0.68 ± 0.179
0.733TrpPhe: 0.733 ± 0.188
1.099TrpGly: 1.099 ± 0.208
0.628TrpHis: 0.628 ± 0.158
0.628TrpIle: 0.628 ± 0.161
0.523TrpLys: 0.523 ± 0.16
2.355TrpLeu: 2.355 ± 0.374
0.523TrpMet: 0.523 ± 0.183
0.419TrpAsn: 0.419 ± 0.212
1.57TrpPro: 1.57 ± 0.342
1.099TrpGln: 1.099 ± 0.26
1.779TrpArg: 1.779 ± 0.31
1.57TrpSer: 1.57 ± 0.354
1.204TrpThr: 1.204 ± 0.275
1.465TrpVal: 1.465 ± 0.283
0.576TrpTrp: 0.576 ± 0.222
0.209TrpTyr: 0.209 ± 0.085
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.832TyrAla: 1.832 ± 0.316
0.105TyrCys: 0.105 ± 0.066
2.093TyrAsp: 2.093 ± 0.367
1.308TyrGlu: 1.308 ± 0.298
0.785TyrPhe: 0.785 ± 0.183
2.041TyrGly: 2.041 ± 0.239
0.419TyrHis: 0.419 ± 0.164
0.68TyrIle: 0.68 ± 0.161
0.471TyrLys: 0.471 ± 0.166
1.779TyrLeu: 1.779 ± 0.246
0.471TyrMet: 0.471 ± 0.153
0.576TyrAsn: 0.576 ± 0.155
1.465TyrPro: 1.465 ± 0.269
0.785TyrGln: 0.785 ± 0.221
1.727TyrArg: 1.727 ± 0.3
1.047TyrSer: 1.047 ± 0.193
1.518TyrThr: 1.518 ± 0.291
1.413TyrVal: 1.413 ± 0.302
0.471TyrTrp: 0.471 ± 0.134
0.628TyrTyr: 0.628 ± 0.206
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (19111 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski