Amino acid dipepetide frequency for Gordonia phage NadineRae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.386AlaAla: 13.386 ± 1.427
1.022AlaCys: 1.022 ± 0.224
7.107AlaAsp: 7.107 ± 0.635
7.399AlaGlu: 7.399 ± 0.731
3.261AlaPhe: 3.261 ± 0.373
8.518AlaGly: 8.518 ± 0.804
2.677AlaHis: 2.677 ± 0.521
4.868AlaIle: 4.868 ± 0.517
4.916AlaLys: 4.916 ± 0.462
8.713AlaLeu: 8.713 ± 0.957
2.288AlaMet: 2.288 ± 0.351
3.018AlaAsn: 3.018 ± 0.38
4.283AlaPro: 4.283 ± 0.563
3.845AlaGln: 3.845 ± 0.517
6.766AlaArg: 6.766 ± 0.828
5.16AlaSer: 5.16 ± 0.602
5.89AlaThr: 5.89 ± 0.618
6.912AlaVal: 6.912 ± 0.629
1.655AlaTrp: 1.655 ± 0.329
2.336AlaTyr: 2.336 ± 0.257
0.0AlaXaa: 0.0 ± 0.0
Cys
0.633CysAla: 0.633 ± 0.181
0.097CysCys: 0.097 ± 0.069
0.876CysAsp: 0.876 ± 0.213
0.681CysGlu: 0.681 ± 0.163
0.146CysPhe: 0.146 ± 0.08
0.681CysGly: 0.681 ± 0.173
0.243CysHis: 0.243 ± 0.111
0.243CysIle: 0.243 ± 0.111
0.243CysLys: 0.243 ± 0.107
0.974CysLeu: 0.974 ± 0.264
0.0CysMet: 0.0 ± 0.0
0.195CysAsn: 0.195 ± 0.079
0.633CysPro: 0.633 ± 0.171
0.438CysGln: 0.438 ± 0.127
0.487CysArg: 0.487 ± 0.148
0.633CysSer: 0.633 ± 0.188
0.389CysThr: 0.389 ± 0.135
0.974CysVal: 0.974 ± 0.248
0.146CysTrp: 0.146 ± 0.076
0.243CysTyr: 0.243 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
7.593AspAla: 7.593 ± 0.504
0.584AspCys: 0.584 ± 0.198
7.253AspAsp: 7.253 ± 1.3
4.868AspGlu: 4.868 ± 0.743
1.996AspPhe: 1.996 ± 0.279
7.058AspGly: 7.058 ± 0.631
1.606AspHis: 1.606 ± 0.259
3.456AspIle: 3.456 ± 0.488
1.996AspLys: 1.996 ± 0.285
5.744AspLeu: 5.744 ± 0.54
1.558AspMet: 1.558 ± 0.231
2.093AspAsn: 2.093 ± 0.468
4.381AspPro: 4.381 ± 0.425
1.947AspGln: 1.947 ± 0.298
4.868AspArg: 4.868 ± 0.47
3.164AspSer: 3.164 ± 0.447
3.651AspThr: 3.651 ± 0.452
5.014AspVal: 5.014 ± 0.439
1.266AspTrp: 1.266 ± 0.242
1.947AspTyr: 1.947 ± 0.435
0.0AspXaa: 0.0 ± 0.0
Glu
5.646GluAla: 5.646 ± 0.476
0.584GluCys: 0.584 ± 0.194
4.576GluAsp: 4.576 ± 0.522
4.43GluGlu: 4.43 ± 0.524
2.19GluPhe: 2.19 ± 0.309
3.748GluGly: 3.748 ± 0.487
1.217GluHis: 1.217 ± 0.201
4.722GluIle: 4.722 ± 0.44
2.775GluLys: 2.775 ± 0.401
5.987GluLeu: 5.987 ± 0.593
1.801GluMet: 1.801 ± 0.269
2.19GluAsn: 2.19 ± 0.377
3.359GluPro: 3.359 ± 0.6
2.142GluGln: 2.142 ± 0.332
5.208GluArg: 5.208 ± 0.706
3.699GluSer: 3.699 ± 0.364
3.456GluThr: 3.456 ± 0.421
4.576GluVal: 4.576 ± 0.385
0.925GluTrp: 0.925 ± 0.201
1.898GluTyr: 1.898 ± 0.401
0.0GluXaa: 0.0 ± 0.0
Phe
2.58PheAla: 2.58 ± 0.348
0.389PheCys: 0.389 ± 0.127
2.58PheAsp: 2.58 ± 0.347
2.19PheGlu: 2.19 ± 0.281
0.73PhePhe: 0.73 ± 0.188
2.921PheGly: 2.921 ± 0.426
0.827PheHis: 0.827 ± 0.169
1.12PheIle: 1.12 ± 0.24
0.876PheLys: 0.876 ± 0.206
2.239PheLeu: 2.239 ± 0.361
0.584PheMet: 0.584 ± 0.137
0.779PheAsn: 0.779 ± 0.157
1.363PhePro: 1.363 ± 0.247
0.73PheGln: 0.73 ± 0.198
2.288PheArg: 2.288 ± 0.328
2.093PheSer: 2.093 ± 0.402
1.996PheThr: 1.996 ± 0.281
2.19PheVal: 2.19 ± 0.304
0.827PheTrp: 0.827 ± 0.207
1.168PheTyr: 1.168 ± 0.22
0.0PheXaa: 0.0 ± 0.0
Gly
7.934GlyAla: 7.934 ± 0.766
0.779GlyCys: 0.779 ± 0.205
6.182GlyAsp: 6.182 ± 0.505
5.744GlyGlu: 5.744 ± 0.585
2.726GlyPhe: 2.726 ± 0.343
7.837GlyGly: 7.837 ± 1.569
1.801GlyHis: 1.801 ± 0.326
4.673GlyIle: 4.673 ± 0.67
3.553GlyLys: 3.553 ± 0.455
5.987GlyLeu: 5.987 ± 0.749
1.801GlyMet: 1.801 ± 0.357
2.385GlyAsn: 2.385 ± 0.314
3.456GlyPro: 3.456 ± 0.502
3.407GlyGln: 3.407 ± 0.45
6.425GlyArg: 6.425 ± 0.537
5.987GlySer: 5.987 ± 0.818
5.014GlyThr: 5.014 ± 0.451
5.987GlyVal: 5.987 ± 0.453
1.655GlyTrp: 1.655 ± 0.267
2.823GlyTyr: 2.823 ± 0.423
0.0GlyXaa: 0.0 ± 0.0
His
1.85HisAla: 1.85 ± 0.323
0.243HisCys: 0.243 ± 0.099
1.412HisAsp: 1.412 ± 0.287
1.022HisGlu: 1.022 ± 0.222
0.925HisPhe: 0.925 ± 0.19
1.898HisGly: 1.898 ± 0.374
0.341HisHis: 0.341 ± 0.134
0.876HisIle: 0.876 ± 0.164
0.681HisLys: 0.681 ± 0.163
1.266HisLeu: 1.266 ± 0.264
0.633HisMet: 0.633 ± 0.189
0.292HisAsn: 0.292 ± 0.149
1.752HisPro: 1.752 ± 0.331
0.146HisGln: 0.146 ± 0.078
1.85HisArg: 1.85 ± 0.368
0.876HisSer: 0.876 ± 0.191
1.363HisThr: 1.363 ± 0.33
1.85HisVal: 1.85 ± 0.321
0.487HisTrp: 0.487 ± 0.135
0.681HisTyr: 0.681 ± 0.203
0.0HisXaa: 0.0 ± 0.0
Ile
5.598IleAla: 5.598 ± 0.58
0.243IleCys: 0.243 ± 0.129
3.31IleAsp: 3.31 ± 0.418
4.478IleGlu: 4.478 ± 0.512
0.827IlePhe: 0.827 ± 0.219
4.722IleGly: 4.722 ± 0.777
1.12IleHis: 1.12 ± 0.25
1.947IleIle: 1.947 ± 0.287
2.044IleLys: 2.044 ± 0.335
2.969IleLeu: 2.969 ± 0.35
0.779IleMet: 0.779 ± 0.194
2.142IleAsn: 2.142 ± 0.275
2.434IlePro: 2.434 ± 0.332
1.509IleGln: 1.509 ± 0.296
2.629IleArg: 2.629 ± 0.373
2.677IleSer: 2.677 ± 0.304
2.336IleThr: 2.336 ± 0.37
2.775IleVal: 2.775 ± 0.251
0.876IleTrp: 0.876 ± 0.21
1.217IleTyr: 1.217 ± 0.275
0.0IleXaa: 0.0 ± 0.0
Lys
4.089LysAla: 4.089 ± 0.556
0.243LysCys: 0.243 ± 0.088
2.872LysAsp: 2.872 ± 0.373
1.85LysGlu: 1.85 ± 0.331
1.071LysPhe: 1.071 ± 0.221
3.164LysGly: 3.164 ± 0.404
0.633LysHis: 0.633 ± 0.127
2.093LysIle: 2.093 ± 0.272
2.482LysLys: 2.482 ± 0.335
3.213LysLeu: 3.213 ± 0.428
1.022LysMet: 1.022 ± 0.236
1.412LysAsn: 1.412 ± 0.258
2.19LysPro: 2.19 ± 0.297
1.412LysGln: 1.412 ± 0.241
3.115LysArg: 3.115 ± 0.425
1.558LysSer: 1.558 ± 0.297
2.726LysThr: 2.726 ± 0.397
3.407LysVal: 3.407 ± 0.366
0.925LysTrp: 0.925 ± 0.227
1.46LysTyr: 1.46 ± 0.281
0.0LysXaa: 0.0 ± 0.0
Leu
8.567LeuAla: 8.567 ± 0.639
1.071LeuCys: 1.071 ± 0.22
5.89LeuAsp: 5.89 ± 0.537
3.991LeuGlu: 3.991 ± 0.359
2.093LeuPhe: 2.093 ± 0.357
6.182LeuGly: 6.182 ± 0.78
1.412LeuHis: 1.412 ± 0.337
2.775LeuIle: 2.775 ± 0.429
3.553LeuLys: 3.553 ± 0.41
5.257LeuLeu: 5.257 ± 0.548
1.412LeuMet: 1.412 ± 0.209
2.093LeuAsn: 2.093 ± 0.291
4.527LeuPro: 4.527 ± 0.585
2.336LeuGln: 2.336 ± 0.317
5.792LeuArg: 5.792 ± 0.586
4.04LeuSer: 4.04 ± 0.426
5.208LeuThr: 5.208 ± 0.579
5.841LeuVal: 5.841 ± 0.477
1.217LeuTrp: 1.217 ± 0.215
1.752LeuTyr: 1.752 ± 0.321
0.0LeuXaa: 0.0 ± 0.0
Met
2.677MetAla: 2.677 ± 0.388
0.0MetCys: 0.0 ± 0.0
1.071MetAsp: 1.071 ± 0.262
1.168MetGlu: 1.168 ± 0.245
0.974MetPhe: 0.974 ± 0.219
0.925MetGly: 0.925 ± 0.21
0.438MetHis: 0.438 ± 0.18
1.363MetIle: 1.363 ± 0.316
1.071MetLys: 1.071 ± 0.223
1.606MetLeu: 1.606 ± 0.258
0.535MetMet: 0.535 ± 0.145
1.12MetAsn: 1.12 ± 0.214
1.704MetPro: 1.704 ± 0.289
0.487MetGln: 0.487 ± 0.131
1.947MetArg: 1.947 ± 0.296
1.704MetSer: 1.704 ± 0.302
1.704MetThr: 1.704 ± 0.292
0.827MetVal: 0.827 ± 0.191
0.389MetTrp: 0.389 ± 0.14
0.584MetTyr: 0.584 ± 0.134
0.0MetXaa: 0.0 ± 0.0
Asn
2.58AsnAla: 2.58 ± 0.28
0.292AsnCys: 0.292 ± 0.091
2.044AsnAsp: 2.044 ± 0.322
2.239AsnGlu: 2.239 ± 0.262
1.217AsnPhe: 1.217 ± 0.237
3.651AsnGly: 3.651 ± 0.529
0.73AsnHis: 0.73 ± 0.187
1.12AsnIle: 1.12 ± 0.202
1.168AsnLys: 1.168 ± 0.286
2.288AsnLeu: 2.288 ± 0.329
0.779AsnMet: 0.779 ± 0.199
0.974AsnAsn: 0.974 ± 0.217
2.434AsnPro: 2.434 ± 0.37
0.925AsnGln: 0.925 ± 0.186
2.142AsnArg: 2.142 ± 0.314
1.752AsnSer: 1.752 ± 0.358
2.19AsnThr: 2.19 ± 0.301
2.288AsnVal: 2.288 ± 0.385
0.876AsnTrp: 0.876 ± 0.203
0.925AsnTyr: 0.925 ± 0.234
0.0AsnXaa: 0.0 ± 0.0
Pro
5.306ProAla: 5.306 ± 0.461
0.341ProCys: 0.341 ± 0.131
4.527ProAsp: 4.527 ± 0.501
3.894ProGlu: 3.894 ± 0.459
1.898ProPhe: 1.898 ± 0.276
4.673ProGly: 4.673 ± 0.497
0.633ProHis: 0.633 ± 0.138
2.482ProIle: 2.482 ± 0.406
2.629ProLys: 2.629 ± 0.395
4.04ProLeu: 4.04 ± 0.497
1.168ProMet: 1.168 ± 0.225
1.947ProAsn: 1.947 ± 0.335
3.261ProPro: 3.261 ± 0.503
1.655ProGln: 1.655 ± 0.237
4.04ProArg: 4.04 ± 0.489
3.067ProSer: 3.067 ± 0.307
3.991ProThr: 3.991 ± 0.559
3.991ProVal: 3.991 ± 0.398
0.584ProTrp: 0.584 ± 0.127
1.168ProTyr: 1.168 ± 0.248
0.0ProXaa: 0.0 ± 0.0
Gln
3.505GlnAla: 3.505 ± 0.42
0.195GlnCys: 0.195 ± 0.099
1.363GlnAsp: 1.363 ± 0.224
1.558GlnGlu: 1.558 ± 0.305
1.022GlnPhe: 1.022 ± 0.232
1.752GlnGly: 1.752 ± 0.3
0.389GlnHis: 0.389 ± 0.129
1.655GlnIle: 1.655 ± 0.271
1.217GlnLys: 1.217 ± 0.197
3.213GlnLeu: 3.213 ± 0.466
1.412GlnMet: 1.412 ± 0.236
1.46GlnAsn: 1.46 ± 0.282
1.46GlnPro: 1.46 ± 0.212
1.266GlnGln: 1.266 ± 0.284
3.31GlnArg: 3.31 ± 0.414
1.168GlnSer: 1.168 ± 0.197
2.239GlnThr: 2.239 ± 0.346
2.239GlnVal: 2.239 ± 0.325
0.779GlnTrp: 0.779 ± 0.176
0.584GlnTyr: 0.584 ± 0.154
0.0GlnXaa: 0.0 ± 0.0
Arg
7.399ArgAla: 7.399 ± 0.688
1.071ArgCys: 1.071 ± 0.268
4.624ArgAsp: 4.624 ± 0.642
5.841ArgGlu: 5.841 ± 0.658
2.775ArgPhe: 2.775 ± 0.476
6.571ArgGly: 6.571 ± 0.62
1.996ArgHis: 1.996 ± 0.332
2.775ArgIle: 2.775 ± 0.332
3.456ArgLys: 3.456 ± 0.41
4.381ArgLeu: 4.381 ± 0.52
2.19ArgMet: 2.19 ± 0.304
2.482ArgAsn: 2.482 ± 0.33
3.456ArgPro: 3.456 ± 0.402
2.531ArgGln: 2.531 ± 0.397
7.496ArgArg: 7.496 ± 0.739
3.261ArgSer: 3.261 ± 0.523
3.894ArgThr: 3.894 ± 0.53
5.354ArgVal: 5.354 ± 0.552
1.412ArgTrp: 1.412 ± 0.279
1.655ArgTyr: 1.655 ± 0.239
0.0ArgXaa: 0.0 ± 0.0
Ser
6.523SerAla: 6.523 ± 0.654
0.243SerCys: 0.243 ± 0.102
3.894SerAsp: 3.894 ± 0.565
3.261SerGlu: 3.261 ± 0.4
1.217SerPhe: 1.217 ± 0.248
5.403SerGly: 5.403 ± 0.564
1.266SerHis: 1.266 ± 0.261
2.872SerIle: 2.872 ± 0.379
2.142SerLys: 2.142 ± 0.449
3.115SerLeu: 3.115 ± 0.344
1.509SerMet: 1.509 ± 0.294
1.996SerAsn: 1.996 ± 0.26
3.018SerPro: 3.018 ± 0.459
1.704SerGln: 1.704 ± 0.276
3.553SerArg: 3.553 ± 0.494
3.164SerSer: 3.164 ± 0.587
3.553SerThr: 3.553 ± 0.404
3.359SerVal: 3.359 ± 0.305
1.12SerTrp: 1.12 ± 0.204
1.217SerTyr: 1.217 ± 0.2
0.0SerXaa: 0.0 ± 0.0
Thr
6.815ThrAla: 6.815 ± 0.734
0.487ThrCys: 0.487 ± 0.145
3.553ThrAsp: 3.553 ± 0.366
2.677ThrGlu: 2.677 ± 0.405
2.336ThrPhe: 2.336 ± 0.298
7.253ThrGly: 7.253 ± 0.502
0.974ThrHis: 0.974 ± 0.21
2.872ThrIle: 2.872 ± 0.386
2.629ThrLys: 2.629 ± 0.398
4.673ThrLeu: 4.673 ± 0.441
1.071ThrMet: 1.071 ± 0.197
1.85ThrAsn: 1.85 ± 0.284
4.965ThrPro: 4.965 ± 0.636
1.704ThrGln: 1.704 ± 0.312
4.04ThrArg: 4.04 ± 0.491
3.018ThrSer: 3.018 ± 0.304
4.43ThrThr: 4.43 ± 0.452
4.673ThrVal: 4.673 ± 0.559
1.46ThrTrp: 1.46 ± 0.271
1.704ThrTyr: 1.704 ± 0.31
0.0ThrXaa: 0.0 ± 0.0
Val
7.301ValAla: 7.301 ± 0.747
0.535ValCys: 0.535 ± 0.183
5.89ValAsp: 5.89 ± 0.568
5.257ValGlu: 5.257 ± 0.505
1.898ValPhe: 1.898 ± 0.269
5.257ValGly: 5.257 ± 0.505
1.022ValHis: 1.022 ± 0.21
3.261ValIle: 3.261 ± 0.484
2.385ValLys: 2.385 ± 0.301
5.062ValLeu: 5.062 ± 0.42
0.827ValMet: 0.827 ± 0.19
2.434ValAsn: 2.434 ± 0.321
3.651ValPro: 3.651 ± 0.392
2.385ValGln: 2.385 ± 0.389
5.354ValArg: 5.354 ± 0.505
4.43ValSer: 4.43 ± 0.454
6.182ValThr: 6.182 ± 0.555
5.549ValVal: 5.549 ± 0.55
1.12ValTrp: 1.12 ± 0.27
1.509ValTyr: 1.509 ± 0.34
0.0ValXaa: 0.0 ± 0.0
Trp
1.752TrpAla: 1.752 ± 0.312
0.195TrpCys: 0.195 ± 0.085
1.314TrpAsp: 1.314 ± 0.25
1.266TrpGlu: 1.266 ± 0.259
0.535TrpPhe: 0.535 ± 0.163
1.266TrpGly: 1.266 ± 0.231
0.633TrpHis: 0.633 ± 0.207
0.974TrpIle: 0.974 ± 0.225
0.487TrpLys: 0.487 ± 0.18
1.655TrpLeu: 1.655 ± 0.334
0.243TrpMet: 0.243 ± 0.101
0.681TrpAsn: 0.681 ± 0.193
0.779TrpPro: 0.779 ± 0.16
0.779TrpGln: 0.779 ± 0.208
0.925TrpArg: 0.925 ± 0.238
1.46TrpSer: 1.46 ± 0.268
1.558TrpThr: 1.558 ± 0.265
1.606TrpVal: 1.606 ± 0.241
0.779TrpTrp: 0.779 ± 0.199
0.292TrpTyr: 0.292 ± 0.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.288TyrAla: 2.288 ± 0.326
0.341TyrCys: 0.341 ± 0.143
1.752TyrAsp: 1.752 ± 0.337
1.314TyrGlu: 1.314 ± 0.271
0.633TyrPhe: 0.633 ± 0.186
2.677TyrGly: 2.677 ± 0.407
0.487TyrHis: 0.487 ± 0.178
0.487TyrIle: 0.487 ± 0.145
0.633TyrLys: 0.633 ± 0.166
2.629TyrLeu: 2.629 ± 0.454
0.633TyrMet: 0.633 ± 0.148
1.071TyrAsn: 1.071 ± 0.238
2.19TyrPro: 2.19 ± 0.346
0.438TyrGln: 0.438 ± 0.114
2.385TyrArg: 2.385 ± 0.322
1.217TyrSer: 1.217 ± 0.246
1.412TyrThr: 1.412 ± 0.277
1.898TyrVal: 1.898 ± 0.414
0.681TyrTrp: 0.681 ± 0.17
0.633TyrTyr: 0.633 ± 0.171
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (20545 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski