Amino acid dipeptide frequency for Gordonia phage GMA7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
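As a minimal sketch of how such frequencies could be counted (not necessarily the procedure used for this table), the function below slides a window of two residues over each protein. The function name, the toy sequences, the one-letter residue codes, and the choice to normalise by the total number of dipeptides are all assumptions made for illustration.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Dipeptides are counted within each protein only, never across
    protein boundaries. Normalising by the total dipeptide count is an
    assumption of this sketch; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences in one-letter code (made up, not from this proteome)
example = ["MAAG", "AAK"]
freqs = dipeptide_frequencies(example)
print(freqs["AA"])  # AA occurs in 2 of the 5 dipeptides
```

The table below uses three-letter codes (AlaAla rather than AA), so a mapping step would be needed to reproduce its labels.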

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
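The uniform expectation and the over/under labelling can be sketched as follows. This is an illustration only: the helper names are hypothetical, and the simple comparison against the uniform value (ignoring the reported error) is an assumption, not a description of how the page assigns its colours.

```python
def uniform_expectation(n_residue_types=20):
    """Expected per-mille frequency if all dipeptides were equally likely."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected = uniform_expectation()   # 2.5 per mille, i.e. 0.25%
print(classify(9.257, expected))   # AlaAla value from the table -> over
print(classify(0.132, expected))   # CysCys value from the table -> under
```

Calling `uniform_expectation(21)` gives the corresponding value when Xaa is counted as a 21st residue type (1000/441, about 2.27‰).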

All values are given in per mille (‰); to obtain fractions they need to be multiplied by 10^-3.
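For readers who prefer fractions or percentages, the conversion is a one-liner. The helper names below are hypothetical and serve only as a worked example of the unit change.

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

# AlaAla is listed as 9.257 per mille, i.e. slightly under 1% of all dipeptides
fraction = permille_to_fraction(9.257)
percent = permille_to_percent(9.257)
```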

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.257 ± 1.963
AlaCys: 0.658 ± 0.179
AlaAsp: 5.177 ± 0.457
AlaGlu: 5.572 ± 0.661
AlaPhe: 3.203 ± 0.393
AlaGly: 6.011 ± 1.086
AlaHis: 0.965 ± 0.197
AlaIle: 5.353 ± 0.719
AlaLys: 3.992 ± 0.499
AlaLeu: 7.107 ± 0.826
AlaMet: 2.983 ± 0.527
AlaAsn: 2.939 ± 0.315
AlaPro: 4.431 ± 0.557
AlaGln: 3.685 ± 0.63
AlaArg: 4.124 ± 0.444
AlaSer: 5.791 ± 0.636
AlaThr: 6.537 ± 0.601
AlaVal: 6.142 ± 0.633
AlaTrp: 2.413 ± 0.799
AlaTyr: 2.106 ± 0.301
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.702 ± 0.207
CysCys: 0.132 ± 0.075
CysAsp: 0.614 ± 0.199
CysGlu: 0.439 ± 0.176
CysPhe: 0.263 ± 0.11
CysGly: 0.921 ± 0.236
CysHis: 0.263 ± 0.125
CysIle: 0.483 ± 0.152
CysLys: 0.658 ± 0.197
CysLeu: 0.395 ± 0.147
CysMet: 0.132 ± 0.084
CysAsn: 0.395 ± 0.155
CysPro: 0.395 ± 0.135
CysGln: 0.175 ± 0.096
CysArg: 0.351 ± 0.158
CysSer: 0.57 ± 0.189
CysThr: 0.702 ± 0.198
CysVal: 0.834 ± 0.22
CysTrp: 0.088 ± 0.064
CysTyr: 0.263 ± 0.105
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.309 ± 0.444
AspCys: 0.614 ± 0.189
AspAsp: 3.905 ± 0.682
AspGlu: 4.87 ± 0.706
AspPhe: 2.238 ± 0.29
AspGly: 4.87 ± 0.41
AspHis: 1.228 ± 0.235
AspIle: 2.896 ± 0.407
AspLys: 2.852 ± 0.406
AspLeu: 4.431 ± 0.458
AspMet: 2.238 ± 0.297
AspAsn: 2.72 ± 0.356
AspPro: 3.992 ± 0.546
AspGln: 2.106 ± 0.29
AspArg: 3.29 ± 0.387
AspSer: 3.992 ± 0.319
AspThr: 3.115 ± 0.464
AspVal: 4.475 ± 0.389
AspTrp: 1.097 ± 0.207
AspTyr: 2.15 ± 0.275
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.616 ± 0.562
GluCys: 0.439 ± 0.173
GluAsp: 4.519 ± 0.605
GluGlu: 5.791 ± 0.863
GluPhe: 2.852 ± 0.537
GluGly: 4.563 ± 0.516
GluHis: 1.228 ± 0.228
GluIle: 4.431 ± 0.482
GluLys: 3.466 ± 0.437
GluLeu: 6.142 ± 0.516
GluMet: 2.238 ± 0.324
GluAsn: 2.457 ± 0.35
GluPro: 2.764 ± 0.453
GluGln: 2.281 ± 0.373
GluArg: 5.177 ± 0.826
GluSer: 4.914 ± 0.512
GluThr: 3.027 ± 0.452
GluVal: 4.08 ± 0.536
GluTrp: 1.228 ± 0.23
GluTyr: 2.281 ± 0.457
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.983 ± 0.377
PheCys: 0.483 ± 0.155
PheAsp: 2.676 ± 0.46
PheGlu: 2.413 ± 0.425
PhePhe: 1.053 ± 0.211
PheGly: 3.203 ± 0.449
PheHis: 0.483 ± 0.198
PheIle: 1.755 ± 0.258
PheLys: 2.062 ± 0.323
PheLeu: 1.799 ± 0.291
PheMet: 1.492 ± 0.242
PheAsn: 2.15 ± 0.27
PhePro: 1.623 ± 0.361
PheGln: 1.185 ± 0.249
PheArg: 1.623 ± 0.272
PheSer: 2.72 ± 0.356
PheThr: 2.369 ± 0.303
PheVal: 2.018 ± 0.309
PheTrp: 0.351 ± 0.128
PheTyr: 0.834 ± 0.201
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.098 ± 0.648
GlyCys: 0.834 ± 0.243
GlyAsp: 4.387 ± 0.358
GlyGlu: 4.343 ± 0.354
GlyPhe: 2.369 ± 0.28
GlyGly: 7.195 ± 1.068
GlyHis: 1.404 ± 0.278
GlyIle: 4.387 ± 0.518
GlyLys: 5.791 ± 0.634
GlyLeu: 5.66 ± 0.611
GlyMet: 1.755 ± 0.316
GlyAsn: 3.466 ± 0.494
GlyPro: 3.29 ± 0.425
GlyGln: 2.632 ± 0.339
GlyArg: 4.651 ± 0.507
GlySer: 5.353 ± 0.609
GlyThr: 5.528 ± 0.591
GlyVal: 5.221 ± 0.475
GlyTrp: 1.492 ± 0.284
GlyTyr: 2.501 ± 0.308
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.185 ± 0.234
HisCys: 0.263 ± 0.112
HisAsp: 0.834 ± 0.214
HisGlu: 1.141 ± 0.296
HisPhe: 0.57 ± 0.182
HisGly: 1.053 ± 0.219
HisHis: 0.57 ± 0.185
HisIle: 1.053 ± 0.257
HisLys: 0.702 ± 0.227
HisLeu: 1.623 ± 0.291
HisMet: 0.439 ± 0.137
HisAsn: 0.746 ± 0.181
HisPro: 0.921 ± 0.235
HisGln: 0.658 ± 0.185
HisArg: 0.79 ± 0.195
HisSer: 0.965 ± 0.187
HisThr: 0.79 ± 0.144
HisVal: 1.579 ± 0.304
HisTrp: 0.351 ± 0.113
HisTyr: 0.702 ± 0.166
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.089 ± 0.724
IleCys: 0.395 ± 0.126
IleAsp: 3.598 ± 0.397
IleGlu: 4.651 ± 0.581
IlePhe: 1.974 ± 0.291
IleGly: 4.87 ± 0.558
IleHis: 0.921 ± 0.25
IleIle: 2.896 ± 0.461
IleLys: 3.466 ± 0.43
IleLeu: 3.159 ± 0.38
IleMet: 1.141 ± 0.176
IleAsn: 3.159 ± 0.389
IlePro: 2.413 ± 0.564
IleGln: 2.281 ± 0.357
IleArg: 2.632 ± 0.297
IleSer: 3.115 ± 0.448
IleThr: 3.378 ± 0.493
IleVal: 3.641 ± 0.419
IleTrp: 1.009 ± 0.258
IleTyr: 1.097 ± 0.234
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.002 ± 0.554
LysCys: 0.395 ± 0.15
LysAsp: 3.334 ± 0.413
LysGlu: 3.378 ± 0.505
LysPhe: 2.018 ± 0.319
LysGly: 3.861 ± 0.457
LysHis: 0.658 ± 0.177
LysIle: 2.939 ± 0.318
LysLys: 3.203 ± 0.397
LysLeu: 4.475 ± 0.739
LysMet: 1.799 ± 0.239
LysAsn: 2.589 ± 0.451
LysPro: 2.589 ± 0.518
LysGln: 2.15 ± 0.356
LysArg: 3.159 ± 0.508
LysSer: 3.817 ± 0.465
LysThr: 3.203 ± 0.361
LysVal: 3.378 ± 0.35
LysTrp: 0.483 ± 0.171
LysTyr: 1.272 ± 0.288
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.967 ± 0.633
LeuCys: 0.57 ± 0.198
LeuAsp: 5.572 ± 0.526
LeuGlu: 6.142 ± 0.557
LeuPhe: 2.194 ± 0.313
LeuGly: 5.835 ± 0.611
LeuHis: 1.316 ± 0.302
LeuIle: 4.212 ± 0.381
LeuLys: 4.519 ± 0.53
LeuLeu: 5.089 ± 0.655
LeuMet: 2.369 ± 0.358
LeuAsn: 2.939 ± 0.269
LeuPro: 3.992 ± 0.439
LeuGln: 2.808 ± 0.426
LeuArg: 3.773 ± 0.423
LeuSer: 4.958 ± 0.493
LeuThr: 5.133 ± 0.531
LeuVal: 5.396 ± 0.497
LeuTrp: 1.492 ± 0.282
LeuTyr: 1.843 ± 0.353
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.589 ± 0.321
MetCys: 0.132 ± 0.087
MetAsp: 1.316 ± 0.231
MetGlu: 1.93 ± 0.285
MetPhe: 1.009 ± 0.239
MetGly: 1.843 ± 0.418
MetHis: 0.614 ± 0.182
MetIle: 1.536 ± 0.278
MetLys: 1.492 ± 0.242
MetLeu: 2.808 ± 0.369
MetMet: 0.439 ± 0.159
MetAsn: 1.272 ± 0.316
MetPro: 1.667 ± 0.333
MetGln: 1.492 ± 0.325
MetArg: 1.492 ± 0.218
MetSer: 2.72 ± 0.462
MetThr: 2.238 ± 0.326
MetVal: 1.228 ± 0.227
MetTrp: 0.263 ± 0.086
MetTyr: 0.658 ± 0.149
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.378 ± 0.601
AsnCys: 0.307 ± 0.111
AsnAsp: 2.457 ± 0.361
AsnGlu: 2.501 ± 0.309
AsnPhe: 1.579 ± 0.243
AsnGly: 4.607 ± 0.486
AsnHis: 0.658 ± 0.182
AsnIle: 2.194 ± 0.424
AsnLys: 2.72 ± 0.467
AsnLeu: 2.983 ± 0.402
AsnMet: 1.009 ± 0.224
AsnAsn: 1.448 ± 0.234
AsnPro: 2.896 ± 0.308
AsnGln: 1.887 ± 0.298
AsnArg: 1.974 ± 0.326
AsnSer: 2.72 ± 0.374
AsnThr: 2.194 ± 0.322
AsnVal: 2.808 ± 0.443
AsnTrp: 0.526 ± 0.184
AsnTyr: 1.272 ± 0.265
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.651 ± 0.579
ProCys: 0.395 ± 0.148
ProAsp: 3.071 ± 0.492
ProGlu: 4.036 ± 0.594
ProPhe: 1.579 ± 0.289
ProGly: 4.387 ± 0.418
ProHis: 0.921 ± 0.235
ProIle: 1.93 ± 0.258
ProLys: 2.852 ± 0.621
ProLeu: 3.422 ± 0.345
ProMet: 1.492 ± 0.235
ProAsn: 1.93 ± 0.275
ProPro: 2.194 ± 0.418
ProGln: 1.141 ± 0.235
ProArg: 2.018 ± 0.422
ProSer: 3.905 ± 0.442
ProThr: 2.457 ± 0.406
ProVal: 3.949 ± 0.415
ProTrp: 0.921 ± 0.238
ProTyr: 1.887 ± 0.262
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.905 ± 0.542
GlnCys: 0.263 ± 0.121
GlnAsp: 1.492 ± 0.267
GlnGlu: 2.72 ± 0.362
GlnPhe: 1.448 ± 0.251
GlnGly: 2.106 ± 0.324
GlnHis: 0.526 ± 0.175
GlnIle: 2.676 ± 0.34
GlnLys: 1.36 ± 0.247
GlnLeu: 3.203 ± 0.34
GlnMet: 0.834 ± 0.213
GlnAsn: 1.536 ± 0.292
GlnPro: 1.711 ± 0.305
GlnGln: 1.448 ± 0.221
GlnArg: 2.062 ± 0.324
GlnSer: 1.711 ± 0.298
GlnThr: 2.764 ± 0.386
GlnVal: 2.72 ± 0.404
GlnTrp: 0.526 ± 0.214
GlnTyr: 0.834 ± 0.213
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.694 ± 0.423
ArgCys: 0.702 ± 0.175
ArgAsp: 2.939 ± 0.46
ArgGlu: 4.387 ± 0.454
ArgPhe: 2.106 ± 0.346
ArgGly: 3.115 ± 0.455
ArgHis: 0.834 ± 0.22
ArgIle: 2.676 ± 0.405
ArgLys: 2.983 ± 0.447
ArgLeu: 4.607 ± 0.534
ArgMet: 1.887 ± 0.376
ArgAsn: 2.062 ± 0.34
ArgPro: 2.238 ± 0.363
ArgGln: 2.062 ± 0.337
ArgArg: 3.554 ± 0.536
ArgSer: 3.071 ± 0.412
ArgThr: 2.676 ± 0.426
ArgVal: 4.651 ± 0.628
ArgTrp: 0.614 ± 0.188
ArgTyr: 2.325 ± 0.36
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.362 ± 0.711
SerCys: 0.307 ± 0.133
SerAsp: 4.651 ± 0.474
SerGlu: 4.212 ± 0.459
SerPhe: 2.325 ± 0.41
SerGly: 5.133 ± 0.501
SerHis: 1.053 ± 0.248
SerIle: 3.905 ± 0.492
SerLys: 2.808 ± 0.292
SerLeu: 5.045 ± 0.485
SerMet: 1.887 ± 0.298
SerAsn: 2.852 ± 0.345
SerPro: 3.159 ± 0.373
SerGln: 1.843 ± 0.398
SerArg: 3.598 ± 0.524
SerSer: 4.563 ± 0.479
SerThr: 4.212 ± 0.512
SerVal: 4.607 ± 0.474
SerTrp: 1.448 ± 0.253
SerTyr: 2.238 ± 0.368
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.309 ± 0.864
ThrCys: 0.483 ± 0.165
ThrAsp: 3.51 ± 0.526
ThrGlu: 3.115 ± 0.372
ThrPhe: 2.238 ± 0.3
ThrGly: 6.098 ± 0.595
ThrHis: 0.921 ± 0.228
ThrIle: 3.685 ± 0.414
ThrLys: 3.334 ± 0.383
ThrLeu: 5.396 ± 0.451
ThrMet: 1.448 ± 0.417
ThrAsn: 2.72 ± 0.45
ThrPro: 3.159 ± 0.472
ThrGln: 2.062 ± 0.322
ThrArg: 2.983 ± 0.368
ThrSer: 4.168 ± 0.585
ThrThr: 4.256 ± 0.689
ThrVal: 4.431 ± 0.515
ThrTrp: 1.536 ± 0.31
ThrTyr: 1.93 ± 0.294
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.274 ± 0.587
ValCys: 0.746 ± 0.233
ValAsp: 4.738 ± 0.5
ValGlu: 4.519 ± 0.506
ValPhe: 2.413 ± 0.291
ValGly: 4.651 ± 0.553
ValHis: 1.36 ± 0.226
ValIle: 3.861 ± 0.389
ValLys: 3.554 ± 0.354
ValLeu: 4.914 ± 0.689
ValMet: 1.843 ± 0.284
ValAsn: 2.808 ± 0.336
ValPro: 4.08 ± 0.592
ValGln: 2.194 ± 0.277
ValArg: 4.212 ± 0.454
ValSer: 4.036 ± 0.349
ValThr: 4.914 ± 0.416
ValVal: 5.133 ± 0.476
ValTrp: 1.272 ± 0.222
ValTyr: 1.93 ± 0.275
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.272 ± 0.384
TrpCys: 0.351 ± 0.149
TrpAsp: 1.36 ± 0.379
TrpGlu: 1.228 ± 0.212
TrpPhe: 0.921 ± 0.241
TrpGly: 1.228 ± 0.211
TrpHis: 0.351 ± 0.124
TrpIle: 0.877 ± 0.206
TrpLys: 0.834 ± 0.147
TrpLeu: 1.755 ± 0.471
TrpMet: 0.395 ± 0.126
TrpAsn: 0.57 ± 0.261
TrpPro: 0.658 ± 0.167
TrpGln: 0.658 ± 0.19
TrpArg: 1.009 ± 0.219
TrpSer: 1.185 ± 0.258
TrpThr: 1.185 ± 0.362
TrpVal: 0.877 ± 0.156
TrpTrp: 0.175 ± 0.089
TrpTyr: 0.658 ± 0.2
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.808 ± 0.309
TyrCys: 0.307 ± 0.12
TyrAsp: 2.238 ± 0.385
TyrGlu: 2.15 ± 0.358
TyrPhe: 0.921 ± 0.209
TyrGly: 2.808 ± 0.372
TyrHis: 0.658 ± 0.218
TyrIle: 1.141 ± 0.241
TyrLys: 1.053 ± 0.194
TyrLeu: 2.106 ± 0.315
TyrMet: 0.877 ± 0.177
TyrAsn: 1.36 ± 0.313
TyrPro: 0.965 ± 0.196
TyrGln: 1.097 ± 0.255
TyrArg: 1.711 ± 0.344
TyrSer: 1.93 ± 0.348
TyrThr: 2.106 ± 0.333
TyrVal: 2.281 ± 0.358
TyrTrp: 0.263 ± 0.108
TyrTyr: 0.834 ± 0.209
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 101 proteins (22794 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
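A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the use of the sample standard deviation as the error measure, the fixed seed, and the toy sequences are assumptions of this sketch, not details confirmed by the database.

```python
import random
from collections import Counter
from statistics import stdev

def dipeptide_permille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over resampled proteomes.

    Each resample draws len(proteins) whole proteins with replacement,
    so the sampling unit is the protein rather than the residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return stdev(estimates)

# Toy sequences in one-letter code (made up, not from this proteome)
toy = ["MAAG", "AAK", "GGKAA", "MKV"]
print(dipeptide_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins keeps the dipeptides within each protein together, so the error reflects variation between proteins rather than between individual residue pairs.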

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski