Amino acid dipeptide frequency for Gordonia phage Cozz

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
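As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function names, the toy sequences, and the choice to normalize by the total number of counted dipeptides are all assumptions made for this example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

# Uniform expectation: 1 of 400 dipeptides, expressed in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted; the original page may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MAAL", "AALK"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `classify(10.958)` (AlaAla) gives "over" and `classify(0.541)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.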

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
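A short worked conversion may help when reading the table. The helper name below is hypothetical; the AlaAla value is taken from the table, and the ratio is computed against the uniform expectation of 1/400.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala_per_mille = 10.958                       # AlaAla entry from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala_per_mille)  # about 0.010958
uniform_fraction = 1 / 400                       # 0.0025, i.e. 0.25% or 2.5 per mille
enrichment = ala_ala_fraction / uniform_fraction  # about 4.38 times the uniform value
```

In other words, AlaAla occurs roughly four times more often in this proteome than it would if all 400 dipeptides were equally likely.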

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.958 ± 1.803
AlaCys: 0.541 ± 0.193
AlaAsp: 6.02 ± 0.783
AlaGlu: 5.885 ± 0.529
AlaPhe: 3.72 ± 0.604
AlaGly: 8.185 ± 1.001
AlaHis: 1.556 ± 0.319
AlaIle: 5.614 ± 0.883
AlaLys: 5.344 ± 0.692
AlaLeu: 9.131 ± 0.795
AlaMet: 2.773 ± 0.5
AlaAsn: 2.706 ± 0.521
AlaPro: 4.464 ± 0.453
AlaGln: 3.45 ± 0.587
AlaArg: 6.02 ± 0.655
AlaSer: 5.614 ± 0.622
AlaThr: 6.291 ± 0.853
AlaVal: 6.494 ± 0.769
AlaTrp: 2.3 ± 0.433
AlaTyr: 2.706 ± 0.327
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.879 ± 0.278
CysCys: 0.068 ± 0.076
CysAsp: 0.812 ± 0.31
CysGlu: 0.406 ± 0.169
CysPhe: 0.473 ± 0.166
CysGly: 0.541 ± 0.212
CysHis: 0.135 ± 0.081
CysIle: 0.135 ± 0.093
CysLys: 0.406 ± 0.184
CysLeu: 0.406 ± 0.161
CysMet: 0.203 ± 0.16
CysAsn: 0.271 ± 0.146
CysPro: 0.406 ± 0.204
CysGln: 0.338 ± 0.143
CysArg: 0.271 ± 0.139
CysSer: 0.879 ± 0.236
CysThr: 0.406 ± 0.176
CysVal: 0.271 ± 0.145
CysTrp: 0.203 ± 0.098
CysTyr: 0.406 ± 0.161
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.494 ± 0.579
AspCys: 0.676 ± 0.23
AspAsp: 4.194 ± 0.716
AspGlu: 5.073 ± 0.647
AspPhe: 2.097 ± 0.341
AspGly: 5.411 ± 0.622
AspHis: 1.218 ± 0.286
AspIle: 2.841 ± 0.387
AspLys: 3.179 ± 0.49
AspLeu: 6.088 ± 0.742
AspMet: 1.556 ± 0.312
AspAsn: 1.759 ± 0.321
AspPro: 3.788 ± 0.576
AspGln: 1.826 ± 0.4
AspArg: 3.856 ± 0.373
AspSer: 3.788 ± 0.475
AspThr: 3.585 ± 0.471
AspVal: 3.517 ± 0.538
AspTrp: 1.488 ± 0.328
AspTyr: 2.367 ± 0.393
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.088 ± 0.444
GluCys: 0.473 ± 0.163
GluAsp: 5.885 ± 0.84
GluGlu: 5.073 ± 0.983
GluPhe: 1.691 ± 0.348
GluGly: 3.72 ± 0.488
GluHis: 1.623 ± 0.274
GluIle: 2.503 ± 0.395
GluLys: 3.044 ± 0.698
GluLeu: 5.682 ± 0.647
GluMet: 1.556 ± 0.325
GluAsn: 1.353 ± 0.301
GluPro: 2.706 ± 0.354
GluGln: 3.314 ± 0.435
GluArg: 4.532 ± 0.565
GluSer: 3.517 ± 0.485
GluThr: 2.57 ± 0.378
GluVal: 4.6 ± 0.708
GluTrp: 1.285 ± 0.259
GluTyr: 2.165 ± 0.417
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.382 ± 0.525
PheCys: 0.676 ± 0.263
PheAsp: 2.706 ± 0.455
PheGlu: 1.691 ± 0.405
PhePhe: 0.947 ± 0.225
PheGly: 2.367 ± 0.405
PheHis: 0.812 ± 0.195
PheIle: 2.029 ± 0.523
PheLys: 1.691 ± 0.319
PheLeu: 2.367 ± 0.365
PheMet: 0.879 ± 0.187
PheAsn: 1.015 ± 0.184
PhePro: 1.015 ± 0.25
PheGln: 0.812 ± 0.219
PheArg: 1.759 ± 0.361
PheSer: 1.691 ± 0.43
PheThr: 2.435 ± 0.373
PheVal: 2.435 ± 0.402
PheTrp: 0.406 ± 0.18
PheTyr: 0.609 ± 0.183
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.035 ± 0.742
GlyCys: 0.744 ± 0.265
GlyAsp: 5.885 ± 0.786
GlyGlu: 4.667 ± 0.611
GlyPhe: 3.044 ± 0.431
GlyGly: 6.02 ± 0.762
GlyHis: 1.488 ± 0.295
GlyIle: 3.923 ± 0.471
GlyLys: 4.329 ± 0.608
GlyLeu: 5.952 ± 1.023
GlyMet: 1.962 ± 0.276
GlyAsn: 2.3 ± 0.368
GlyPro: 3.044 ± 0.539
GlyGln: 3.247 ± 0.358
GlyArg: 5.682 ± 0.748
GlySer: 4.464 ± 0.634
GlyThr: 6.358 ± 0.847
GlyVal: 6.426 ± 0.71
GlyTrp: 1.623 ± 0.351
GlyTyr: 3.111 ± 0.431
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.42 ± 0.288
HisCys: 0.271 ± 0.129
HisAsp: 1.15 ± 0.253
HisGlu: 1.42 ± 0.393
HisPhe: 0.744 ± 0.254
HisGly: 1.488 ± 0.273
HisHis: 0.812 ± 0.234
HisIle: 0.609 ± 0.219
HisLys: 0.812 ± 0.259
HisLeu: 1.962 ± 0.452
HisMet: 0.812 ± 0.232
HisAsn: 0.744 ± 0.217
HisPro: 1.488 ± 0.344
HisGln: 0.676 ± 0.163
HisArg: 1.218 ± 0.281
HisSer: 1.082 ± 0.242
HisThr: 1.759 ± 0.305
HisVal: 1.285 ± 0.29
HisTrp: 0.203 ± 0.122
HisTyr: 0.541 ± 0.149
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.411 ± 0.747
IleCys: 0.203 ± 0.105
IleAsp: 3.585 ± 0.431
IleGlu: 3.111 ± 0.449
IlePhe: 1.082 ± 0.262
IleGly: 3.517 ± 0.542
IleHis: 0.676 ± 0.221
IleIle: 2.165 ± 0.444
IleLys: 2.435 ± 0.687
IleLeu: 3.111 ± 0.498
IleMet: 0.812 ± 0.244
IleAsn: 1.759 ± 0.301
IlePro: 3.044 ± 0.485
IleGln: 1.759 ± 0.346
IleArg: 4.126 ± 0.585
IleSer: 2.909 ± 0.603
IleThr: 3.653 ± 0.458
IleVal: 2.773 ± 0.364
IleTrp: 1.015 ± 0.245
IleTyr: 1.285 ± 0.359
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.141 ± 0.552
LysCys: 0.406 ± 0.215
LysAsp: 2.976 ± 0.397
LysGlu: 3.517 ± 0.554
LysPhe: 1.962 ± 0.373
LysGly: 4.464 ± 0.578
LysHis: 0.879 ± 0.261
LysIle: 2.165 ± 0.346
LysLys: 3.382 ± 0.468
LysLeu: 4.87 ± 0.576
LysMet: 1.218 ± 0.249
LysAsn: 1.623 ± 0.249
LysPro: 2.706 ± 0.368
LysGln: 2.165 ± 0.375
LysArg: 2.909 ± 0.495
LysSer: 2.706 ± 0.423
LysThr: 3.111 ± 0.427
LysVal: 3.653 ± 0.425
LysTrp: 0.676 ± 0.247
LysTyr: 1.353 ± 0.312
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.643 ± 0.798
LeuCys: 0.676 ± 0.226
LeuAsp: 4.802 ± 0.542
LeuGlu: 5.817 ± 0.667
LeuPhe: 2.029 ± 0.458
LeuGly: 6.967 ± 0.669
LeuHis: 1.488 ± 0.273
LeuIle: 3.991 ± 0.447
LeuLys: 4.532 ± 0.604
LeuLeu: 7.17 ± 0.881
LeuMet: 2.435 ± 0.33
LeuAsn: 3.111 ± 0.512
LeuPro: 4.329 ± 0.489
LeuGln: 2.773 ± 0.468
LeuArg: 6.494 ± 0.736
LeuSer: 5.141 ± 0.566
LeuThr: 4.667 ± 0.539
LeuVal: 5.411 ± 0.655
LeuTrp: 1.42 ± 0.311
LeuTyr: 2.503 ± 0.32
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.585 ± 0.352
MetCys: 0.068 ± 0.073
MetAsp: 0.812 ± 0.236
MetGlu: 1.488 ± 0.313
MetPhe: 1.082 ± 0.259
MetGly: 2.097 ± 0.421
MetHis: 0.744 ± 0.239
MetIle: 1.488 ± 0.308
MetLys: 1.691 ± 0.377
MetLeu: 1.894 ± 0.37
MetMet: 0.744 ± 0.165
MetAsn: 1.015 ± 0.237
MetPro: 1.623 ± 0.307
MetGln: 0.879 ± 0.238
MetArg: 1.15 ± 0.202
MetSer: 2.3 ± 0.351
MetThr: 1.691 ± 0.331
MetVal: 1.285 ± 0.271
MetTrp: 0.676 ± 0.235
MetTyr: 0.271 ± 0.151
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.653 ± 0.735
AsnCys: 0.135 ± 0.096
AsnAsp: 1.759 ± 0.495
AsnGlu: 1.894 ± 0.319
AsnPhe: 1.15 ± 0.294
AsnGly: 2.841 ± 0.424
AsnHis: 0.609 ± 0.216
AsnIle: 1.42 ± 0.388
AsnLys: 1.285 ± 0.34
AsnLeu: 2.165 ± 0.321
AsnMet: 1.218 ± 0.317
AsnAsn: 1.082 ± 0.375
AsnPro: 3.314 ± 0.47
AsnGln: 0.947 ± 0.214
AsnArg: 2.029 ± 0.397
AsnSer: 1.623 ± 0.335
AsnThr: 1.42 ± 0.272
AsnVal: 2.435 ± 0.459
AsnTrp: 0.676 ± 0.198
AsnTyr: 0.744 ± 0.204
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.208 ± 0.528
ProCys: 0.541 ± 0.195
ProAsp: 3.585 ± 0.543
ProGlu: 3.72 ± 0.57
ProPhe: 1.894 ± 0.411
ProGly: 5.479 ± 0.624
ProHis: 1.15 ± 0.224
ProIle: 2.3 ± 0.373
ProLys: 2.706 ± 0.35
ProLeu: 4.058 ± 0.483
ProMet: 1.488 ± 0.308
ProAsn: 2.029 ± 0.418
ProPro: 2.706 ± 0.455
ProGln: 1.691 ± 0.4
ProArg: 2.57 ± 0.462
ProSer: 2.773 ± 0.34
ProThr: 3.247 ± 0.448
ProVal: 3.585 ± 0.478
ProTrp: 1.15 ± 0.246
ProTyr: 0.812 ± 0.241
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.788 ± 0.627
GlnCys: 0.406 ± 0.191
GlnAsp: 1.488 ± 0.273
GlnGlu: 2.367 ± 0.407
GlnPhe: 1.42 ± 0.399
GlnGly: 2.165 ± 0.384
GlnHis: 0.879 ± 0.308
GlnIle: 2.435 ± 0.47
GlnLys: 2.165 ± 0.429
GlnLeu: 3.314 ± 0.405
GlnMet: 0.676 ± 0.213
GlnAsn: 1.15 ± 0.345
GlnPro: 2.232 ± 0.339
GlnGln: 1.556 ± 0.367
GlnArg: 2.435 ± 0.334
GlnSer: 2.3 ± 0.379
GlnThr: 1.556 ± 0.277
GlnVal: 2.3 ± 0.304
GlnTrp: 0.947 ± 0.229
GlnTyr: 1.082 ± 0.23
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.223 ± 0.62
ArgCys: 0.338 ± 0.19
ArgAsp: 4.667 ± 0.522
ArgGlu: 3.856 ± 0.624
ArgPhe: 2.232 ± 0.358
ArgGly: 5.682 ± 0.553
ArgHis: 1.015 ± 0.288
ArgIle: 3.044 ± 0.39
ArgLys: 3.45 ± 0.571
ArgLeu: 5.208 ± 0.532
ArgMet: 1.42 ± 0.327
ArgAsn: 1.962 ± 0.358
ArgPro: 3.111 ± 0.692
ArgGln: 2.57 ± 0.354
ArgArg: 5.344 ± 0.681
ArgSer: 3.314 ± 0.54
ArgThr: 3.111 ± 0.42
ArgVal: 4.735 ± 0.697
ArgTrp: 1.556 ± 0.359
ArgTyr: 1.691 ± 0.325
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.749 ± 0.782
SerCys: 0.541 ± 0.238
SerAsp: 3.179 ± 0.501
SerGlu: 3.585 ± 0.533
SerPhe: 1.218 ± 0.226
SerGly: 5.952 ± 0.54
SerHis: 1.15 ± 0.333
SerIle: 3.111 ± 0.52
SerLys: 2.841 ± 0.47
SerLeu: 5.344 ± 0.686
SerMet: 1.556 ± 0.304
SerAsn: 1.623 ± 0.254
SerPro: 3.044 ± 0.515
SerGln: 2.097 ± 0.396
SerArg: 3.179 ± 0.389
SerSer: 2.909 ± 0.368
SerThr: 3.788 ± 0.46
SerVal: 3.179 ± 0.469
SerTrp: 1.285 ± 0.278
SerTyr: 1.353 ± 0.268
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.817 ± 0.563
ThrCys: 0.473 ± 0.181
ThrAsp: 3.45 ± 0.504
ThrGlu: 2.435 ± 0.368
ThrPhe: 1.556 ± 0.306
ThrGly: 5.817 ± 0.63
ThrHis: 1.015 ± 0.267
ThrIle: 2.841 ± 0.529
ThrLys: 3.585 ± 0.6
ThrLeu: 4.6 ± 0.683
ThrMet: 1.285 ± 0.345
ThrAsn: 1.962 ± 0.31
ThrPro: 4.329 ± 0.571
ThrGln: 2.097 ± 0.322
ThrArg: 3.72 ± 0.525
ThrSer: 2.841 ± 0.436
ThrThr: 3.585 ± 0.534
ThrVal: 5.885 ± 0.72
ThrTrp: 0.947 ± 0.238
ThrTyr: 1.623 ± 0.287
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.832 ± 0.637
ValCys: 0.068 ± 0.066
ValAsp: 4.397 ± 0.54
ValGlu: 4.464 ± 0.459
ValPhe: 1.691 ± 0.358
ValGly: 5.073 ± 0.633
ValHis: 1.556 ± 0.275
ValIle: 4.464 ± 0.461
ValLys: 2.638 ± 0.395
ValLeu: 5.479 ± 0.56
ValMet: 2.232 ± 0.388
ValAsn: 3.044 ± 0.47
ValPro: 3.45 ± 0.501
ValGln: 2.57 ± 0.379
ValArg: 3.856 ± 0.48
ValSer: 4.194 ± 0.488
ValThr: 4.464 ± 0.635
ValVal: 4.532 ± 0.465
ValTrp: 1.962 ± 0.3
ValTyr: 1.962 ± 0.424
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.3 ± 0.371
TrpCys: 0.473 ± 0.183
TrpAsp: 1.488 ± 0.264
TrpGlu: 1.353 ± 0.331
TrpPhe: 0.676 ± 0.182
TrpGly: 1.285 ± 0.337
TrpHis: 0.609 ± 0.184
TrpIle: 0.338 ± 0.142
TrpLys: 0.744 ± 0.21
TrpLeu: 2.3 ± 0.475
TrpMet: 0.473 ± 0.159
TrpAsn: 1.15 ± 0.411
TrpPro: 0.744 ± 0.193
TrpGln: 1.082 ± 0.221
TrpArg: 1.082 ± 0.229
TrpSer: 1.082 ± 0.277
TrpThr: 0.541 ± 0.16
TrpVal: 1.962 ± 0.381
TrpTrp: 0.406 ± 0.161
TrpTyr: 0.676 ± 0.204
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.165 ± 0.302
TyrCys: 0.135 ± 0.086
TyrAsp: 1.962 ± 0.426
TyrGlu: 1.556 ± 0.36
TyrPhe: 1.015 ± 0.258
TyrGly: 2.232 ± 0.401
TyrHis: 1.082 ± 0.279
TyrIle: 1.015 ± 0.257
TyrLys: 1.623 ± 0.335
TyrLeu: 2.3 ± 0.42
TyrMet: 1.218 ± 0.331
TyrAsn: 0.744 ± 0.173
TyrPro: 1.15 ± 0.304
TyrGln: 0.812 ± 0.171
TyrArg: 2.367 ± 0.399
TyrSer: 1.556 ± 0.308
TyrThr: 1.623 ± 0.311
TyrVal: 2.165 ± 0.321
TyrTrp: 0.541 ± 0.207
TyrTyr: 0.541 ± 0.198
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (14785 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
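A protein-level bootstrap of this kind could look roughly like the Python sketch below. The page only states that bootstrapping was done 100 times at the protein level, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each round draws len(proteins) whole proteins with
    replacement, so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not the 68 proteins of the phage.
toy = ["MAALAA", "MKKLLV", "AAGAAV", "MSTPQR"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the error reflects variation between proteins in the proteome.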

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski