Amino acid dipeptide frequency for Clostridium virus phiCD27

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred randomly with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
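The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build the database: the function name `dipeptide_per_mille`, the one-letter alphabet, the toy sequences, and the choice to normalize by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: frequencies are normalized by the total number of
    overlapping dipeptides; the database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is pooled as X (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not sequences from this proteome.
toy_proteins = ["MKKLE", "MKEL"]
toy_freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation for the 400 standard dipeptides, in per mille.
uniform_expectation = 1000.0 / 400
```

With the 21-letter alphabet the result always has 441 entries, matching the layout of the table, and the values sum to 1000‰ whenever at least one dipeptide was counted.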

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.
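As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and percentages and compares them with the uniform baseline of 0.25% (2.5‰). The function names are hypothetical, and using the uniform baseline as the threshold for over- and underrepresentation is an assumption based on the description above; the two input values are taken from the table.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def representation(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the expected frequency."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table: LysLys = 10.911, AlaAla = 1.753 (per mille).
lyslys_label = representation(10.911)
alaala_label = representation(1.753)
alaala_fraction = per_mille_to_fraction(1.753)
```

Under this assumption LysLys (10.911‰) counts as overrepresented and AlaAla (1.753‰, a fraction of about 0.001753) as underrepresented.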

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.753 ± 0.409
AlaCys: 0.714 ± 0.232
AlaAsp: 2.663 ± 0.346
AlaGlu: 4.611 ± 0.446
AlaPhe: 1.753 ± 0.426
AlaGly: 3.182 ± 0.718
AlaHis: 0.52 ± 0.189
AlaIle: 4.221 ± 0.667
AlaLys: 5.131 ± 0.528
AlaLeu: 6.17 ± 0.67
AlaMet: 1.559 ± 0.404
AlaAsn: 2.533 ± 0.334
AlaPro: 1.039 ± 0.265
AlaGln: 1.364 ± 0.271
AlaArg: 2.013 ± 0.356
AlaSer: 3.832 ± 0.718
AlaThr: 4.091 ± 0.557
AlaVal: 2.858 ± 0.592
AlaTrp: 0.649 ± 0.192
AlaTyr: 1.883 ± 0.332
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.584 ± 0.217
CysCys: 0.26 ± 0.145
CysAsp: 0.844 ± 0.296
CysGlu: 1.104 ± 0.314
CysPhe: 0.26 ± 0.136
CysGly: 0.714 ± 0.293
CysHis: 0.195 ± 0.113
CysIle: 1.299 ± 0.312
CysLys: 1.234 ± 0.338
CysLeu: 0.649 ± 0.216
CysMet: 0.52 ± 0.192
CysAsn: 0.455 ± 0.175
CysPro: 0.13 ± 0.089
CysGln: 0.325 ± 0.145
CysArg: 0.39 ± 0.152
CysSer: 0.455 ± 0.21
CysThr: 0.26 ± 0.122
CysVal: 0.649 ± 0.195
CysTrp: 0.26 ± 0.121
CysTyr: 0.714 ± 0.243
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.922 ± 0.456
AspCys: 0.909 ± 0.279
AspAsp: 3.767 ± 0.642
AspGlu: 5.325 ± 0.705
AspPhe: 2.663 ± 0.379
AspGly: 3.312 ± 0.551
AspHis: 0.065 ± 0.061
AspIle: 6.105 ± 0.684
AspLys: 6.3 ± 0.775
AspLeu: 3.897 ± 0.453
AspMet: 1.299 ± 0.278
AspAsn: 3.702 ± 0.457
AspPro: 0.714 ± 0.254
AspGln: 0.39 ± 0.164
AspArg: 2.078 ± 0.328
AspSer: 3.637 ± 0.485
AspThr: 2.728 ± 0.338
AspVal: 3.312 ± 0.396
AspTrp: 0.52 ± 0.169
AspTyr: 2.013 ± 0.443
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.066 ± 0.657
GluCys: 0.974 ± 0.251
GluAsp: 4.351 ± 0.635
GluGlu: 7.728 ± 0.887
GluPhe: 3.377 ± 0.525
GluGly: 4.156 ± 0.51
GluHis: 0.39 ± 0.15
GluIle: 7.533 ± 0.761
GluLys: 9.027 ± 1.108
GluLeu: 9.027 ± 0.725
GluMet: 3.247 ± 0.663
GluAsn: 6.364 ± 0.692
GluPro: 1.494 ± 0.386
GluGln: 3.312 ± 0.551
GluArg: 2.858 ± 0.43
GluSer: 3.507 ± 0.47
GluThr: 5.066 ± 0.612
GluVal: 4.676 ± 0.536
GluTrp: 0.714 ± 0.25
GluTyr: 4.351 ± 0.725
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.689 ± 0.311
PheCys: 0.26 ± 0.127
PheAsp: 2.208 ± 0.381
PheGlu: 2.987 ± 0.468
PhePhe: 1.429 ± 0.284
PheGly: 2.013 ± 0.395
PheHis: 0.39 ± 0.15
PheIle: 3.507 ± 0.461
PheLys: 3.962 ± 0.472
PheLeu: 3.507 ± 0.46
PheMet: 0.909 ± 0.2
PheAsn: 4.026 ± 0.521
PhePro: 1.039 ± 0.315
PheGln: 0.844 ± 0.173
PheArg: 1.689 ± 0.349
PheSer: 2.922 ± 0.475
PheThr: 2.143 ± 0.335
PheVal: 2.728 ± 0.61
PheTrp: 0.26 ± 0.141
PheTyr: 1.299 ± 0.331
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.922 ± 0.604
GlyCys: 1.039 ± 0.259
GlyAsp: 2.468 ± 0.369
GlyGlu: 6.04 ± 0.649
GlyPhe: 3.507 ± 0.627
GlyGly: 4.741 ± 1.58
GlyHis: 1.104 ± 0.326
GlyIle: 4.741 ± 0.717
GlyLys: 4.676 ± 0.508
GlyLeu: 3.897 ± 0.652
GlyMet: 1.883 ± 0.31
GlyAsn: 3.052 ± 0.394
GlyPro: 0.325 ± 0.146
GlyGln: 1.559 ± 0.337
GlyArg: 1.818 ± 0.287
GlySer: 3.377 ± 0.565
GlyThr: 3.247 ± 0.747
GlyVal: 4.221 ± 0.689
GlyTrp: 0.844 ± 0.193
GlyTyr: 2.728 ± 0.576
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.39 ± 0.184
HisCys: 0.325 ± 0.111
HisAsp: 0.26 ± 0.133
HisGlu: 0.974 ± 0.307
HisPhe: 0.52 ± 0.221
HisGly: 0.649 ± 0.196
HisHis: 0.13 ± 0.092
HisIle: 0.974 ± 0.324
HisLys: 0.974 ± 0.299
HisLeu: 0.909 ± 0.268
HisMet: 0.325 ± 0.165
HisAsn: 0.844 ± 0.277
HisPro: 0.649 ± 0.216
HisGln: 0.325 ± 0.139
HisArg: 0.13 ± 0.099
HisSer: 0.455 ± 0.183
HisThr: 0.649 ± 0.234
HisVal: 0.26 ± 0.105
HisTrp: 0.13 ± 0.089
HisTyr: 0.52 ± 0.204
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.066 ± 0.59
IleCys: 1.169 ± 0.335
IleAsp: 6.494 ± 0.841
IleGlu: 7.598 ± 0.716
IlePhe: 2.987 ± 0.453
IleGly: 5.26 ± 0.67
IleHis: 0.909 ± 0.252
IleIle: 6.17 ± 0.748
IleLys: 10.001 ± 1.14
IleLeu: 6.364 ± 0.719
IleMet: 2.208 ± 0.487
IleAsn: 6.235 ± 0.543
IlePro: 2.468 ± 0.381
IleGln: 2.338 ± 0.423
IleArg: 3.572 ± 0.562
IleSer: 6.04 ± 0.971
IleThr: 4.806 ± 0.534
IleVal: 4.871 ± 0.536
IleTrp: 0.649 ± 0.206
IleTyr: 3.832 ± 0.603
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.04 ± 0.795
LysCys: 0.844 ± 0.276
LysAsp: 6.235 ± 0.585
LysGlu: 10.001 ± 0.646
LysPhe: 3.247 ± 0.333
LysGly: 5.715 ± 0.545
LysHis: 0.909 ± 0.346
LysIle: 9.092 ± 0.738
LysLys: 10.911 ± 1.648
LysLeu: 7.663 ± 0.779
LysMet: 2.533 ± 0.519
LysAsn: 7.858 ± 0.745
LysPro: 2.078 ± 0.423
LysGln: 4.026 ± 0.459
LysArg: 4.611 ± 0.559
LysSer: 5.845 ± 0.655
LysThr: 5.52 ± 0.593
LysVal: 6.494 ± 0.468
LysTrp: 1.039 ± 0.239
LysTyr: 4.351 ± 0.596
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.351 ± 0.578
LeuCys: 0.714 ± 0.226
LeuAsp: 5.195 ± 0.669
LeuGlu: 7.728 ± 0.753
LeuPhe: 2.922 ± 0.438
LeuGly: 5.325 ± 0.681
LeuHis: 1.299 ± 0.351
LeuIle: 6.105 ± 0.65
LeuLys: 10.651 ± 0.991
LeuLeu: 6.689 ± 0.889
LeuMet: 1.559 ± 0.294
LeuAsn: 6.494 ± 0.73
LeuPro: 1.753 ± 0.397
LeuGln: 3.117 ± 0.462
LeuArg: 2.987 ± 0.458
LeuSer: 5.585 ± 0.627
LeuThr: 4.806 ± 0.567
LeuVal: 4.026 ± 0.541
LeuTrp: 0.649 ± 0.193
LeuTyr: 2.922 ± 0.542
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.559 ± 0.49
MetCys: 0.195 ± 0.114
MetAsp: 1.624 ± 0.334
MetGlu: 1.753 ± 0.325
MetPhe: 0.52 ± 0.157
MetGly: 0.974 ± 0.275
MetHis: 0.26 ± 0.138
MetIle: 1.559 ± 0.35
MetLys: 2.273 ± 0.268
MetLeu: 2.728 ± 0.404
MetMet: 0.325 ± 0.161
MetAsn: 2.013 ± 0.382
MetPro: 0.714 ± 0.211
MetGln: 0.584 ± 0.179
MetArg: 0.39 ± 0.135
MetSer: 1.299 ± 0.319
MetThr: 0.974 ± 0.215
MetVal: 1.429 ± 0.28
MetTrp: 0.325 ± 0.119
MetTyr: 1.234 ± 0.239
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.026 ± 0.58
AsnCys: 0.844 ± 0.243
AsnAsp: 3.182 ± 0.538
AsnGlu: 5.65 ± 0.816
AsnPhe: 3.182 ± 0.547
AsnGly: 4.416 ± 0.593
AsnHis: 0.26 ± 0.114
AsnIle: 7.339 ± 0.629
AsnLys: 8.053 ± 0.837
AsnLeu: 5.585 ± 0.593
AsnMet: 1.234 ± 0.271
AsnAsn: 5.001 ± 0.681
AsnPro: 1.429 ± 0.314
AsnGln: 1.299 ± 0.319
AsnArg: 3.247 ± 0.499
AsnSer: 4.156 ± 0.506
AsnThr: 3.572 ± 0.545
AsnVal: 4.221 ± 0.458
AsnTrp: 0.584 ± 0.214
AsnTyr: 2.338 ± 0.377
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.909 ± 0.268
ProCys: 0.26 ± 0.122
ProAsp: 0.649 ± 0.224
ProGlu: 1.494 ± 0.347
ProPhe: 0.779 ± 0.191
ProGly: 0.844 ± 0.204
ProHis: 0.325 ± 0.151
ProIle: 2.793 ± 0.441
ProLys: 1.948 ± 0.357
ProLeu: 1.818 ± 0.317
ProMet: 0.065 ± 0.06
ProAsn: 1.818 ± 0.37
ProPro: 0.325 ± 0.126
ProGln: 0.974 ± 0.223
ProArg: 0.779 ± 0.262
ProSer: 0.779 ± 0.182
ProThr: 2.013 ± 0.297
ProVal: 1.429 ± 0.258
ProTrp: 0.13 ± 0.085
ProTyr: 0.52 ± 0.197
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.883 ± 0.441
GlnCys: 0.195 ± 0.105
GlnAsp: 1.689 ± 0.37
GlnGlu: 3.182 ± 0.514
GlnPhe: 0.974 ± 0.207
GlnGly: 1.494 ± 0.265
GlnHis: 0.26 ± 0.129
GlnIle: 2.598 ± 0.397
GlnLys: 2.533 ± 0.522
GlnLeu: 2.922 ± 0.357
GlnMet: 0.584 ± 0.204
GlnAsn: 2.468 ± 0.372
GlnPro: 0.779 ± 0.215
GlnGln: 0.844 ± 0.271
GlnArg: 0.974 ± 0.231
GlnSer: 1.753 ± 0.327
GlnThr: 1.948 ± 0.271
GlnVal: 1.364 ± 0.297
GlnTrp: 0.26 ± 0.114
GlnTyr: 1.104 ± 0.273
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.494 ± 0.318
ArgCys: 0.714 ± 0.236
ArgAsp: 1.299 ± 0.297
ArgGlu: 3.962 ± 0.515
ArgPhe: 1.429 ± 0.362
ArgGly: 2.078 ± 0.367
ArgHis: 0.39 ± 0.165
ArgIle: 4.221 ± 0.527
ArgLys: 3.247 ± 0.438
ArgLeu: 3.312 ± 0.512
ArgMet: 0.974 ± 0.269
ArgAsn: 1.429 ± 0.311
ArgPro: 0.909 ± 0.298
ArgGln: 0.844 ± 0.199
ArgArg: 1.169 ± 0.41
ArgSer: 2.208 ± 0.422
ArgThr: 2.078 ± 0.407
ArgVal: 2.403 ± 0.364
ArgTrp: 0.39 ± 0.14
ArgTyr: 1.429 ± 0.308
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.442 ± 0.537
SerCys: 0.26 ± 0.121
SerAsp: 3.442 ± 0.675
SerGlu: 4.481 ± 0.457
SerPhe: 3.182 ± 0.475
SerGly: 3.442 ± 0.474
SerHis: 0.909 ± 0.283
SerIle: 5.325 ± 0.602
SerLys: 6.235 ± 0.579
SerLeu: 5.066 ± 0.689
SerMet: 1.494 ± 0.306
SerAsn: 5.325 ± 0.62
SerPro: 0.844 ± 0.2
SerGln: 2.078 ± 0.37
SerArg: 1.818 ± 0.31
SerSer: 4.871 ± 0.655
SerThr: 3.442 ± 0.496
SerVal: 2.793 ± 0.46
SerTrp: 0.325 ± 0.137
SerTyr: 2.793 ± 0.455
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.247 ± 0.602
ThrCys: 0.325 ± 0.145
ThrAsp: 2.533 ± 0.4
ThrGlu: 3.832 ± 0.535
ThrPhe: 2.663 ± 0.419
ThrGly: 3.507 ± 0.536
ThrHis: 0.909 ± 0.237
ThrIle: 5.26 ± 0.664
ThrLys: 6.235 ± 0.582
ThrLeu: 5.39 ± 0.574
ThrMet: 0.584 ± 0.209
ThrAsn: 2.922 ± 0.374
ThrPro: 1.948 ± 0.424
ThrGln: 2.208 ± 0.321
ThrArg: 1.883 ± 0.321
ThrSer: 4.221 ± 0.646
ThrThr: 3.702 ± 0.594
ThrVal: 3.767 ± 0.462
ThrTrp: 0.39 ± 0.143
ThrTyr: 2.143 ± 0.347
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.247 ± 0.509
ValCys: 0.39 ± 0.155
ValAsp: 3.897 ± 0.567
ValGlu: 4.416 ± 0.554
ValPhe: 2.533 ± 0.423
ValGly: 4.026 ± 0.596
ValHis: 0.39 ± 0.154
ValIle: 5.78 ± 0.655
ValLys: 5.585 ± 0.684
ValLeu: 4.871 ± 0.495
ValMet: 0.584 ± 0.161
ValAsn: 4.091 ± 0.573
ValPro: 1.039 ± 0.214
ValGln: 1.689 ± 0.347
ValArg: 1.624 ± 0.282
ValSer: 3.247 ± 0.449
ValThr: 3.897 ± 0.485
ValVal: 3.962 ± 0.705
ValTrp: 0.584 ± 0.221
ValTyr: 2.208 ± 0.432
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.584 ± 0.212
TrpCys: 0.195 ± 0.108
TrpAsp: 0.584 ± 0.168
TrpGlu: 0.714 ± 0.192
TrpPhe: 0.325 ± 0.135
TrpGly: 0.584 ± 0.16
TrpHis: 0.0 ± 0.0
TrpIle: 0.649 ± 0.228
TrpLys: 0.714 ± 0.186
TrpLeu: 0.974 ± 0.245
TrpMet: 0.26 ± 0.121
TrpAsn: 0.714 ± 0.219
TrpPro: 0.0 ± 0.0
TrpGln: 0.325 ± 0.138
TrpArg: 0.52 ± 0.168
TrpSer: 0.39 ± 0.131
TrpThr: 0.52 ± 0.144
TrpVal: 0.52 ± 0.178
TrpTrp: 0.195 ± 0.117
TrpTyr: 0.455 ± 0.191
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.364 ± 0.382
TyrCys: 0.714 ± 0.22
TyrAsp: 2.273 ± 0.47
TyrGlu: 3.637 ± 0.525
TyrPhe: 1.559 ± 0.295
TyrGly: 1.753 ± 0.37
TyrHis: 0.779 ± 0.194
TyrIle: 3.832 ± 0.606
TyrLys: 5.325 ± 0.75
TyrLeu: 3.442 ± 0.481
TyrMet: 0.52 ± 0.199
TyrAsn: 2.273 ± 0.399
TyrPro: 0.974 ± 0.277
TyrGln: 1.364 ± 0.317
TyrArg: 1.429 ± 0.294
TyrSer: 2.987 ± 0.495
TyrThr: 2.208 ± 0.392
TyrVal: 2.078 ± 0.338
TyrTrp: 0.325 ± 0.145
TyrTyr: 1.818 ± 0.393
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (15399 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
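A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative assumption about the procedure, not the database's actual code: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes, e.g. "KK")."""
    count = 0
    total = 0
    for seq in proteins:
        total += max(len(seq) - 1, 0)  # number of overlapping pairs
        count += sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not sequences from this phage.
toy_proteome = ["MKKLE", "MAAAA", "MKKKK"]
kk_freq = dipeptide_freq(toy_proteome, "KK")
kk_error = bootstrap_error(toy_proteome, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much a dipeptide's frequency depends on which proteins happen to be in the set.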

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski