Amino acid dipepetide frequency for Pelagibacter phage HTVC119P

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.383AlaAla: 5.383 ± 0.935
0.673AlaCys: 0.673 ± 0.231
4.542AlaAsp: 4.542 ± 0.471
4.542AlaGlu: 4.542 ± 0.607
2.607AlaPhe: 2.607 ± 0.466
5.299AlaGly: 5.299 ± 0.84
0.589AlaHis: 0.589 ± 0.242
4.289AlaIle: 4.289 ± 0.514
7.401AlaLys: 7.401 ± 1.072
6.728AlaLeu: 6.728 ± 0.618
1.766AlaMet: 1.766 ± 0.489
5.887AlaAsn: 5.887 ± 1.436
2.019AlaPro: 2.019 ± 0.516
2.607AlaGln: 2.607 ± 0.528
3.196AlaArg: 3.196 ± 0.545
4.794AlaSer: 4.794 ± 0.793
5.299AlaThr: 5.299 ± 0.896
4.458AlaVal: 4.458 ± 0.89
1.009AlaTrp: 1.009 ± 0.294
3.28AlaTyr: 3.28 ± 0.53
0.0AlaXaa: 0.0 ± 0.0
Cys
0.841CysAla: 0.841 ± 0.302
0.0CysCys: 0.0 ± 0.0
0.336CysAsp: 0.336 ± 0.17
0.589CysGlu: 0.589 ± 0.229
0.505CysPhe: 0.505 ± 0.21
0.589CysGly: 0.589 ± 0.292
0.252CysHis: 0.252 ± 0.148
0.505CysIle: 0.505 ± 0.248
1.009CysLys: 1.009 ± 0.279
1.009CysLeu: 1.009 ± 0.306
0.168CysMet: 0.168 ± 0.114
0.925CysAsn: 0.925 ± 0.305
0.421CysPro: 0.421 ± 0.185
0.336CysGln: 0.336 ± 0.166
0.252CysArg: 0.252 ± 0.174
0.421CysSer: 0.421 ± 0.231
0.589CysThr: 0.589 ± 0.2
0.589CysVal: 0.589 ± 0.209
0.084CysTrp: 0.084 ± 0.082
0.084CysTyr: 0.084 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
5.046AspAla: 5.046 ± 0.573
0.505AspCys: 0.505 ± 0.209
3.448AspAsp: 3.448 ± 0.477
3.785AspGlu: 3.785 ± 0.523
2.775AspPhe: 2.775 ± 0.416
4.205AspGly: 4.205 ± 0.663
0.336AspHis: 0.336 ± 0.144
3.364AspIle: 3.364 ± 0.453
4.71AspLys: 4.71 ± 0.892
6.224AspLeu: 6.224 ± 0.959
1.262AspMet: 1.262 ± 0.381
2.775AspAsn: 2.775 ± 0.361
2.523AspPro: 2.523 ± 0.444
2.187AspGln: 2.187 ± 0.391
1.934AspArg: 1.934 ± 0.508
3.701AspSer: 3.701 ± 0.685
3.869AspThr: 3.869 ± 0.568
3.616AspVal: 3.616 ± 0.722
0.505AspTrp: 0.505 ± 0.183
2.607AspTyr: 2.607 ± 0.532
0.0AspXaa: 0.0 ± 0.0
Glu
4.962GluAla: 4.962 ± 0.597
0.841GluCys: 0.841 ± 0.324
3.196GluAsp: 3.196 ± 0.493
5.299GluGlu: 5.299 ± 0.606
2.523GluPhe: 2.523 ± 0.43
2.523GluGly: 2.523 ± 0.574
1.177GluHis: 1.177 ± 0.36
3.869GluIle: 3.869 ± 0.55
4.794GluLys: 4.794 ± 0.612
6.56GluLeu: 6.56 ± 0.885
1.682GluMet: 1.682 ± 0.377
3.028GluAsn: 3.028 ± 0.537
1.514GluPro: 1.514 ± 0.455
3.112GluGln: 3.112 ± 0.66
3.448GluArg: 3.448 ± 0.532
2.691GluSer: 2.691 ± 0.576
2.86GluThr: 2.86 ± 0.522
3.616GluVal: 3.616 ± 0.546
0.925GluTrp: 0.925 ± 0.235
2.187GluTyr: 2.187 ± 0.455
0.0GluXaa: 0.0 ± 0.0
Phe
1.598PheAla: 1.598 ± 0.3
0.421PheCys: 0.421 ± 0.211
2.775PheAsp: 2.775 ± 0.413
1.346PheGlu: 1.346 ± 0.445
1.85PhePhe: 1.85 ± 0.327
2.355PheGly: 2.355 ± 0.519
0.336PheHis: 0.336 ± 0.201
2.607PheIle: 2.607 ± 0.445
3.364PheLys: 3.364 ± 0.39
3.448PheLeu: 3.448 ± 0.45
0.757PheMet: 0.757 ± 0.242
2.607PheAsn: 2.607 ± 0.376
1.514PhePro: 1.514 ± 0.302
1.43PheGln: 1.43 ± 0.326
1.85PheArg: 1.85 ± 0.317
2.439PheSer: 2.439 ± 0.486
2.775PheThr: 2.775 ± 0.619
1.934PheVal: 1.934 ± 0.428
0.673PheTrp: 0.673 ± 0.188
1.262PheTyr: 1.262 ± 0.327
0.0PheXaa: 0.0 ± 0.0
Gly
3.953GlyAla: 3.953 ± 0.919
0.084GlyCys: 0.084 ± 0.087
3.785GlyAsp: 3.785 ± 0.604
2.439GlyGlu: 2.439 ± 0.485
3.364GlyPhe: 3.364 ± 0.422
4.794GlyGly: 4.794 ± 0.827
1.346GlyHis: 1.346 ± 0.475
4.542GlyIle: 4.542 ± 0.592
4.794GlyLys: 4.794 ± 0.661
5.551GlyLeu: 5.551 ± 0.755
0.925GlyMet: 0.925 ± 0.243
4.205GlyAsn: 4.205 ± 0.582
1.598GlyPro: 1.598 ± 0.355
2.439GlyGln: 2.439 ± 0.412
3.196GlyArg: 3.196 ± 0.586
4.878GlySer: 4.878 ± 0.884
5.383GlyThr: 5.383 ± 1.363
3.869GlyVal: 3.869 ± 0.479
0.757GlyTrp: 0.757 ± 0.257
2.691GlyTyr: 2.691 ± 0.524
0.0GlyXaa: 0.0 ± 0.0
His
1.009HisAla: 1.009 ± 0.313
0.421HisCys: 0.421 ± 0.231
0.336HisAsp: 0.336 ± 0.141
1.262HisGlu: 1.262 ± 0.417
0.757HisPhe: 0.757 ± 0.224
0.841HisGly: 0.841 ± 0.203
0.252HisHis: 0.252 ± 0.135
0.841HisIle: 0.841 ± 0.237
1.598HisLys: 1.598 ± 0.466
1.682HisLeu: 1.682 ± 0.51
0.421HisMet: 0.421 ± 0.198
0.757HisAsn: 0.757 ± 0.248
0.336HisPro: 0.336 ± 0.191
0.336HisGln: 0.336 ± 0.196
0.757HisArg: 0.757 ± 0.229
1.009HisSer: 1.009 ± 0.292
1.093HisThr: 1.093 ± 0.277
0.841HisVal: 0.841 ± 0.25
0.336HisTrp: 0.336 ± 0.165
0.505HisTyr: 0.505 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
5.214IleAla: 5.214 ± 0.614
0.252IleCys: 0.252 ± 0.146
5.467IleAsp: 5.467 ± 0.664
5.046IleGlu: 5.046 ± 0.699
1.682IlePhe: 1.682 ± 0.353
5.214IleGly: 5.214 ± 0.619
1.346IleHis: 1.346 ± 0.377
4.373IleIle: 4.373 ± 0.735
5.467IleLys: 5.467 ± 0.685
5.214IleLeu: 5.214 ± 0.766
1.43IleMet: 1.43 ± 0.37
4.878IleAsn: 4.878 ± 0.772
2.523IlePro: 2.523 ± 0.493
2.523IleGln: 2.523 ± 0.436
2.944IleArg: 2.944 ± 0.486
4.542IleSer: 4.542 ± 0.719
4.794IleThr: 4.794 ± 0.655
3.448IleVal: 3.448 ± 0.455
0.925IleTrp: 0.925 ± 0.254
1.934IleTyr: 1.934 ± 0.372
0.0IleXaa: 0.0 ± 0.0
Lys
7.653LysAla: 7.653 ± 1.37
0.421LysCys: 0.421 ± 0.181
5.13LysAsp: 5.13 ± 0.677
6.392LysGlu: 6.392 ± 0.91
2.439LysPhe: 2.439 ± 0.422
4.037LysGly: 4.037 ± 0.555
1.346LysHis: 1.346 ± 0.329
5.803LysIle: 5.803 ± 0.748
8.326LysLys: 8.326 ± 1.315
8.242LysLeu: 8.242 ± 0.942
2.103LysMet: 2.103 ± 0.415
4.289LysAsn: 4.289 ± 0.727
2.607LysPro: 2.607 ± 0.493
3.785LysGln: 3.785 ± 0.462
4.205LysArg: 4.205 ± 0.717
5.383LysSer: 5.383 ± 0.705
5.299LysThr: 5.299 ± 0.691
5.214LysVal: 5.214 ± 0.559
1.009LysTrp: 1.009 ± 0.391
3.364LysTyr: 3.364 ± 0.575
0.0LysXaa: 0.0 ± 0.0
Leu
6.812LeuAla: 6.812 ± 0.58
1.093LeuCys: 1.093 ± 0.402
5.803LeuAsp: 5.803 ± 0.763
5.719LeuGlu: 5.719 ± 0.599
1.682LeuPhe: 1.682 ± 0.456
4.878LeuGly: 4.878 ± 0.672
1.009LeuHis: 1.009 ± 0.2
4.542LeuIle: 4.542 ± 0.562
8.999LeuLys: 8.999 ± 1.05
6.224LeuLeu: 6.224 ± 1.154
2.019LeuMet: 2.019 ± 0.422
5.214LeuAsn: 5.214 ± 0.753
3.028LeuPro: 3.028 ± 0.598
4.121LeuGln: 4.121 ± 0.499
3.196LeuArg: 3.196 ± 0.633
6.981LeuSer: 6.981 ± 0.884
4.205LeuThr: 4.205 ± 0.445
4.373LeuVal: 4.373 ± 0.38
0.589LeuTrp: 0.589 ± 0.237
2.944LeuTyr: 2.944 ± 0.486
0.0LeuXaa: 0.0 ± 0.0
Met
2.271MetAla: 2.271 ± 0.435
0.589MetCys: 0.589 ± 0.269
1.093MetAsp: 1.093 ± 0.389
1.093MetGlu: 1.093 ± 0.313
0.925MetPhe: 0.925 ± 0.312
1.009MetGly: 1.009 ± 0.231
0.336MetHis: 0.336 ± 0.187
1.346MetIle: 1.346 ± 0.359
1.766MetLys: 1.766 ± 0.412
1.934MetLeu: 1.934 ± 0.395
0.252MetMet: 0.252 ± 0.143
1.766MetAsn: 1.766 ± 0.347
1.43MetPro: 1.43 ± 0.291
0.589MetGln: 0.589 ± 0.177
0.673MetArg: 0.673 ± 0.244
1.514MetSer: 1.514 ± 0.387
1.346MetThr: 1.346 ± 0.348
1.262MetVal: 1.262 ± 0.367
0.252MetTrp: 0.252 ± 0.135
0.505MetTyr: 0.505 ± 0.193
0.0MetXaa: 0.0 ± 0.0
Asn
4.458AsnAla: 4.458 ± 1.026
0.673AsnCys: 0.673 ± 0.267
2.691AsnAsp: 2.691 ± 0.49
3.869AsnGlu: 3.869 ± 0.507
3.364AsnPhe: 3.364 ± 0.552
5.13AsnGly: 5.13 ± 0.719
0.925AsnHis: 0.925 ± 0.264
6.224AsnIle: 6.224 ± 0.901
5.299AsnLys: 5.299 ± 0.766
4.794AsnLeu: 4.794 ± 0.657
1.177AsnMet: 1.177 ± 0.279
4.542AsnAsn: 4.542 ± 0.815
2.019AsnPro: 2.019 ± 0.485
1.598AsnGln: 1.598 ± 0.365
2.271AsnArg: 2.271 ± 0.394
3.448AsnSer: 3.448 ± 0.671
4.458AsnThr: 4.458 ± 0.811
4.121AsnVal: 4.121 ± 0.796
0.589AsnTrp: 0.589 ± 0.198
2.187AsnTyr: 2.187 ± 0.397
0.0AsnXaa: 0.0 ± 0.0
Pro
2.523ProAla: 2.523 ± 0.519
0.505ProCys: 0.505 ± 0.188
1.43ProAsp: 1.43 ± 0.292
2.103ProGlu: 2.103 ± 0.4
1.093ProPhe: 1.093 ± 0.273
1.682ProGly: 1.682 ± 0.386
0.505ProHis: 0.505 ± 0.19
1.177ProIle: 1.177 ± 0.349
3.532ProLys: 3.532 ± 0.499
2.019ProLeu: 2.019 ± 0.502
1.093ProMet: 1.093 ± 0.3
2.691ProAsn: 2.691 ± 0.364
1.346ProPro: 1.346 ± 0.429
0.841ProGln: 0.841 ± 0.206
1.346ProArg: 1.346 ± 0.375
2.271ProSer: 2.271 ± 0.347
2.523ProThr: 2.523 ± 0.494
2.355ProVal: 2.355 ± 0.439
0.505ProTrp: 0.505 ± 0.204
1.177ProTyr: 1.177 ± 0.391
0.0ProXaa: 0.0 ± 0.0
Gln
3.869GlnAla: 3.869 ± 0.643
0.421GlnCys: 0.421 ± 0.177
2.355GlnAsp: 2.355 ± 0.475
2.187GlnGlu: 2.187 ± 0.507
1.682GlnPhe: 1.682 ± 0.369
2.187GlnGly: 2.187 ± 0.474
1.43GlnHis: 1.43 ± 0.349
3.196GlnIle: 3.196 ± 0.625
3.028GlnLys: 3.028 ± 0.473
3.028GlnLeu: 3.028 ± 0.767
0.925GlnMet: 0.925 ± 0.238
2.355GlnAsn: 2.355 ± 0.536
0.673GlnPro: 0.673 ± 0.229
1.514GlnGln: 1.514 ± 0.452
0.925GlnArg: 0.925 ± 0.325
2.607GlnSer: 2.607 ± 0.439
2.944GlnThr: 2.944 ± 0.555
1.85GlnVal: 1.85 ± 0.335
0.336GlnTrp: 0.336 ± 0.14
1.514GlnTyr: 1.514 ± 0.365
0.0GlnXaa: 0.0 ± 0.0
Arg
1.934ArgAla: 1.934 ± 0.395
0.0ArgCys: 0.0 ± 0.0
2.944ArgAsp: 2.944 ± 0.505
2.607ArgGlu: 2.607 ± 0.497
2.523ArgPhe: 2.523 ± 0.456
1.766ArgGly: 1.766 ± 0.393
0.757ArgHis: 0.757 ± 0.257
2.944ArgIle: 2.944 ± 0.601
4.205ArgLys: 4.205 ± 0.941
3.364ArgLeu: 3.364 ± 0.472
1.177ArgMet: 1.177 ± 0.329
2.187ArgAsn: 2.187 ± 0.416
1.262ArgPro: 1.262 ± 0.226
1.177ArgGln: 1.177 ± 0.283
1.682ArgArg: 1.682 ± 0.386
3.112ArgSer: 3.112 ± 0.497
2.523ArgThr: 2.523 ± 0.39
1.85ArgVal: 1.85 ± 0.387
0.505ArgTrp: 0.505 ± 0.191
1.934ArgTyr: 1.934 ± 0.667
0.0ArgXaa: 0.0 ± 0.0
Ser
5.383SerAla: 5.383 ± 1.072
0.925SerCys: 0.925 ± 0.304
3.616SerAsp: 3.616 ± 0.612
3.196SerGlu: 3.196 ± 0.528
2.355SerPhe: 2.355 ± 0.488
6.392SerGly: 6.392 ± 1.215
1.262SerHis: 1.262 ± 0.323
5.214SerIle: 5.214 ± 0.556
5.719SerLys: 5.719 ± 0.794
4.878SerLeu: 4.878 ± 0.834
1.43SerMet: 1.43 ± 0.351
4.205SerAsn: 4.205 ± 0.545
1.598SerPro: 1.598 ± 0.42
2.944SerGln: 2.944 ± 0.389
2.691SerArg: 2.691 ± 0.51
5.046SerSer: 5.046 ± 1.132
4.037SerThr: 4.037 ± 1.015
4.205SerVal: 4.205 ± 0.653
0.841SerTrp: 0.841 ± 0.242
1.934SerTyr: 1.934 ± 0.459
0.0SerXaa: 0.0 ± 0.0
Thr
5.551ThrAla: 5.551 ± 1.187
0.589ThrCys: 0.589 ± 0.23
4.373ThrAsp: 4.373 ± 0.564
3.364ThrGlu: 3.364 ± 0.519
2.355ThrPhe: 2.355 ± 0.446
4.458ThrGly: 4.458 ± 0.792
0.757ThrHis: 0.757 ± 0.244
6.056ThrIle: 6.056 ± 0.735
4.542ThrLys: 4.542 ± 0.617
4.373ThrLeu: 4.373 ± 0.594
1.177ThrMet: 1.177 ± 0.376
4.458ThrAsn: 4.458 ± 1.03
2.691ThrPro: 2.691 ± 0.474
3.364ThrGln: 3.364 ± 0.567
2.271ThrArg: 2.271 ± 0.405
4.542ThrSer: 4.542 ± 0.9
5.046ThrThr: 5.046 ± 0.929
4.373ThrVal: 4.373 ± 1.07
0.589ThrTrp: 0.589 ± 0.183
1.85ThrTyr: 1.85 ± 0.467
0.0ThrXaa: 0.0 ± 0.0
Val
5.383ValAla: 5.383 ± 0.97
0.673ValCys: 0.673 ± 0.278
3.364ValAsp: 3.364 ± 0.562
3.616ValGlu: 3.616 ± 0.715
1.262ValPhe: 1.262 ± 0.302
3.869ValGly: 3.869 ± 0.708
1.093ValHis: 1.093 ± 0.345
4.121ValIle: 4.121 ± 0.51
4.71ValLys: 4.71 ± 0.605
3.532ValLeu: 3.532 ± 0.445
1.346ValMet: 1.346 ± 0.308
4.121ValAsn: 4.121 ± 1.186
2.103ValPro: 2.103 ± 0.357
2.187ValGln: 2.187 ± 0.423
2.019ValArg: 2.019 ± 0.502
5.299ValSer: 5.299 ± 0.609
4.289ValThr: 4.289 ± 0.6
2.523ValVal: 2.523 ± 0.451
0.336ValTrp: 0.336 ± 0.141
1.934ValTyr: 1.934 ± 0.345
0.0ValXaa: 0.0 ± 0.0
Trp
0.505TrpAla: 0.505 ± 0.242
0.084TrpCys: 0.084 ± 0.083
0.757TrpAsp: 0.757 ± 0.248
0.336TrpGlu: 0.336 ± 0.147
0.336TrpPhe: 0.336 ± 0.154
0.673TrpGly: 0.673 ± 0.285
0.084TrpHis: 0.084 ± 0.063
1.009TrpIle: 1.009 ± 0.305
1.177TrpLys: 1.177 ± 0.364
1.514TrpLeu: 1.514 ± 0.489
0.168TrpMet: 0.168 ± 0.127
0.336TrpAsn: 0.336 ± 0.151
0.421TrpPro: 0.421 ± 0.156
0.589TrpGln: 0.589 ± 0.195
0.589TrpArg: 0.589 ± 0.216
0.673TrpSer: 0.673 ± 0.251
1.093TrpThr: 1.093 ± 0.281
0.505TrpVal: 0.505 ± 0.169
0.084TrpTrp: 0.084 ± 0.083
0.252TrpTyr: 0.252 ± 0.143
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.019TyrAla: 2.019 ± 0.365
0.336TyrCys: 0.336 ± 0.198
1.934TyrAsp: 1.934 ± 0.337
2.019TyrGlu: 2.019 ± 0.349
1.262TyrPhe: 1.262 ± 0.381
2.691TyrGly: 2.691 ± 0.441
0.252TyrHis: 0.252 ± 0.155
3.196TyrIle: 3.196 ± 0.449
2.691TyrLys: 2.691 ± 0.453
3.112TyrLeu: 3.112 ± 0.54
0.589TyrMet: 0.589 ± 0.241
2.607TyrAsn: 2.607 ± 0.383
1.093TyrPro: 1.093 ± 0.262
1.346TyrGln: 1.346 ± 0.429
1.009TyrArg: 1.009 ± 0.323
2.523TyrSer: 2.523 ± 0.451
2.355TyrThr: 2.355 ± 0.512
2.775TyrVal: 2.775 ± 0.528
0.336TyrTrp: 0.336 ± 0.161
1.177TyrTyr: 1.177 ± 0.4
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (11891 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski