Amino acid dipeptide frequency for Psychrobacter phage Psymv2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
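As a rough illustration of how such frequencies can be obtained, the following minimal Python sketch counts overlapping residue pairs and compares them with the uniform expectation. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter sequence encoding, the choice not to count pairs that span two proteins, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein.
        # Assumption: pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for 20 x 20 dipeptides: 2.5 per mille, i.e. 0.25 %.
UNIFORM_PER_MILLE = 1000.0 / 400

# Made-up toy sequences (one-letter codes), not taken from the Psymv2 proteome.
toy = ["MAAK", "AAG"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, value, "above" if value > UNIFORM_PER_MILLE else "not above")
```

With only five pairs in the toy input every observed dipeptide is far above 2.5‰; a real proteome spreads its counts over most of the 400 cells.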

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
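The unit conversion and the comparison with the uniform expectation can be sketched in a few lines of Python. The helper names below are hypothetical, and the labelling rule (a strict comparison against 2.5‰ that ignores the error estimates) is an assumption; the exact rule behind the red and blue marking is not stated on this page.

```python
# Uniform expectation for 400 dipeptides: 2.5 per mille = 0.25 %.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def representation(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation.

    Assumption: a plain comparison that ignores the +/- error estimate.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla and CysCys.
for name, value in [("AlaAla", 11.624), ("CysCys", 0.0)]:
    print(name, per_mille_to_percent(value), representation(value))
```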

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.624 ± 1.34
AlaCys: 1.171 ± 0.373
AlaAsp: 6.758 ± 0.787
AlaGlu: 5.767 ± 0.876
AlaPhe: 2.974 ± 0.451
AlaGly: 6.217 ± 0.616
AlaHis: 1.532 ± 0.392
AlaIle: 5.767 ± 0.827
AlaLys: 5.857 ± 0.773
AlaLeu: 7.749 ± 0.952
AlaMet: 2.883 ± 0.422
AlaAsn: 5.046 ± 0.845
AlaPro: 2.343 ± 0.547
AlaGln: 4.325 ± 0.577
AlaArg: 4.956 ± 0.838
AlaSer: 7.209 ± 1.018
AlaThr: 6.578 ± 1.465
AlaVal: 5.947 ± 0.858
AlaTrp: 1.352 ± 0.283
AlaTyr: 2.253 ± 0.381
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.631 ± 0.282
CysCys: 0.0 ± 0.0
CysAsp: 0.721 ± 0.262
CysGlu: 0.811 ± 0.254
CysPhe: 0.09 ± 0.079
CysGly: 0.451 ± 0.189
CysHis: 0.451 ± 0.256
CysIle: 0.451 ± 0.206
CysLys: 0.541 ± 0.216
CysLeu: 0.541 ± 0.233
CysMet: 0.451 ± 0.212
CysAsn: 0.27 ± 0.117
CysPro: 0.27 ± 0.158
CysGln: 0.631 ± 0.262
CysArg: 0.721 ± 0.327
CysSer: 0.811 ± 0.29
CysThr: 0.631 ± 0.246
CysVal: 0.36 ± 0.188
CysTrp: 0.0 ± 0.0
CysTyr: 0.27 ± 0.178
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.677 ± 0.683
AspCys: 0.631 ± 0.239
AspAsp: 4.686 ± 0.936
AspGlu: 2.974 ± 0.457
AspPhe: 2.343 ± 0.58
AspGly: 6.217 ± 0.816
AspHis: 0.631 ± 0.261
AspIle: 4.505 ± 0.579
AspLys: 4.776 ± 0.834
AspLeu: 4.595 ± 0.794
AspMet: 1.982 ± 0.528
AspAsn: 3.514 ± 0.809
AspPro: 1.442 ± 0.378
AspGln: 1.081 ± 0.271
AspArg: 2.974 ± 0.441
AspSer: 4.866 ± 0.586
AspThr: 4.325 ± 1.187
AspVal: 4.505 ± 0.844
AspTrp: 1.171 ± 0.297
AspTyr: 1.622 ± 0.377
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.587 ± 0.637
GluCys: 0.451 ± 0.215
GluAsp: 2.703 ± 0.548
GluGlu: 0.631 ± 0.26
GluPhe: 2.343 ± 0.468
GluGly: 2.523 ± 0.505
GluHis: 1.982 ± 0.498
GluIle: 4.055 ± 0.748
GluLys: 3.334 ± 0.825
GluLeu: 6.578 ± 1.041
GluMet: 1.802 ± 0.501
GluAsn: 3.064 ± 0.578
GluPro: 1.712 ± 0.328
GluGln: 3.154 ± 0.543
GluArg: 3.604 ± 0.52
GluSer: 2.703 ± 0.488
GluThr: 3.424 ± 0.564
GluVal: 4.415 ± 0.737
GluTrp: 0.811 ± 0.298
GluTyr: 2.253 ± 0.412
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.514 ± 0.719
PheCys: 0.27 ± 0.184
PheAsp: 2.613 ± 0.35
PheGlu: 2.703 ± 0.461
PhePhe: 1.171 ± 0.437
PheGly: 3.154 ± 0.616
PheHis: 0.27 ± 0.15
PheIle: 1.802 ± 0.31
PheLys: 2.253 ± 0.487
PheLeu: 2.072 ± 0.534
PheMet: 1.171 ± 0.342
PheAsn: 2.253 ± 0.413
PhePro: 0.721 ± 0.219
PheGln: 0.991 ± 0.269
PheArg: 1.261 ± 0.35
PheSer: 2.072 ± 0.448
PheThr: 1.712 ± 0.436
PheVal: 1.892 ± 0.352
PheTrp: 0.18 ± 0.11
PheTyr: 1.532 ± 0.379
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.686 ± 0.835
GlyCys: 0.36 ± 0.158
GlyAsp: 3.784 ± 0.533
GlyGlu: 3.604 ± 0.578
GlyPhe: 3.334 ± 0.63
GlyGly: 4.595 ± 0.683
GlyHis: 1.712 ± 0.381
GlyIle: 4.055 ± 0.657
GlyLys: 4.235 ± 0.671
GlyLeu: 6.668 ± 0.62
GlyMet: 2.343 ± 0.453
GlyAsn: 4.145 ± 0.488
GlyPro: 0.451 ± 0.208
GlyGln: 1.622 ± 0.423
GlyArg: 3.334 ± 0.487
GlySer: 3.875 ± 0.551
GlyThr: 3.334 ± 0.506
GlyVal: 5.677 ± 0.64
GlyTrp: 0.811 ± 0.322
GlyTyr: 2.523 ± 0.447
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.982 ± 0.523
HisCys: 0.36 ± 0.225
HisAsp: 1.352 ± 0.458
HisGlu: 1.171 ± 0.342
HisPhe: 0.811 ± 0.236
HisGly: 1.261 ± 0.31
HisHis: 0.811 ± 0.386
HisIle: 1.081 ± 0.342
HisLys: 1.352 ± 0.445
HisLeu: 1.712 ± 0.404
HisMet: 0.27 ± 0.15
HisAsn: 0.901 ± 0.228
HisPro: 0.811 ± 0.283
HisGln: 0.36 ± 0.199
HisArg: 1.261 ± 0.362
HisSer: 0.541 ± 0.233
HisThr: 1.442 ± 0.365
HisVal: 1.352 ± 0.497
HisTrp: 0.27 ± 0.156
HisTyr: 0.721 ± 0.227
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.488 ± 1.246
IleCys: 0.721 ± 0.241
IleAsp: 7.299 ± 0.717
IleGlu: 4.415 ± 0.806
IlePhe: 1.261 ± 0.34
IleGly: 3.965 ± 0.791
IleHis: 0.721 ± 0.279
IleIle: 3.784 ± 0.512
IleLys: 4.145 ± 0.611
IleLeu: 2.793 ± 0.612
IleMet: 1.352 ± 0.387
IleAsn: 4.145 ± 0.562
IlePro: 2.703 ± 0.535
IleGln: 2.433 ± 0.519
IleArg: 1.892 ± 0.366
IleSer: 4.145 ± 0.815
IleThr: 5.046 ± 1.313
IleVal: 3.154 ± 0.538
IleTrp: 0.811 ± 0.239
IleTyr: 1.982 ± 0.404
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.578 ± 0.826
LysCys: 0.36 ± 0.187
LysAsp: 3.694 ± 0.503
LysGlu: 4.505 ± 0.844
LysPhe: 1.622 ± 0.378
LysGly: 3.875 ± 0.629
LysHis: 0.541 ± 0.255
LysIle: 3.514 ± 0.635
LysLys: 3.514 ± 0.688
LysLeu: 5.136 ± 0.743
LysMet: 1.352 ± 0.434
LysAsn: 3.154 ± 0.677
LysPro: 2.253 ± 0.396
LysGln: 2.703 ± 0.553
LysArg: 3.154 ± 0.599
LysSer: 4.415 ± 0.528
LysThr: 4.866 ± 0.537
LysVal: 2.974 ± 0.615
LysTrp: 1.081 ± 0.337
LysTyr: 1.982 ± 0.577
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.938 ± 0.876
LeuCys: 0.901 ± 0.352
LeuAsp: 5.226 ± 0.582
LeuGlu: 5.136 ± 0.689
LeuPhe: 2.793 ± 0.541
LeuGly: 5.316 ± 0.657
LeuHis: 0.991 ± 0.321
LeuIle: 5.587 ± 0.824
LeuLys: 4.866 ± 0.757
LeuLeu: 6.127 ± 1.102
LeuMet: 2.433 ± 0.523
LeuAsn: 4.866 ± 0.723
LeuPro: 3.514 ± 0.505
LeuGln: 4.235 ± 0.499
LeuArg: 3.875 ± 0.863
LeuSer: 5.587 ± 0.763
LeuThr: 5.136 ± 0.831
LeuVal: 4.145 ± 0.689
LeuTrp: 0.721 ± 0.238
LeuTyr: 2.072 ± 0.364
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.793 ± 0.828
MetCys: 0.09 ± 0.104
MetAsp: 1.442 ± 0.336
MetGlu: 0.901 ± 0.284
MetPhe: 0.721 ± 0.21
MetGly: 2.343 ± 0.419
MetHis: 1.081 ± 0.348
MetIle: 2.613 ± 0.476
MetLys: 1.622 ± 0.461
MetLeu: 2.974 ± 0.511
MetMet: 0.901 ± 0.26
MetAsn: 1.261 ± 0.327
MetPro: 1.261 ± 0.334
MetGln: 1.261 ± 0.311
MetArg: 1.171 ± 0.29
MetSer: 2.613 ± 0.448
MetThr: 2.433 ± 0.553
MetVal: 1.352 ± 0.386
MetTrp: 0.09 ± 0.1
MetTyr: 0.811 ± 0.233
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.496 ± 0.847
AsnCys: 0.36 ± 0.195
AsnAsp: 2.974 ± 0.354
AsnGlu: 3.064 ± 0.538
AsnPhe: 1.622 ± 0.411
AsnGly: 4.325 ± 0.512
AsnHis: 1.081 ± 0.425
AsnIle: 2.974 ± 0.492
AsnLys: 2.974 ± 0.474
AsnLeu: 3.334 ± 0.485
AsnMet: 1.352 ± 0.345
AsnAsn: 4.235 ± 0.398
AsnPro: 2.974 ± 0.53
AsnGln: 3.064 ± 0.705
AsnArg: 1.622 ± 0.393
AsnSer: 3.334 ± 0.571
AsnThr: 4.325 ± 0.963
AsnVal: 4.055 ± 0.656
AsnTrp: 0.901 ± 0.239
AsnTyr: 1.622 ± 0.387
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.055 ± 0.558
ProCys: 0.541 ± 0.217
ProAsp: 1.712 ± 0.437
ProGlu: 2.883 ± 0.413
ProPhe: 1.171 ± 0.288
ProGly: 0.991 ± 0.273
ProHis: 0.721 ± 0.231
ProIle: 1.892 ± 0.479
ProLys: 1.802 ± 0.365
ProLeu: 2.343 ± 0.644
ProMet: 1.442 ± 0.282
ProAsn: 2.163 ± 0.502
ProPro: 0.901 ± 0.261
ProGln: 1.261 ± 0.277
ProArg: 1.081 ± 0.293
ProSer: 2.883 ± 0.599
ProThr: 1.802 ± 0.518
ProVal: 2.253 ± 0.433
ProTrp: 0.631 ± 0.228
ProTyr: 0.811 ± 0.261
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.235 ± 0.675
GlnCys: 0.36 ± 0.188
GlnAsp: 1.442 ± 0.353
GlnGlu: 1.171 ± 0.316
GlnPhe: 1.442 ± 0.342
GlnGly: 1.892 ± 0.437
GlnHis: 0.811 ± 0.314
GlnIle: 2.793 ± 0.55
GlnLys: 2.072 ± 0.458
GlnLeu: 4.776 ± 0.949
GlnMet: 1.532 ± 0.435
GlnAsn: 1.352 ± 0.291
GlnPro: 1.802 ± 0.477
GlnGln: 2.343 ± 0.542
GlnArg: 3.064 ± 0.415
GlnSer: 3.154 ± 0.579
GlnThr: 3.334 ± 0.69
GlnVal: 2.163 ± 0.433
GlnTrp: 0.18 ± 0.157
GlnTyr: 1.442 ± 0.385
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.703 ± 0.464
ArgCys: 0.631 ± 0.245
ArgAsp: 2.883 ± 0.483
ArgGlu: 2.883 ± 0.496
ArgPhe: 2.163 ± 0.365
ArgGly: 1.892 ± 0.452
ArgHis: 1.982 ± 0.445
ArgIle: 4.055 ± 0.954
ArgLys: 2.974 ± 0.508
ArgLeu: 5.226 ± 0.91
ArgMet: 0.991 ± 0.334
ArgAsn: 1.892 ± 0.449
ArgPro: 1.532 ± 0.307
ArgGln: 2.433 ± 0.567
ArgArg: 1.622 ± 0.376
ArgSer: 3.154 ± 0.695
ArgThr: 2.343 ± 0.752
ArgVal: 2.883 ± 0.472
ArgTrp: 0.811 ± 0.315
ArgTyr: 1.352 ± 0.334
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.938 ± 1.121
SerCys: 0.451 ± 0.192
SerAsp: 4.145 ± 0.777
SerGlu: 5.136 ± 0.64
SerPhe: 2.343 ± 0.464
SerGly: 4.866 ± 0.683
SerHis: 1.171 ± 0.421
SerIle: 5.316 ± 0.84
SerLys: 3.875 ± 0.55
SerLeu: 5.046 ± 0.608
SerMet: 1.892 ± 0.449
SerAsn: 3.875 ± 0.806
SerPro: 2.072 ± 0.464
SerGln: 2.793 ± 0.467
SerArg: 3.784 ± 0.436
SerSer: 4.686 ± 0.883
SerThr: 3.784 ± 0.582
SerVal: 3.965 ± 0.529
SerTrp: 0.36 ± 0.182
SerTyr: 1.261 ± 0.293
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.74 ± 2.211
ThrCys: 0.27 ± 0.154
ThrAsp: 3.875 ± 0.563
ThrGlu: 3.424 ± 0.579
ThrPhe: 2.072 ± 0.359
ThrGly: 5.587 ± 0.661
ThrHis: 1.352 ± 0.372
ThrIle: 2.523 ± 0.548
ThrLys: 4.145 ± 0.556
ThrLeu: 4.866 ± 0.749
ThrMet: 2.072 ± 0.423
ThrAsn: 3.875 ± 0.872
ThrPro: 2.974 ± 0.516
ThrGln: 2.974 ± 0.6
ThrArg: 2.072 ± 0.477
ThrSer: 4.325 ± 0.692
ThrThr: 4.325 ± 1.067
ThrVal: 5.406 ± 1.239
ThrTrp: 0.631 ± 0.235
ThrTyr: 1.622 ± 0.347
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.307 ± 0.925
ValCys: 0.18 ± 0.14
ValAsp: 4.505 ± 0.723
ValGlu: 3.694 ± 0.62
ValPhe: 1.982 ± 0.434
ValGly: 3.875 ± 0.742
ValHis: 1.081 ± 0.393
ValIle: 4.055 ± 0.74
ValLys: 4.956 ± 0.667
ValLeu: 4.595 ± 0.5
ValMet: 1.802 ± 0.387
ValAsn: 3.154 ± 0.589
ValPro: 2.523 ± 0.617
ValGln: 1.712 ± 0.292
ValArg: 2.793 ± 0.433
ValSer: 4.235 ± 0.628
ValThr: 5.226 ± 1.026
ValVal: 3.875 ± 0.522
ValTrp: 0.901 ± 0.242
ValTyr: 1.892 ± 0.401
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.171 ± 0.309
TrpCys: 0.09 ± 0.092
TrpAsp: 0.631 ± 0.228
TrpGlu: 0.631 ± 0.24
TrpPhe: 0.451 ± 0.252
TrpGly: 0.631 ± 0.232
TrpHis: 0.27 ± 0.146
TrpIle: 0.991 ± 0.259
TrpLys: 0.18 ± 0.129
TrpLeu: 1.081 ± 0.41
TrpMet: 0.721 ± 0.313
TrpAsn: 0.631 ± 0.183
TrpPro: 0.27 ± 0.158
TrpGln: 0.991 ± 0.238
TrpArg: 0.27 ± 0.131
TrpSer: 1.261 ± 0.398
TrpThr: 0.811 ± 0.244
TrpVal: 0.991 ± 0.266
TrpTrp: 0.18 ± 0.138
TrpTyr: 0.451 ± 0.223
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.163 ± 0.395
TyrCys: 0.901 ± 0.295
TyrAsp: 2.072 ± 0.411
TyrGlu: 1.622 ± 0.445
TyrPhe: 0.901 ± 0.271
TyrGly: 1.171 ± 0.338
TyrHis: 0.631 ± 0.259
TyrIle: 1.442 ± 0.475
TyrLys: 1.802 ± 0.48
TyrLeu: 2.253 ± 0.425
TyrMet: 0.901 ± 0.232
TyrAsn: 1.982 ± 0.473
TyrPro: 0.811 ± 0.302
TyrGln: 0.991 ± 0.312
TyrArg: 1.892 ± 0.408
TyrSer: 1.892 ± 0.393
TyrThr: 2.253 ± 0.456
TyrVal: 1.982 ± 0.449
TyrTrp: 0.811 ± 0.268
TyrTyr: 1.442 ± 0.366
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11099 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
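A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the Proteome-pI code: the function names are hypothetical, the error is taken to be the sample standard deviation of the resampled estimates (the page does not say which spread measure is reported), and the sequences are made-up one-letter toy data.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping residue pairs."""
    total = hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: report the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Made-up toy proteome, not the 49 proteins of Psymv2.
toy = ["MAAK", "AAG", "GKLV", "MKAA"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residue pairs keeps each protein's internal composition intact. A dipeptide that is absent from every protein gets an error of exactly zero, which matches the `0.0 ± 0.0` entries for the Xaa pairs above.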

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski