Amino acid dipepetide frequency for Pelagibacter phage HTVC121P

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.408AlaAla: 4.408 ± 0.973
0.523AlaCys: 0.523 ± 0.266
5.23AlaAsp: 5.23 ± 0.704
4.782AlaGlu: 4.782 ± 0.912
2.391AlaPhe: 2.391 ± 0.522
5.081AlaGly: 5.081 ± 1.174
0.672AlaHis: 0.672 ± 0.2
5.753AlaIle: 5.753 ± 0.82
6.65AlaLys: 6.65 ± 0.748
5.23AlaLeu: 5.23 ± 0.737
2.391AlaMet: 2.391 ± 0.342
6.65AlaAsn: 6.65 ± 1.555
2.391AlaPro: 2.391 ± 0.505
2.466AlaGln: 2.466 ± 0.578
2.092AlaArg: 2.092 ± 0.414
5.977AlaSer: 5.977 ± 0.978
5.155AlaThr: 5.155 ± 1.116
4.707AlaVal: 4.707 ± 0.9
0.971AlaTrp: 0.971 ± 0.225
2.391AlaTyr: 2.391 ± 0.394
0.0AlaXaa: 0.0 ± 0.0
Cys
0.523CysAla: 0.523 ± 0.16
0.149CysCys: 0.149 ± 0.108
0.374CysAsp: 0.374 ± 0.202
0.523CysGlu: 0.523 ± 0.202
0.448CysPhe: 0.448 ± 0.167
0.374CysGly: 0.374 ± 0.174
0.299CysHis: 0.299 ± 0.142
0.523CysIle: 0.523 ± 0.247
1.195CysLys: 1.195 ± 0.422
0.822CysLeu: 0.822 ± 0.346
0.224CysMet: 0.224 ± 0.135
0.523CysAsn: 0.523 ± 0.225
0.299CysPro: 0.299 ± 0.147
0.299CysGln: 0.299 ± 0.137
0.523CysArg: 0.523 ± 0.22
0.224CysSer: 0.224 ± 0.133
0.672CysThr: 0.672 ± 0.193
0.598CysVal: 0.598 ± 0.25
0.299CysTrp: 0.299 ± 0.186
0.374CysTyr: 0.374 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
4.857AspAla: 4.857 ± 0.806
0.523AspCys: 0.523 ± 0.238
3.063AspAsp: 3.063 ± 0.51
3.661AspGlu: 3.661 ± 0.635
2.241AspPhe: 2.241 ± 0.342
3.96AspGly: 3.96 ± 0.597
0.448AspHis: 0.448 ± 0.168
4.483AspIle: 4.483 ± 0.472
5.305AspLys: 5.305 ± 0.803
5.753AspLeu: 5.753 ± 0.649
1.569AspMet: 1.569 ± 0.338
2.69AspAsn: 2.69 ± 0.385
1.868AspPro: 1.868 ± 0.516
0.598AspGln: 0.598 ± 0.24
3.213AspArg: 3.213 ± 0.603
3.213AspSer: 3.213 ± 0.514
3.288AspThr: 3.288 ± 0.536
3.736AspVal: 3.736 ± 0.683
0.747AspTrp: 0.747 ± 0.241
2.764AspTyr: 2.764 ± 0.497
0.0AspXaa: 0.0 ± 0.0
Glu
5.006GluAla: 5.006 ± 0.66
1.046GluCys: 1.046 ± 0.377
3.811GluAsp: 3.811 ± 0.593
4.334GluGlu: 4.334 ± 0.667
2.241GluPhe: 2.241 ± 0.506
4.259GluGly: 4.259 ± 0.525
1.569GluHis: 1.569 ± 0.339
4.782GluIle: 4.782 ± 0.634
6.426GluLys: 6.426 ± 1.073
5.454GluLeu: 5.454 ± 0.851
2.017GluMet: 2.017 ± 0.422
3.362GluAsn: 3.362 ± 0.462
1.868GluPro: 1.868 ± 0.419
2.764GluGln: 2.764 ± 0.714
2.764GluArg: 2.764 ± 0.574
2.615GluSer: 2.615 ± 0.456
4.483GluThr: 4.483 ± 0.596
3.811GluVal: 3.811 ± 0.54
1.195GluTrp: 1.195 ± 0.367
2.615GluTyr: 2.615 ± 0.614
0.0GluXaa: 0.0 ± 0.0
Phe
2.466PheAla: 2.466 ± 0.482
0.299PheCys: 0.299 ± 0.193
2.914PheAsp: 2.914 ± 0.497
2.092PheGlu: 2.092 ± 0.371
1.27PhePhe: 1.27 ± 0.438
3.213PheGly: 3.213 ± 0.654
0.672PheHis: 0.672 ± 0.235
1.943PheIle: 1.943 ± 0.354
3.661PheLys: 3.661 ± 0.564
3.661PheLeu: 3.661 ± 0.534
1.644PheMet: 1.644 ± 0.399
2.54PheAsn: 2.54 ± 0.551
0.897PhePro: 0.897 ± 0.343
1.195PheGln: 1.195 ± 0.344
1.42PheArg: 1.42 ± 0.257
2.54PheSer: 2.54 ± 0.412
2.989PheThr: 2.989 ± 0.563
2.54PheVal: 2.54 ± 0.369
0.523PheTrp: 0.523 ± 0.158
1.046PheTyr: 1.046 ± 0.204
0.0PheXaa: 0.0 ± 0.0
Gly
4.259GlyAla: 4.259 ± 0.639
0.374GlyCys: 0.374 ± 0.152
3.288GlyAsp: 3.288 ± 0.421
4.558GlyGlu: 4.558 ± 0.56
3.736GlyPhe: 3.736 ± 0.541
3.96GlyGly: 3.96 ± 0.686
1.27GlyHis: 1.27 ± 0.355
4.035GlyIle: 4.035 ± 0.572
5.828GlyLys: 5.828 ± 0.896
4.408GlyLeu: 4.408 ± 0.578
1.868GlyMet: 1.868 ± 0.398
4.035GlyAsn: 4.035 ± 0.666
0.0GlyPro: 0.0 ± 0.0
2.092GlyGln: 2.092 ± 0.311
2.316GlyArg: 2.316 ± 0.497
5.006GlySer: 5.006 ± 0.973
6.426GlyThr: 6.426 ± 1.001
4.109GlyVal: 4.109 ± 0.479
0.971GlyTrp: 0.971 ± 0.277
2.764GlyTyr: 2.764 ± 0.559
0.0GlyXaa: 0.0 ± 0.0
His
1.195HisAla: 1.195 ± 0.269
0.374HisCys: 0.374 ± 0.172
0.822HisAsp: 0.822 ± 0.24
1.27HisGlu: 1.27 ± 0.408
1.121HisPhe: 1.121 ± 0.324
1.121HisGly: 1.121 ± 0.26
0.523HisHis: 0.523 ± 0.197
1.569HisIle: 1.569 ± 0.337
1.27HisLys: 1.27 ± 0.361
1.718HisLeu: 1.718 ± 0.399
0.598HisMet: 0.598 ± 0.224
0.672HisAsn: 0.672 ± 0.183
0.523HisPro: 0.523 ± 0.159
0.822HisGln: 0.822 ± 0.246
0.672HisArg: 0.672 ± 0.295
1.644HisSer: 1.644 ± 0.426
1.42HisThr: 1.42 ± 0.286
0.822HisVal: 0.822 ± 0.264
0.374HisTrp: 0.374 ± 0.163
0.598HisTyr: 0.598 ± 0.187
0.0HisXaa: 0.0 ± 0.0
Ile
5.081IleAla: 5.081 ± 0.624
0.598IleCys: 0.598 ± 0.246
3.885IleAsp: 3.885 ± 0.44
4.857IleGlu: 4.857 ± 0.799
2.316IlePhe: 2.316 ± 0.506
4.632IleGly: 4.632 ± 0.704
1.27IleHis: 1.27 ± 0.415
4.558IleIle: 4.558 ± 0.769
6.724IleLys: 6.724 ± 0.772
3.661IleLeu: 3.661 ± 0.554
2.092IleMet: 2.092 ± 0.364
4.184IleAsn: 4.184 ± 0.657
2.466IlePro: 2.466 ± 0.355
1.943IleGln: 1.943 ± 0.39
2.839IleArg: 2.839 ± 0.481
4.334IleSer: 4.334 ± 0.516
4.931IleThr: 4.931 ± 0.973
2.914IleVal: 2.914 ± 0.402
0.598IleTrp: 0.598 ± 0.24
2.092IleTyr: 2.092 ± 0.463
0.0IleXaa: 0.0 ± 0.0
Lys
5.38LysAla: 5.38 ± 0.825
0.747LysCys: 0.747 ± 0.424
4.931LysAsp: 4.931 ± 0.691
8.592LysGlu: 8.592 ± 1.166
4.109LysPhe: 4.109 ± 0.588
4.035LysGly: 4.035 ± 0.562
2.241LysHis: 2.241 ± 0.317
6.276LysIle: 6.276 ± 0.7
7.546LysLys: 7.546 ± 1.089
7.322LysLeu: 7.322 ± 0.897
2.092LysMet: 2.092 ± 0.427
3.437LysAsn: 3.437 ± 0.546
3.437LysPro: 3.437 ± 0.511
3.063LysGln: 3.063 ± 0.593
4.109LysArg: 4.109 ± 0.845
5.23LysSer: 5.23 ± 0.651
4.632LysThr: 4.632 ± 0.566
4.632LysVal: 4.632 ± 0.774
0.897LysTrp: 0.897 ± 0.298
3.138LysTyr: 3.138 ± 0.798
0.0LysXaa: 0.0 ± 0.0
Leu
6.201LeuAla: 6.201 ± 0.631
0.747LeuCys: 0.747 ± 0.238
5.155LeuAsp: 5.155 ± 0.527
5.454LeuGlu: 5.454 ± 0.744
2.466LeuPhe: 2.466 ± 0.42
4.483LeuGly: 4.483 ± 0.504
2.466LeuHis: 2.466 ± 0.459
4.109LeuIle: 4.109 ± 0.578
6.949LeuLys: 6.949 ± 0.747
6.426LeuLeu: 6.426 ± 0.924
1.42LeuMet: 1.42 ± 0.448
4.857LeuAsn: 4.857 ± 0.588
3.362LeuPro: 3.362 ± 0.549
3.138LeuGln: 3.138 ± 0.545
2.914LeuArg: 2.914 ± 0.594
5.604LeuSer: 5.604 ± 0.739
5.23LeuThr: 5.23 ± 0.542
4.931LeuVal: 4.931 ± 0.652
0.971LeuTrp: 0.971 ± 0.34
2.316LeuTyr: 2.316 ± 0.365
0.0LeuXaa: 0.0 ± 0.0
Met
2.764MetAla: 2.764 ± 0.485
0.374MetCys: 0.374 ± 0.192
1.121MetAsp: 1.121 ± 0.261
1.569MetGlu: 1.569 ± 0.376
0.971MetPhe: 0.971 ± 0.269
1.718MetGly: 1.718 ± 0.331
0.224MetHis: 0.224 ± 0.135
1.718MetIle: 1.718 ± 0.312
2.241MetLys: 2.241 ± 0.355
1.943MetLeu: 1.943 ± 0.484
0.672MetMet: 0.672 ± 0.218
0.897MetAsn: 0.897 ± 0.239
1.046MetPro: 1.046 ± 0.415
1.121MetGln: 1.121 ± 0.313
1.345MetArg: 1.345 ± 0.393
2.167MetSer: 2.167 ± 0.435
0.822MetThr: 0.822 ± 0.249
1.718MetVal: 1.718 ± 0.347
0.448MetTrp: 0.448 ± 0.197
0.598MetTyr: 0.598 ± 0.212
0.0MetXaa: 0.0 ± 0.0
Asn
4.558AsnAla: 4.558 ± 0.806
0.448AsnCys: 0.448 ± 0.162
2.914AsnAsp: 2.914 ± 0.406
3.063AsnGlu: 3.063 ± 0.381
2.914AsnPhe: 2.914 ± 0.549
3.885AsnGly: 3.885 ± 0.803
0.897AsnHis: 0.897 ± 0.226
4.184AsnIle: 4.184 ± 0.888
4.035AsnLys: 4.035 ± 0.493
4.857AsnLeu: 4.857 ± 0.583
1.27AsnMet: 1.27 ± 0.255
3.736AsnAsn: 3.736 ± 0.491
2.316AsnPro: 2.316 ± 0.386
2.017AsnGln: 2.017 ± 0.405
2.615AsnArg: 2.615 ± 0.594
4.782AsnSer: 4.782 ± 0.905
5.604AsnThr: 5.604 ± 0.782
3.138AsnVal: 3.138 ± 1.004
0.672AsnTrp: 0.672 ± 0.193
2.167AsnTyr: 2.167 ± 0.428
0.0AsnXaa: 0.0 ± 0.0
Pro
1.718ProAla: 1.718 ± 0.293
0.299ProCys: 0.299 ± 0.147
2.092ProAsp: 2.092 ± 0.42
2.391ProGlu: 2.391 ± 0.545
1.42ProPhe: 1.42 ± 0.281
0.0ProGly: 0.0 ± 0.0
0.448ProHis: 0.448 ± 0.172
1.494ProIle: 1.494 ± 0.398
2.316ProLys: 2.316 ± 0.461
2.914ProLeu: 2.914 ± 0.498
0.822ProMet: 0.822 ± 0.243
1.868ProAsn: 1.868 ± 0.351
1.27ProPro: 1.27 ± 0.43
1.046ProGln: 1.046 ± 0.286
0.672ProArg: 0.672 ± 0.245
2.989ProSer: 2.989 ± 0.397
3.063ProThr: 3.063 ± 0.448
1.718ProVal: 1.718 ± 0.395
0.0ProTrp: 0.0 ± 0.0
1.195ProTyr: 1.195 ± 0.37
0.0ProXaa: 0.0 ± 0.0
Gln
2.914GlnAla: 2.914 ± 0.519
0.075GlnCys: 0.075 ± 0.068
1.868GlnAsp: 1.868 ± 0.293
2.241GlnGlu: 2.241 ± 0.56
1.195GlnPhe: 1.195 ± 0.266
1.793GlnGly: 1.793 ± 0.354
0.822GlnHis: 0.822 ± 0.26
2.092GlnIle: 2.092 ± 0.486
2.989GlnLys: 2.989 ± 0.556
3.213GlnLeu: 3.213 ± 0.534
1.046GlnMet: 1.046 ± 0.332
1.345GlnAsn: 1.345 ± 0.406
0.897GlnPro: 0.897 ± 0.254
1.345GlnGln: 1.345 ± 0.369
1.42GlnArg: 1.42 ± 0.369
2.914GlnSer: 2.914 ± 0.396
2.466GlnThr: 2.466 ± 0.41
2.167GlnVal: 2.167 ± 0.473
0.672GlnTrp: 0.672 ± 0.273
1.195GlnTyr: 1.195 ± 0.29
0.0GlnXaa: 0.0 ± 0.0
Arg
3.138ArgAla: 3.138 ± 0.53
0.0ArgCys: 0.0 ± 0.0
2.69ArgAsp: 2.69 ± 0.413
2.316ArgGlu: 2.316 ± 0.458
0.897ArgPhe: 0.897 ± 0.262
2.615ArgGly: 2.615 ± 0.464
0.672ArgHis: 0.672 ± 0.268
2.989ArgIle: 2.989 ± 0.811
3.138ArgLys: 3.138 ± 0.513
3.138ArgLeu: 3.138 ± 0.641
1.27ArgMet: 1.27 ± 0.467
2.466ArgAsn: 2.466 ± 0.442
1.046ArgPro: 1.046 ± 0.341
1.121ArgGln: 1.121 ± 0.274
1.793ArgArg: 1.793 ± 0.425
1.718ArgSer: 1.718 ± 0.322
1.569ArgThr: 1.569 ± 0.306
1.943ArgVal: 1.943 ± 0.392
0.672ArgTrp: 0.672 ± 0.276
2.391ArgTyr: 2.391 ± 0.473
0.0ArgXaa: 0.0 ± 0.0
Ser
6.052SerAla: 6.052 ± 1.052
0.598SerCys: 0.598 ± 0.199
3.437SerAsp: 3.437 ± 0.599
4.109SerGlu: 4.109 ± 0.753
2.54SerPhe: 2.54 ± 0.408
6.874SerGly: 6.874 ± 1.202
1.42SerHis: 1.42 ± 0.263
4.558SerIle: 4.558 ± 0.633
5.678SerLys: 5.678 ± 0.574
5.006SerLeu: 5.006 ± 0.66
1.195SerMet: 1.195 ± 0.314
4.109SerAsn: 4.109 ± 0.837
1.494SerPro: 1.494 ± 0.369
2.914SerGln: 2.914 ± 0.46
1.569SerArg: 1.569 ± 0.369
5.454SerSer: 5.454 ± 1.028
5.006SerThr: 5.006 ± 0.752
3.96SerVal: 3.96 ± 0.568
1.046SerTrp: 1.046 ± 0.305
2.391SerTyr: 2.391 ± 0.408
0.0SerXaa: 0.0 ± 0.0
Thr
6.575ThrAla: 6.575 ± 1.031
0.747ThrCys: 0.747 ± 0.336
4.483ThrAsp: 4.483 ± 0.46
4.184ThrGlu: 4.184 ± 0.646
2.914ThrPhe: 2.914 ± 0.563
6.351ThrGly: 6.351 ± 0.718
0.897ThrHis: 0.897 ± 0.233
4.259ThrIle: 4.259 ± 0.505
5.305ThrLys: 5.305 ± 0.587
4.931ThrLeu: 4.931 ± 0.584
0.971ThrMet: 0.971 ± 0.245
5.006ThrAsn: 5.006 ± 1.094
2.092ThrPro: 2.092 ± 0.35
2.391ThrGln: 2.391 ± 0.552
2.017ThrArg: 2.017 ± 0.386
5.828ThrSer: 5.828 ± 1.098
5.454ThrThr: 5.454 ± 1.358
4.109ThrVal: 4.109 ± 1.172
0.971ThrTrp: 0.971 ± 0.33
1.943ThrTyr: 1.943 ± 0.366
0.0ThrXaa: 0.0 ± 0.0
Val
6.052ValAla: 6.052 ± 1.54
0.672ValCys: 0.672 ± 0.278
3.063ValAsp: 3.063 ± 0.434
4.035ValGlu: 4.035 ± 0.548
2.391ValPhe: 2.391 ± 0.429
4.632ValGly: 4.632 ± 0.632
1.345ValHis: 1.345 ± 0.428
3.138ValIle: 3.138 ± 0.521
3.96ValLys: 3.96 ± 0.683
3.885ValLeu: 3.885 ± 0.527
1.195ValMet: 1.195 ± 0.305
4.259ValAsn: 4.259 ± 0.694
1.718ValPro: 1.718 ± 0.308
2.017ValGln: 2.017 ± 0.381
1.868ValArg: 1.868 ± 0.365
4.184ValSer: 4.184 ± 0.636
3.811ValThr: 3.811 ± 0.819
3.661ValVal: 3.661 ± 0.375
0.822ValTrp: 0.822 ± 0.224
1.569ValTyr: 1.569 ± 0.467
0.0ValXaa: 0.0 ± 0.0
Trp
1.121TrpAla: 1.121 ± 0.372
0.149TrpCys: 0.149 ± 0.118
1.121TrpAsp: 1.121 ± 0.232
0.672TrpGlu: 0.672 ± 0.223
0.672TrpPhe: 0.672 ± 0.223
0.448TrpGly: 0.448 ± 0.154
0.0TrpHis: 0.0 ± 0.0
0.897TrpIle: 0.897 ± 0.267
1.046TrpLys: 1.046 ± 0.287
1.943TrpLeu: 1.943 ± 0.395
0.149TrpMet: 0.149 ± 0.117
1.121TrpAsn: 1.121 ± 0.297
0.0TrpPro: 0.0 ± 0.0
0.672TrpGln: 0.672 ± 0.257
0.374TrpArg: 0.374 ± 0.209
0.747TrpSer: 0.747 ± 0.241
1.121TrpThr: 1.121 ± 0.34
0.971TrpVal: 0.971 ± 0.247
0.149TrpTrp: 0.149 ± 0.101
0.374TrpTyr: 0.374 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.644TyrAla: 1.644 ± 0.311
0.523TyrCys: 0.523 ± 0.194
1.718TyrAsp: 1.718 ± 0.266
1.718TyrGlu: 1.718 ± 0.344
1.27TyrPhe: 1.27 ± 0.299
1.868TyrGly: 1.868 ± 0.401
0.971TyrHis: 0.971 ± 0.303
2.615TyrIle: 2.615 ± 0.505
3.661TyrLys: 3.661 ± 0.612
2.914TyrLeu: 2.914 ± 0.453
0.897TyrMet: 0.897 ± 0.202
2.316TyrAsn: 2.316 ± 0.31
0.747TyrPro: 0.747 ± 0.221
1.644TyrGln: 1.644 ± 0.345
1.046TyrArg: 1.046 ± 0.206
2.241TyrSer: 2.241 ± 0.313
3.288TyrThr: 3.288 ± 0.589
2.092TyrVal: 2.092 ± 0.548
0.747TyrTrp: 0.747 ± 0.212
1.121TyrTyr: 1.121 ± 0.283
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13385 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski