Amino acid dipeptide frequency for Staphylococcus phage P954

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
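The counting described above can be sketched as follows. This is a minimal illustration, not the code used by Proteome-pI: the function names, the one-letter input alphabet, the toy sequences, and the choice to normalize by the total number of overlapping dipeptides are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 of 400 possible dipeptides, i.e. 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Residues outside the 20 standard letters are mapped to 'X' (Xaa).
    Pairs are counted within each protein, never across protein boundaries.
    Normalizing by the total number of pairs is an assumption of this sketch.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy proteome: 5 + 3 = 8 overlapping dipeptides in total.
toy_freqs = dipeptide_per_mille(["MKKLLE", "MKEL"])
```

With real data, `classify(1.749)` would label the AlaAla value from the table as "under" and `classify(3.815)` would label AlaGlu as "over", relative to the 2.5‰ uniform expectation.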

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
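As a small worked example of the unit conversion, the snippet below turns one table value into a fraction and a percentage. The helper name is hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala_per_mille = 1.749  # AlaAla entry from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala_per_mille)
ala_ala_percent = ala_ala_fraction * 100

# Formatted for reading: fraction with 6 decimals, percent with 4 decimals.
print(f"{ala_ala_fraction:.6f} = {ala_ala_percent:.4f}%")
```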

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.749 ± 0.653
AlaCys: 0.318 ± 0.167
AlaAsp: 2.782 ± 0.435
AlaGlu: 3.815 ± 0.6
AlaPhe: 2.464 ± 0.645
AlaGly: 3.179 ± 0.534
AlaHis: 0.954 ± 0.308
AlaIle: 5.564 ± 0.653
AlaLys: 4.848 ± 0.443
AlaLeu: 4.53 ± 0.663
AlaMet: 1.749 ± 0.431
AlaAsn: 2.384 ± 0.491
AlaPro: 1.749 ± 0.363
AlaGln: 2.941 ± 0.544
AlaArg: 2.702 ± 0.47
AlaSer: 3.577 ± 0.487
AlaThr: 3.338 ± 0.67
AlaVal: 3.02 ± 0.567
AlaTrp: 0.715 ± 0.255
AlaTyr: 2.623 ± 0.436
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.397 ± 0.174
CysCys: 0.0 ± 0.0
CysAsp: 0.079 ± 0.095
CysGlu: 0.318 ± 0.183
CysPhe: 0.318 ± 0.17
CysGly: 0.159 ± 0.108
CysHis: 0.079 ± 0.082
CysIle: 0.556 ± 0.208
CysLys: 0.556 ± 0.216
CysLeu: 0.318 ± 0.166
CysMet: 0.238 ± 0.176
CysAsn: 0.477 ± 0.213
CysPro: 0.238 ± 0.133
CysGln: 0.159 ± 0.12
CysArg: 0.318 ± 0.167
CysSer: 0.238 ± 0.151
CysThr: 0.318 ± 0.133
CysVal: 0.159 ± 0.098
CysTrp: 0.079 ± 0.092
CysTyr: 0.318 ± 0.166
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.497 ± 0.449
AspCys: 0.477 ± 0.224
AspAsp: 4.451 ± 0.799
AspGlu: 4.769 ± 0.69
AspPhe: 3.497 ± 0.514
AspGly: 5.087 ± 0.784
AspHis: 0.556 ± 0.21
AspIle: 5.325 ± 0.606
AspLys: 5.881 ± 0.839
AspLeu: 5.246 ± 0.59
AspMet: 2.225 ± 0.428
AspAsn: 4.133 ± 0.62
AspPro: 1.987 ± 0.422
AspGln: 1.033 ± 0.26
AspArg: 2.384 ± 0.542
AspSer: 3.418 ± 0.595
AspThr: 2.702 ± 0.531
AspVal: 4.053 ± 0.487
AspTrp: 0.715 ± 0.249
AspTyr: 3.497 ± 0.638
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.292 ± 0.674
GluCys: 0.477 ± 0.174
GluAsp: 4.769 ± 0.511
GluGlu: 7.392 ± 0.982
GluPhe: 3.656 ± 0.462
GluGly: 3.02 ± 0.533
GluHis: 1.033 ± 0.238
GluIle: 7.233 ± 0.852
GluLys: 7.153 ± 0.844
GluLeu: 7.948 ± 0.99
GluMet: 2.464 ± 0.441
GluAsn: 5.007 ± 0.632
GluPro: 1.51 ± 0.301
GluGln: 3.179 ± 0.475
GluArg: 4.848 ± 0.609
GluSer: 3.656 ± 0.63
GluThr: 3.815 ± 0.95
GluVal: 5.166 ± 0.552
GluTrp: 1.351 ± 0.347
GluTyr: 4.212 ± 0.591
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.305 ± 0.516
PheCys: 0.159 ± 0.113
PheAsp: 3.418 ± 0.491
PheGlu: 3.577 ± 0.444
PhePhe: 1.51 ± 0.45
PheGly: 2.782 ± 0.444
PheHis: 0.397 ± 0.158
PheIle: 3.418 ± 0.471
PheLys: 4.133 ± 0.425
PheLeu: 2.941 ± 0.452
PheMet: 1.351 ± 0.425
PheAsn: 3.179 ± 0.415
PhePro: 1.033 ± 0.295
PheGln: 0.556 ± 0.205
PheArg: 1.987 ± 0.414
PheSer: 3.02 ± 0.741
PheThr: 2.941 ± 0.461
PheVal: 2.623 ± 0.425
PheTrp: 0.238 ± 0.146
PheTyr: 1.907 ± 0.511
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.179 ± 0.618
GlyCys: 0.477 ± 0.2
GlyAsp: 3.656 ± 0.646
GlyGlu: 3.815 ± 0.75
GlyPhe: 2.623 ± 0.464
GlyGly: 3.974 ± 1.083
GlyHis: 1.033 ± 0.226
GlyIle: 5.007 ± 0.883
GlyLys: 6.279 ± 0.7
GlyLeu: 5.564 ± 0.943
GlyMet: 0.954 ± 0.24
GlyAsn: 2.702 ± 0.388
GlyPro: 1.192 ± 0.368
GlyGln: 1.828 ± 0.42
GlyArg: 2.941 ± 0.485
GlySer: 2.464 ± 0.434
GlyThr: 2.543 ± 0.424
GlyVal: 3.577 ± 0.661
GlyTrp: 0.954 ± 0.276
GlyTyr: 2.782 ± 0.47
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.874 ± 0.284
HisCys: 0.0 ± 0.0
HisAsp: 1.033 ± 0.281
HisGlu: 1.113 ± 0.309
HisPhe: 1.351 ± 0.31
HisGly: 0.795 ± 0.296
HisHis: 0.159 ± 0.1
HisIle: 1.51 ± 0.33
HisLys: 1.272 ± 0.297
HisLeu: 0.874 ± 0.205
HisMet: 0.318 ± 0.159
HisAsn: 0.874 ± 0.265
HisPro: 0.477 ± 0.175
HisGln: 0.715 ± 0.221
HisArg: 0.397 ± 0.158
HisSer: 0.636 ± 0.2
HisThr: 0.795 ± 0.235
HisVal: 0.874 ± 0.252
HisTrp: 0.0 ± 0.0
HisTyr: 1.033 ± 0.274
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.166 ± 0.935
IleCys: 0.397 ± 0.152
IleAsp: 6.517 ± 0.845
IleGlu: 7.074 ± 0.667
IlePhe: 3.259 ± 0.494
IleGly: 4.053 ± 0.611
IleHis: 1.669 ± 0.431
IleIle: 4.53 ± 0.541
IleLys: 8.186 ± 0.717
IleLeu: 3.974 ± 0.503
IleMet: 1.51 ± 0.372
IleAsn: 5.166 ± 0.753
IlePro: 1.907 ± 0.459
IleGln: 3.179 ± 0.537
IleArg: 2.861 ± 0.533
IleSer: 5.166 ± 0.641
IleThr: 4.212 ± 0.559
IleVal: 5.166 ± 0.541
IleTrp: 1.033 ± 0.463
IleTyr: 2.702 ± 0.684
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.961 ± 0.669
LysCys: 0.238 ± 0.144
LysAsp: 5.643 ± 0.528
LysGlu: 9.14 ± 1.013
LysPhe: 3.338 ± 0.546
LysGly: 6.358 ± 0.926
LysHis: 1.828 ± 0.407
LysIle: 6.676 ± 0.687
LysLys: 7.55 ± 0.833
LysLeu: 8.584 ± 0.912
LysMet: 2.305 ± 0.389
LysAsn: 5.961 ± 0.917
LysPro: 2.225 ± 0.463
LysGln: 4.133 ± 0.472
LysArg: 4.133 ± 0.537
LysSer: 4.689 ± 0.642
LysThr: 5.246 ± 0.679
LysVal: 6.756 ± 0.659
LysTrp: 1.033 ± 0.318
LysTyr: 4.053 ± 0.616
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.577 ± 0.459
LeuCys: 0.715 ± 0.228
LeuAsp: 4.53 ± 0.546
LeuGlu: 6.835 ± 0.838
LeuPhe: 3.656 ± 0.488
LeuGly: 3.259 ± 0.724
LeuHis: 0.954 ± 0.251
LeuIle: 5.564 ± 0.775
LeuLys: 9.537 ± 0.941
LeuLeu: 6.835 ± 0.85
LeuMet: 2.146 ± 0.448
LeuAsn: 5.722 ± 0.643
LeuPro: 1.987 ± 0.443
LeuGln: 3.179 ± 0.366
LeuArg: 3.735 ± 0.702
LeuSer: 6.12 ± 0.705
LeuThr: 4.848 ± 0.57
LeuVal: 4.371 ± 0.569
LeuTrp: 0.636 ± 0.241
LeuTyr: 3.418 ± 0.672
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.431 ± 0.322
MetCys: 0.079 ± 0.073
MetAsp: 1.59 ± 0.392
MetGlu: 1.351 ± 0.352
MetPhe: 0.874 ± 0.215
MetGly: 1.51 ± 0.546
MetHis: 0.159 ± 0.105
MetIle: 2.146 ± 0.418
MetLys: 1.907 ± 0.354
MetLeu: 2.464 ± 0.416
MetMet: 0.954 ± 0.271
MetAsn: 1.828 ± 0.346
MetPro: 1.033 ± 0.264
MetGln: 1.033 ± 0.272
MetArg: 1.51 ± 0.279
MetSer: 1.749 ± 0.31
MetThr: 1.669 ± 0.403
MetVal: 1.033 ± 0.309
MetTrp: 0.636 ± 0.227
MetTyr: 0.477 ± 0.196
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.735 ± 0.561
AsnCys: 0.318 ± 0.223
AsnAsp: 4.769 ± 0.607
AsnGlu: 4.61 ± 0.657
AsnPhe: 1.669 ± 0.578
AsnGly: 4.848 ± 0.55
AsnHis: 0.795 ± 0.261
AsnIle: 3.974 ± 0.576
AsnLys: 7.63 ± 1.138
AsnLeu: 4.133 ± 0.445
AsnMet: 1.431 ± 0.257
AsnAsn: 4.451 ± 0.678
AsnPro: 2.066 ± 0.387
AsnGln: 2.225 ± 0.372
AsnArg: 2.941 ± 0.563
AsnSer: 3.815 ± 0.491
AsnThr: 3.894 ± 0.502
AsnVal: 3.259 ± 0.696
AsnTrp: 0.954 ± 0.412
AsnTyr: 3.02 ± 0.525
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.033 ± 0.285
ProCys: 0.079 ± 0.075
ProAsp: 1.192 ± 0.252
ProGlu: 2.861 ± 0.471
ProPhe: 1.51 ± 0.346
ProGly: 0.795 ± 0.23
ProHis: 0.477 ± 0.16
ProIle: 2.543 ± 0.402
ProLys: 3.1 ± 0.607
ProLeu: 1.669 ± 0.355
ProMet: 0.636 ± 0.2
ProAsn: 1.431 ± 0.256
ProPro: 1.192 ± 0.246
ProGln: 0.795 ± 0.251
ProArg: 1.033 ± 0.21
ProSer: 1.828 ± 0.356
ProThr: 1.272 ± 0.28
ProVal: 1.351 ± 0.293
ProTrp: 0.159 ± 0.115
ProTyr: 0.874 ± 0.229
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.02 ± 0.452
GlnCys: 0.318 ± 0.193
GlnAsp: 2.066 ± 0.45
GlnGlu: 2.941 ± 0.462
GlnPhe: 1.431 ± 0.291
GlnGly: 1.749 ± 0.334
GlnHis: 0.795 ± 0.231
GlnIle: 2.464 ± 0.354
GlnLys: 2.782 ± 0.486
GlnLeu: 3.338 ± 0.527
GlnMet: 0.954 ± 0.256
GlnAsn: 3.179 ± 0.521
GlnPro: 1.113 ± 0.25
GlnGln: 1.749 ± 0.383
GlnArg: 1.907 ± 0.294
GlnSer: 1.51 ± 0.464
GlnThr: 1.51 ± 0.325
GlnVal: 1.987 ± 0.439
GlnTrp: 0.397 ± 0.161
GlnTyr: 1.272 ± 0.284
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.384 ± 0.329
ArgCys: 0.238 ± 0.144
ArgAsp: 2.782 ± 0.5
ArgGlu: 3.894 ± 0.534
ArgPhe: 2.225 ± 0.393
ArgGly: 2.225 ± 0.429
ArgHis: 0.556 ± 0.14
ArgIle: 3.338 ± 0.582
ArgLys: 3.259 ± 0.546
ArgLeu: 4.371 ± 0.528
ArgMet: 1.272 ± 0.24
ArgAsn: 2.861 ± 0.411
ArgPro: 1.192 ± 0.317
ArgGln: 1.669 ± 0.329
ArgArg: 2.464 ± 0.443
ArgSer: 1.51 ± 0.388
ArgThr: 2.861 ± 0.487
ArgVal: 2.225 ± 0.345
ArgTrp: 0.636 ± 0.257
ArgTyr: 2.543 ± 0.552
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.292 ± 0.605
SerCys: 0.318 ± 0.169
SerAsp: 4.928 ± 0.743
SerGlu: 5.246 ± 0.791
SerPhe: 2.543 ± 0.525
SerGly: 2.941 ± 0.734
SerHis: 0.874 ± 0.205
SerIle: 4.848 ± 0.905
SerLys: 4.848 ± 0.676
SerLeu: 4.451 ± 0.557
SerMet: 1.033 ± 0.228
SerAsn: 4.371 ± 0.497
SerPro: 0.874 ± 0.225
SerGln: 1.749 ± 0.386
SerArg: 1.59 ± 0.279
SerSer: 2.305 ± 0.42
SerThr: 3.179 ± 0.581
SerVal: 2.543 ± 0.476
SerTrp: 0.159 ± 0.084
SerTyr: 2.543 ± 0.393
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.384 ± 0.574
ThrCys: 0.238 ± 0.151
ThrAsp: 3.735 ± 0.809
ThrGlu: 4.053 ± 0.566
ThrPhe: 2.066 ± 0.357
ThrGly: 4.053 ± 0.847
ThrHis: 1.033 ± 0.258
ThrIle: 3.656 ± 0.469
ThrLys: 4.61 ± 0.578
ThrLeu: 4.928 ± 0.574
ThrMet: 1.033 ± 0.21
ThrAsn: 3.338 ± 0.546
ThrPro: 2.146 ± 0.316
ThrGln: 1.669 ± 0.381
ThrArg: 2.225 ± 0.469
ThrSer: 3.338 ± 0.667
ThrThr: 2.941 ± 0.502
ThrVal: 3.735 ± 0.51
ThrTrp: 0.715 ± 0.188
ThrTyr: 2.384 ± 0.504
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.497 ± 0.568
ValCys: 0.079 ± 0.075
ValAsp: 3.259 ± 0.572
ValGlu: 4.053 ± 0.682
ValPhe: 2.543 ± 0.609
ValGly: 3.894 ± 0.522
ValHis: 0.795 ± 0.213
ValIle: 4.61 ± 0.58
ValLys: 6.597 ± 0.718
ValLeu: 5.007 ± 0.573
ValMet: 1.59 ± 0.33
ValAsn: 4.053 ± 0.542
ValPro: 1.033 ± 0.247
ValGln: 1.828 ± 0.612
ValArg: 2.464 ± 0.387
ValSer: 2.782 ± 0.437
ValThr: 3.179 ± 0.573
ValVal: 3.179 ± 0.573
ValTrp: 0.556 ± 0.232
ValTyr: 3.1 ± 0.412
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.318 ± 0.185
TrpCys: 0.079 ± 0.096
TrpAsp: 0.715 ± 0.234
TrpGlu: 1.033 ± 0.26
TrpPhe: 0.715 ± 0.216
TrpGly: 0.477 ± 0.196
TrpHis: 0.318 ± 0.173
TrpIle: 1.113 ± 0.24
TrpLys: 1.113 ± 0.264
TrpLeu: 0.954 ± 0.276
TrpMet: 0.397 ± 0.17
TrpAsn: 0.715 ± 0.268
TrpPro: 0.079 ± 0.07
TrpGln: 0.556 ± 0.192
TrpArg: 0.397 ± 0.166
TrpSer: 0.874 ± 0.28
TrpThr: 0.556 ± 0.179
TrpVal: 0.636 ± 0.198
TrpTrp: 0.079 ± 0.064
TrpTyr: 0.636 ± 0.216
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.907 ± 0.392
TyrCys: 0.318 ± 0.154
TyrAsp: 3.02 ± 0.513
TyrGlu: 4.292 ± 0.676
TyrPhe: 2.225 ± 0.568
TyrGly: 2.384 ± 0.454
TyrHis: 0.636 ± 0.227
TyrIle: 3.656 ± 0.699
TyrLys: 4.689 ± 0.694
TyrLeu: 3.497 ± 0.475
TyrMet: 0.795 ± 0.239
TyrAsn: 2.702 ± 0.314
TyrPro: 0.795 ± 0.235
TyrGln: 2.305 ± 0.466
TyrArg: 1.669 ± 0.369
TyrSer: 2.941 ± 0.548
TyrThr: 2.464 ± 0.421
TyrVal: 2.384 ± 0.375
TyrTrp: 0.715 ± 0.292
TyrTyr: 1.272 ± 0.364
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (12583 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
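A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of those estimates serves as the error. This is an illustration under stated assumptions, not the Proteome-pI implementation: the function names, the use of the sample standard deviation as the error measure, the normalization by total dipeptide count, and the fixed seed are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Proteins, not residues, are the resampling unit. Using the sample
    standard deviation of the replicates is an assumption of this sketch.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy proteome, only to show the call.
toy_error = bootstrap_error(["MKKLLE", "MKEL", "AAAA"], "MK")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.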

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski