Amino acid dipeptide frequency for Staphylococcus phage phiSP15-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
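As a rough illustration of how such frequencies could be computed, here is a minimal Python sketch. The function name dipeptide_per_mille and the one-letter input sequences are hypothetical, and dividing by the total number of residues is an assumption; the page does not state the exact normalisation used by the database.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: counts are divided by the total number of residues.
    """
    counts = Counter()
    total_residues = 0
    for seq in proteins:
        total_residues += len(seq)
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    if total_residues == 0:
        return {}
    return {pair: 1000.0 * n / total_residues for pair, n in counts.items()}


# Toy input, not taken from the proteome described on this page.
freqs = dipeptide_per_mille(["MKKL", "AK"])
```

For the toy input, the pairs MK, KK, KL and AK each occur once among 6 residues.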

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
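The expectation above can be sketched in Python as follows. The function names are hypothetical, and using the uniform expectation as the exact red/blue threshold is an assumption; the page only says "more than expected" and "underrepresented". Values are in per mille, so 0.25% corresponds to 2.5‰.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"   # shown in red on the original page
    if value_per_mille < expected:
        return "under"  # shown in blue on the original page
    return "expected"


# Two values taken from the table: LysGlu = 7.301, AlaCys = 0.145.
lys_glu = classify(7.301)
ala_cys = classify(0.145)
```

With 20 amino acids the expectation is 2.5‰; counting Xaa as a 21st letter lowers it to 1000/441, roughly 2.27‰.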

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Values are in ‰ (frequency ± error).

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.325 ± 0.972 | 0.145 ± 0.101 | 3.109 ± 0.423 | 3.615 ± 0.682 | 2.53 ± 0.557 | 2.819 ± 0.607 | 0.651 ± 0.207 | 3.976 ± 0.439 | 4.916 ± 0.588 | 5.35 ± 0.741 | 1.952 ± 0.437 | 3.109 ± 0.535 | 1.157 ± 0.261 | 1.88 ± 0.32 | 2.458 ± 0.426 | 3.398 ± 0.522 | 3.325 ± 0.348 | 3.181 ± 0.466 | 0.795 ± 0.232 | 2.747 ± 0.536 | 0.0 ± 0.0 |
| Cys | 0.217 ± 0.108 | 0.145 ± 0.146 | 0.217 ± 0.141 | 0.434 ± 0.214 | 0.289 ± 0.135 | 0.361 ± 0.175 | 0.361 ± 0.221 | 0.506 ± 0.195 | 0.434 ± 0.177 | 0.578 ± 0.229 | 0.145 ± 0.105 | 0.072 ± 0.073 | 0.361 ± 0.169 | 0.361 ± 0.176 | 0.506 ± 0.2 | 0.361 ± 0.135 | 0.072 ± 0.054 | 0.506 ± 0.189 | 0.0 ± 0.0 | 0.072 ± 0.054 | 0.0 ± 0.0 |
| Asp | 3.398 ± 0.684 | 0.217 ± 0.157 | 4.988 ± 0.887 | 6.506 ± 0.81 | 3.253 ± 0.484 | 5.205 ± 0.675 | 0.795 ± 0.195 | 5.205 ± 0.593 | 6.506 ± 0.732 | 5.277 ± 0.706 | 2.53 ± 0.37 | 4.265 ± 0.671 | 1.374 ± 0.317 | 1.084 ± 0.315 | 3.181 ± 0.541 | 3.398 ± 0.509 | 3.181 ± 0.4 | 4.337 ± 0.514 | 0.723 ± 0.256 | 3.325 ± 0.485 | 0.0 ± 0.0 |
| Glu | 4.121 ± 0.573 | 0.795 ± 0.293 | 4.337 ± 0.683 | 5.783 ± 1.012 | 3.398 ± 0.507 | 3.181 ± 0.517 | 1.301 ± 0.353 | 4.265 ± 0.753 | 5.133 ± 0.752 | 6.94 ± 0.639 | 2.313 ± 0.416 | 4.843 ± 0.768 | 1.518 ± 0.378 | 3.398 ± 0.623 | 4.121 ± 0.562 | 4.048 ± 0.517 | 3.759 ± 0.681 | 5.711 ± 0.797 | 1.446 ± 0.292 | 3.109 ± 0.585 | 0.0 ± 0.0 |
| Phe | 2.747 ± 0.546 | 0.578 ± 0.17 | 2.892 ± 0.556 | 2.241 ± 0.517 | 2.096 ± 0.422 | 3.325 ± 0.525 | 0.506 ± 0.192 | 4.048 ± 0.557 | 5.133 ± 0.542 | 2.819 ± 0.639 | 0.795 ± 0.22 | 2.892 ± 0.503 | 0.795 ± 0.203 | 0.867 ± 0.212 | 2.024 ± 0.387 | 2.096 ± 0.444 | 3.325 ± 0.562 | 3.181 ± 0.445 | 0.723 ± 0.215 | 1.735 ± 0.35 | 0.0 ± 0.0 |
| Gly | 3.542 ± 0.526 | 0.145 ± 0.094 | 4.048 ± 0.411 | 3.181 ± 0.434 | 2.892 ± 0.501 | 4.265 ± 1.19 | 1.518 ± 0.363 | 4.482 ± 1.005 | 5.856 ± 0.712 | 5.566 ± 0.978 | 1.229 ± 0.263 | 3.759 ± 0.46 | 1.229 ± 0.415 | 2.169 ± 0.4 | 3.325 ± 0.652 | 2.602 ± 0.566 | 3.542 ± 0.479 | 3.976 ± 0.532 | 0.651 ± 0.211 | 2.241 ± 0.336 | 0.0 ± 0.0 |
| His | 0.578 ± 0.233 | 0.361 ± 0.148 | 0.94 ± 0.319 | 1.735 ± 0.303 | 0.723 ± 0.211 | 1.012 ± 0.277 | 1.012 ± 0.294 | 1.157 ± 0.314 | 1.012 ± 0.264 | 2.024 ± 0.363 | 0.0 ± 0.0 | 1.012 ± 0.249 | 0.506 ± 0.156 | 0.434 ± 0.16 | 0.867 ± 0.314 | 1.446 ± 0.372 | 1.374 ± 0.306 | 1.012 ± 0.319 | 0.217 ± 0.131 | 0.94 ± 0.347 | 0.0 ± 0.0 |
| Ile | 3.904 ± 0.521 | 0.217 ± 0.121 | 5.133 ± 0.678 | 6.0 ± 0.766 | 3.253 ± 0.563 | 3.398 ± 0.787 | 1.518 ± 0.345 | 5.783 ± 0.747 | 7.229 ± 0.723 | 4.699 ± 0.522 | 1.663 ± 0.384 | 4.699 ± 0.567 | 2.096 ± 0.424 | 2.53 ± 0.419 | 2.747 ± 0.483 | 4.699 ± 0.626 | 3.759 ± 0.523 | 4.048 ± 0.576 | 0.867 ± 0.315 | 2.675 ± 0.551 | 0.0 ± 0.0 |
| Lys | 4.554 ± 0.77 | 0.217 ± 0.132 | 7.085 ± 0.671 | 7.301 ± 1.006 | 3.325 ± 0.416 | 6.0 ± 0.646 | 2.241 ± 0.483 | 5.494 ± 0.516 | 6.795 ± 0.974 | 6.145 ± 0.587 | 2.602 ± 0.378 | 5.711 ± 0.652 | 2.096 ± 0.381 | 4.265 ± 0.474 | 5.277 ± 0.572 | 4.121 ± 0.685 | 6.072 ± 0.76 | 5.928 ± 0.522 | 1.301 ± 0.369 | 4.048 ± 0.556 | 0.0 ± 0.0 |
| Leu | 4.193 ± 0.625 | 0.94 ± 0.246 | 6.578 ± 0.547 | 4.843 ± 0.627 | 3.109 ± 0.557 | 3.759 ± 0.873 | 1.374 ± 0.349 | 6.0 ± 0.63 | 6.651 ± 0.624 | 5.494 ± 0.691 | 2.313 ± 0.522 | 4.916 ± 0.428 | 3.109 ± 0.47 | 3.976 ± 0.65 | 3.687 ± 0.555 | 5.205 ± 0.437 | 4.048 ± 0.424 | 4.121 ± 0.515 | 1.157 ± 0.611 | 3.253 ± 0.526 | 0.0 ± 0.0 |
| Met | 1.88 ± 0.357 | 0.0 ± 0.0 | 1.518 ± 0.304 | 1.663 ± 0.278 | 1.012 ± 0.294 | 0.651 ± 0.258 | 0.434 ± 0.145 | 1.663 ± 0.342 | 3.687 ± 0.494 | 2.024 ± 0.447 | 0.723 ± 0.193 | 1.446 ± 0.303 | 1.157 ± 0.299 | 1.084 ± 0.283 | 1.374 ± 0.327 | 1.663 ± 0.346 | 2.386 ± 0.296 | 0.795 ± 0.219 | 0.289 ± 0.152 | 0.723 ± 0.195 | 0.0 ± 0.0 |
| Asn | 3.687 ± 0.415 | 0.145 ± 0.105 | 4.265 ± 0.679 | 4.554 ± 0.924 | 2.747 ± 0.465 | 4.699 ± 0.528 | 0.867 ± 0.333 | 3.325 ± 0.468 | 6.072 ± 0.596 | 4.554 ± 0.575 | 1.157 ± 0.265 | 4.121 ± 0.63 | 2.096 ± 0.429 | 2.458 ± 0.431 | 2.747 ± 0.4 | 3.904 ± 0.593 | 3.831 ± 0.334 | 3.904 ± 0.574 | 1.084 ± 0.314 | 2.602 ± 0.412 | 0.0 ± 0.0 |
| Pro | 0.867 ± 0.262 | 0.145 ± 0.104 | 1.446 ± 0.369 | 2.313 ± 0.438 | 1.301 ± 0.326 | 1.59 ± 0.28 | 0.434 ± 0.177 | 1.952 ± 0.354 | 2.241 ± 0.344 | 2.241 ± 0.434 | 0.434 ± 0.155 | 1.229 ± 0.232 | 0.506 ± 0.197 | 1.229 ± 0.347 | 1.157 ± 0.334 | 2.169 ± 0.383 | 1.446 ± 0.386 | 2.024 ± 0.346 | 0.217 ± 0.136 | 1.446 ± 0.359 | 0.0 ± 0.0 |
| Gln | 2.096 ± 0.34 | 0.289 ± 0.147 | 1.952 ± 0.278 | 2.675 ± 0.55 | 1.157 ± 0.312 | 2.096 ± 0.351 | 0.361 ± 0.139 | 2.747 ± 0.493 | 2.675 ± 0.32 | 3.398 ± 0.379 | 0.723 ± 0.27 | 2.313 ± 0.486 | 0.94 ± 0.366 | 1.952 ± 0.576 | 1.735 ± 0.347 | 2.096 ± 0.37 | 2.386 ± 0.442 | 1.952 ± 0.292 | 0.506 ± 0.165 | 1.735 ± 0.291 | 0.0 ± 0.0 |
| Arg | 1.807 ± 0.308 | 0.145 ± 0.1 | 3.109 ± 0.319 | 3.542 ± 0.433 | 2.386 ± 0.395 | 2.53 ± 0.375 | 0.651 ± 0.217 | 4.048 ± 0.494 | 3.904 ± 0.542 | 4.482 ± 0.459 | 1.518 ± 0.287 | 2.964 ± 0.392 | 1.518 ± 0.307 | 1.229 ± 0.334 | 1.952 ± 0.479 | 2.675 ± 0.421 | 2.096 ± 0.499 | 2.602 ± 0.404 | 0.361 ± 0.196 | 2.386 ± 0.288 | 0.0 ± 0.0 |
| Ser | 3.109 ± 0.437 | 0.361 ± 0.147 | 4.482 ± 0.477 | 4.048 ± 0.662 | 3.109 ± 0.489 | 4.337 ± 1.068 | 1.012 ± 0.278 | 4.554 ± 0.765 | 5.928 ± 0.571 | 3.831 ± 0.563 | 2.241 ± 0.596 | 4.554 ± 0.569 | 0.867 ± 0.299 | 1.88 ± 0.421 | 1.518 ± 0.379 | 4.482 ± 0.595 | 3.47 ± 0.468 | 3.47 ± 0.36 | 0.651 ± 0.231 | 2.024 ± 0.412 | 0.0 ± 0.0 |
| Thr | 3.398 ± 0.584 | 0.0 ± 0.0 | 3.759 ± 0.391 | 3.904 ± 0.694 | 3.181 ± 0.518 | 4.048 ± 0.771 | 1.301 ± 0.429 | 4.482 ± 0.695 | 4.771 ± 0.573 | 4.843 ± 0.687 | 0.651 ± 0.201 | 3.036 ± 0.414 | 2.53 ± 0.348 | 1.663 ± 0.375 | 2.096 ± 0.413 | 3.687 ± 0.475 | 3.542 ± 0.547 | 3.831 ± 0.659 | 0.867 ± 0.281 | 2.747 ± 0.605 | 0.0 ± 0.0 |
| Val | 3.976 ± 0.454 | 0.578 ± 0.213 | 4.916 ± 0.505 | 4.699 ± 0.74 | 2.096 ± 0.389 | 3.759 ± 0.52 | 0.867 ± 0.234 | 3.831 ± 0.606 | 6.723 ± 1.217 | 4.121 ± 0.412 | 1.59 ± 0.256 | 4.193 ± 0.598 | 1.446 ± 0.385 | 1.59 ± 0.293 | 2.386 ± 0.276 | 4.337 ± 0.508 | 3.181 ± 0.438 | 4.41 ± 0.686 | 0.651 ± 0.22 | 2.747 ± 0.479 | 0.0 ± 0.0 |
| Trp | 0.651 ± 0.2 | 0.217 ± 0.165 | 0.795 ± 0.227 | 0.723 ± 0.221 | 0.651 ± 0.181 | 0.434 ± 0.132 | 0.217 ± 0.117 | 0.795 ± 0.242 | 1.229 ± 0.323 | 0.94 ± 0.227 | 0.289 ± 0.154 | 1.446 ± 0.737 | 0.361 ± 0.201 | 0.578 ± 0.159 | 0.506 ± 0.17 | 0.867 ± 0.334 | 0.651 ± 0.208 | 1.012 ± 0.251 | 0.072 ± 0.078 | 0.578 ± 0.199 | 0.0 ± 0.0 |
| Tyr | 2.386 ± 0.435 | 0.361 ± 0.16 | 3.181 ± 0.498 | 3.831 ± 0.645 | 2.313 ± 0.527 | 2.747 ± 0.384 | 0.94 ± 0.306 | 2.602 ± 0.522 | 3.687 ± 0.771 | 3.398 ± 0.421 | 1.229 ± 0.318 | 2.241 ± 0.41 | 0.723 ± 0.213 | 1.157 ± 0.257 | 2.241 ± 0.474 | 2.675 ± 0.293 | 2.892 ± 0.674 | 2.096 ± 0.297 | 0.434 ± 0.167 | 2.096 ± 0.404 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 62 proteins (13834 amino acids)
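
To make the per mille unit concrete, the following small Python sketch converts a tabulated value into a plain fraction and an approximate occurrence count. It assumes the values were normalised by the total number of amino acids (13834), which the page does not state explicitly; the rounded table values happen to be consistent with that assumption. The function names are hypothetical.

```python
TOTAL_RESIDUES = 13834  # from the statistics line above


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def approx_count(value_per_mille, total=TOTAL_RESIDUES):
    """Approximate number of occurrences, assuming normalisation by `total`."""
    return round(per_mille_to_fraction(value_per_mille) * total)


ala_ala = approx_count(3.325)   # AlaAla in the table
cys_asn = approx_count(0.072)   # CysAsn, the smallest non-zero value
lys_glu = approx_count(7.301)   # LysGlu, the largest value
```

Under this assumption the smallest non-zero entry, 0.072‰, corresponds to a single occurrence in the whole proteome.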

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
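
A protein-level bootstrap of this kind might look roughly like the Python sketch below. It is not the database's actual code: the function name, the fixed seed, the use of the sample standard deviation as the error, and the normalisation by residue count are all assumptions made for illustration.

```python
import random
from statistics import stdev


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, so residues within a
    protein stay together. Assumes non-empty one-letter sequences.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        # Count overlapping occurrences of the pair within each protein.
        count = sum(
            1
            for seq in sample
            for i in range(len(seq) - 1)
            if seq[i:i + 2] == pair
        )
        residues = sum(len(seq) for seq in sample)
        estimates.append(1000.0 * count / residues)
    # Spread of the resampled estimates, taken here as the error.
    return stdev(estimates)


# Toy sequences, not taken from the proteome described on this page.
toy_error = bootstrap_error(["AKAK", "GGGG", "AKGG"], "AK")
```

Resampling whole proteins rather than individual residues keeps neighbouring residues together, so the estimate reflects variation between proteins.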

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski